WO2021213293A1 - Ubiquitous operating system oriented toward group intelligence perception - Google Patents

Ubiquitous operating system oriented toward group intelligence perception Download PDF

Info

Publication number
WO2021213293A1
WO2021213293A1 PCT/CN2021/087993 CN2021087993W WO2021213293A1 WO 2021213293 A1 WO2021213293 A1 WO 2021213293A1 CN 2021087993 W CN2021087993 W CN 2021087993W WO 2021213293 A1 WO2021213293 A1 WO 2021213293A1
Authority
WO
WIPO (PCT)
Prior art keywords
task
data
agent
tasks
state
Prior art date
Application number
PCT/CN2021/087993
Other languages
French (fr)
Chinese (zh)
Inventor
於志文
刘一萌
郭斌
廖佳豪
苏江宾
Original Assignee
西北工业大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 西北工业大学 filed Critical 西北工业大学
Publication of WO2021213293A1 publication Critical patent/WO2021213293A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3604Software analysis for verifying properties of programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/20Software design

Definitions

  • the present invention relates to the field of mobile computing and ubiquitous operating systems and the field of group intelligence perception, in particular to system-based
  • the more well-known systems include Common Sense, Ear-Phone, Chimera, Creekwatch, and PhotoCity.
  • group intelligence perception technology has also been extensively studied. For example, Wang et al. studied the problem of multi-task cooperative allocation in the application of group intelligence. Guo et al. put forward the challenge of optimizing the sensing data. F. Restuccia et al. studied the quality improvement methods of crowd perception data.
  • the incentive mechanism and privacy protection of crowdsourcing workers have become a subject of in-depth research. These applications and technologies are mostly used in scientific research, with the goal of collecting some kind of sensor data. The developed software can only complete a single task and cannot be reused, and cannot be migrated to other tasks.
  • the crowdsourcing platform usually only summarizes the task result data without further evaluation and analysis.
  • Some simple results filtering methods may be preset in the Qunzhi application, but because these methods are usually for specific types of tasks or data, it is difficult to generalize to other tasks or data types and cannot support complex data processing tasks (such as data semantic understanding). Wait).
  • the present invention provides a ubiquitous operating system oriented to group intelligence perception, specifically a ubiquitous operating system oriented to group intelligence, CrowdOS, through the in-depth understanding of the complex environment and diversified characteristics of group intelligence tasks Analysis, and then designed a set of comprehensive processing mechanism and core functional components based on the operating system framework.
  • CrowdOS runs as a middleware between the native operating system and the application layer of heterogeneous devices.
  • the present invention focuses on the core architecture of CrowdOS and its three core mechanisms, namely: task semantic analysis and user scheduling, system resource management and deep feedback interaction of task results.
  • the task publisher inputs the original task data through the smart terminal and submits it to the platform. After the task is captured by the platform, the task is analyzed and a unique task ID is assigned to the task; After entering the platform, perform task analysis and generate the corresponding task feature vector, and perform feature splicing with other known discrete features, and extract the characteristics through the task vector.
  • the characteristics include but are not limited to the task type, the number of participants required, and the task
  • the task of interest is executed and the collected perception data or design plan is uploaded to the ubiquitous operating system for group intelligence perception; when the data enters the ubiquitous operating system for group intelligence perception, the task, user, and process resources are performed Abstract and software definition, according to the description of task feature information, the ubiquitous operating system for group intelligence selects the task middleware to be used and summarizes the collected data; finally the results are returned to the task publisher, and the publisher performs the task results Evaluation and feedback, when the publisher receives the results, the life cycle of the group intelligence task ends.
  • CrowdOS runs between the native operating system and upper-layer applications, and includes the sensing end and the server.
  • the software carrier of the sensing end includes two types of equipment, the first type is Portable intelligent sensing devices with human-computer interaction functions.
  • the second category is fixed sensors deployed in the physical world;
  • CrowdOS adopts a cloud-side-to-end deployment method; the sensing end is deployed on various terminal sensing devices to collect data, information and services
  • the end is deployed on a cloud server or edge server to comprehensively manage system resources and data resources and respond to system operations in real time;
  • the publisher uploads the task to the CrowdOS-based system through the interactive function of the smart terminal, and the participant browses through the perception terminal and executes the tasks that have been published in the system; when the participant receives the task, the perception terminal Data collection begins; the system support layer obtains the current device status, then uniformly interface encapsulates the acquired sensor data and unifies the data transmission format, and then stores the data in the corresponding data structure through the network or Bluetooth transmission; A sensing device that does not require human-computer interaction. Once the device is activated by a task in the system and the device passes the task verification, the device will automatically collect and upload perception data according to predetermined rules;
  • the server-end provides comprehensive management services, which are deployed on server clusters, cloud servers or edge servers, including task pool modules (Taskpool modules), resource management modules (Resource management), and data management center modules (DM). center), internal and external interface module (I&E interface), knowledge base module (Knowledge base), system plug-in module (System Plugin), task result quality optimization (TRO) and joint storage and retrieval module (Storage and query) eight modules;
  • Qunzhi tasks are allocated to users in the platform; in the resource management module, various resources are defined by software and the comprehensive management of equipment, users, environment, and task process scheduling is completed; then enter the data management center module, which provides a large number of Classification and storage of heterogeneous data and fast retrieval functions.
  • the combined storage and retrieval module stores and processes the data collected through the sensing terminal, extracts useful information and transmits it to the knowledge base module; for the collected data, the quality of the task results is optimized The module optimizes data quality.
  • the server will feed back the final result to the task publisher on the sensing side through the network, thereby completing the circulation of the entire task message between the sensing side and the server;
  • the system plug-in module provides users with privacy protection, Credit evaluation and user incentives; in the internal and external interface modules, the internal interface is used to maintain and update the operating system, and the external interface is provided to third-party applications for invoking the software interface.
  • Agents include task agent Task-agent (TA), user agent User-agent (UA), Device-agent (DA), Environment-agent (EA) and Process-agent (PA); where TA contains detailed information about each task, including but not limited to task type, execution time, Location, collected data format, UA is an abstract description of the user in the system, and records the user’s information, including but not limited to the tasks posted and performed, and credit rating; DA is the description of the terminal equipment resources, recording the device’s information Type, current status information; EA abstracts the hardware and software environment resources of the current system itself, including but not limited to the current CPU, memory, storage usage, the number of users in the system and the total amount of available equipment; PA manages all existing systems in the current system Task process, including but not limited to process status, priority, scheduling strategy; Agents complete mutual real-time interaction and update through the semaphore defined in the internal structure after
  • the task in the task information extraction part, when the group intelligence task enters the system, the task is firstly analyzed by semantic analysis and feature extraction, and natural language analysis is performed on the received task. For tasks described by language, the system performs word segmentation processing. Then the system performs operations that do not distinguish between languages, and finally extracts mission-critical information, including but not limited to the task execution method, location, time, number of participants, the extracted task information and the discrete features obtained through rule click selection
  • mission-critical information including but not limited to the task execution method, location, time, number of participants, the extracted task information and the discrete features obtained through rule click selection
  • For splicing input the spliced features into a deep neural network for unified encoding, and output a high-dimensional task intermediate vector; finally, map the vector to the task-agent of the task through decoding, and complete the conversion process from task vector to agents .
  • the Agents generation part contains 5 Agents: Task_agent, Environment_agent, Process_agent, Device_agent, User_agent, which are obtained through the process of task analysis and representation; among them, taskID is the unique identifier of the task in the system; process_state represents the The current state of the task process, whether it is in the generation state, the execution state or the feedback state, etc.; this state assists Process_agent in task process management, Prio represents the priority of the task, the value is 0-15, and the system performs the task according to the priority order Process scheduling; taskInfo is a structure that contains detailed task information, such as task time, location, vector representation, etc.; Classification represents the category to which the task belongs, such as data labeling, sensor information collection, questionnaire answering, etc.; Topic Represents the subject of the task, which can be extracted from keyword information, such as audio collection, photo collection, etc.; deviceNum, deviceInfo, and deviceID respectively represent the number of devices participating in the task, device detailed information, and device ID
  • the scheduling and allocation part, the task scheduling sub-framework includes a strategy library, a mapping model, and a strategy management module; first, the content of the task resource map is analyzed and reasoned, and the task ID is mapped to the strategy library.
  • the task allocation function and its number are stored in the library, so as to complete the task allocation strategy selection process; then according to the selected allocation strategy, perform scheduling operations on the devices and users, and push tasks to appropriate users.
  • the resource management module includes, but is not limited to, users, perception terminals, system environments, task processes, system software, task data, and knowledge bases.
  • the management objects of CrowdOS are abstracted into 4 categories, and they are executed sequentially according to the different periods when tasks enter the system. as follows:
  • the sensing terminal When the sensing terminal is connected through the network, the signal is triggered, and the device automatically sends the current status information to the system, including but not limited to device type, remaining power, location information, storage occupancy rate, based on The system developed by the CrowdOS framework captures and stores state information through the Device-agent agent;
  • the system portrays the user portrait through the User-agent.
  • the user interacts with the system by relying on the perception end.
  • the Useragent saves the user's name, age, and tasks that have participated in it. At the same time, it generates user credit ratings, user preferences, areas of interest, and others based on the user's participation in tasks. Personalized information;
  • Environment resource records server architecture and processing capabilities. Resources are stored in the Environment-agent and updated regularly.
  • the agent has an alarm function and makes predictions based on the current system status and the increase or decrease of the task volume. If the system CPU utilization rate or storage occupancy rate is When the rated threshold is reached, the system will give an alarm.
  • Process-agent is a collection of the status information of the current stage of the task.
  • a process identification number is assigned to each task as the only sign of the existence of the task process in the system; the ID is stored in both the Task-agent and the Process-agent and accompanies the entire life cycle of the task;
  • the PA class contains task process information, including but not limited to TPID: process unique identifier; process_state: the current state of the task in the system, there are seven switchable states; process_strategy: process scheduling strategy, including first come, first served ( FCFS), round-robin method (RB), task priority, highest response ratio priority (HRRN), feedback priority; process_prio represents the process priority, from 0-15, 0 represents the highest priority, in descending order;
  • the task process state includes seven states: creation state, generation state, allocation state, execution state, processing state, feedback state and termination state; the flow relationship between states is: First, when a user publishes a task in the system, the first is task creation State; enter the task generation state after passing system authentication and analysis; enter the distribution state after task scheduling and assignment; after the release is completed, participants can execute the task, and then enter the execution state; participants upload the collected data to the system After processing, enter the processing state; after data visualization, the results are summarized to the task publisher and enter the feedback state; when the task result fails the publisher’s acceptance verification, the system will temporarily stay in the feedback state, and then after reasoning and correction, the task progresses Return to the generation state, the allocation state or the processing state again, and then execute sequentially; after the publisher verifies the task submission result, the task enters the termination state, and the entire task life cycle ends;
  • the task process scheduling algorithm in the system can choose one of the following methods:
  • FCFS First-come, first-served
  • Circular method Generate task interrupts at a periodic interval, place the currently running process in the task ready queue, and select the next ready process to run based on FCFS;
  • Task priority Prioritize high-level tasks, and tasks with the same priority shall follow the FCFS principle
  • MC multi-character cube structure
  • OS knowledge is divided into two categories: existing knowledge (EK) and new knowledge (NK); NK is extracted from tasks or data, which helps to improve system mechanisms or update models.
  • EK existing knowledge
  • NK new knowledge
  • the system distinguishes from existing knowledge that can improve system performance, update models, or improve the quality of task results.
  • Information or knowledge useful to users or third parties is not included in the scope of knowledge here;
  • the system provides corresponding mechanisms or algorithms to mine the discovered knowledge, including but not limited to deep learning algorithms, online update algorithms, and migration learning algorithms.
  • the knowledge base can extract knowledge based on the type, form, and abstraction level of the knowledge. Perform induction and database storage. Knowledge is not centrally stored in a certain management list or module in the system, it is distributed in various modules or lists of the system.
  • the knowledge base mainly records knowledge addresses and internal relationships between knowledge, and establishes a knowledge network based on these.
  • the result quality optimization module includes an interaction layer, a reasoning layer, and an execution layer; firstly, in the interaction layer, the publisher evaluates the task results by inputting evaluation information or clicking buttons on the man-machine interface, and analyzes the diversified evaluation content ; If the evaluation shows acceptance or satisfaction, the end instruction is executed, and the task is terminated; if the result is found to have quality problems through analysis, then enter the next layer; secondly, enter the reasoning layer, and perform key information extraction and depth on the publisher’s feedback content Analyze the operation, infer the possible causes of the quality problem, and then map the reason to the problem code library according to the inference model established by the system, and find the corresponding error code; third, enter the execution layer, and the problem code will be coded with the corresponding internal operation After the correction is completed, the task enters a new process state. The task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results.
  • the entire optimization process is step by step Execute and form a closed
  • the beneficial effect of the present invention is to propose a ubiquitous operating system oriented to group intelligence perception, and use CrowdOS to solve the problems of the existing mobile group intelligence perception or the lack of a unified architecture of the crowdsourcing platform and the incompatibility of algorithms or modules in related research.
  • the composition of the architecture and the contents of each module and the relationship between them are described in detail.
  • the implementation ideas of the three core mechanisms in the CrowdOS kernel architecture are explained in detail: Among them, the task analysis and scheduling mechanism establishes a bridge between the task and the OS kernel through the task resource graph, and then adapts to heterogeneous tasks.
  • the resource management mechanism abstracts heterogeneous physical and virtual resources in the system, and provides them with unified software definition and management;
  • the result quality optimization mechanism aims at quality evaluation and shallow in-depth inference mechanisms and integration Strategies for specific quality issues to quantify and optimize the quality of the results.
  • CrowdOS and WeSense an application example developed based on the architecture, are mainly evaluated from four aspects: correctness and efficiency (Ev 1 ), effectiveness and availability (Ev 2 ), optimization result quality evaluation (E v 3 ), performance, load and pressure Test (Ev 4 ).
  • Figure 1 is a block diagram of the group intelligence perception ecosystem.
  • Figure 2 is a block diagram of the core architecture of CrowdOS.
  • Figure 3 is a system resource map.
  • Figure 4 is a framework diagram of task analysis and scheduling.
  • Figure 5 is a state switching diagram of the task process.
  • Figure 6 is a framework diagram of result quality optimization.
  • Figure 7 is the WeSense usability evaluation.
  • Figure 7(a) is the homepage, showing and searching of the group intelligence tasks;
  • Figure 7(b) is the task detail page;
  • Figure 7(c) is the task submission page.
  • Figure 8 is a development efficiency evaluation diagram, where Figure 8 (a) is the time required to complete the f1-3 test with GA and GB, and Figure 8 (b) is a comparison diagram of the time consumption of all tests based on M1 and M2.
  • Figure 9 is the effectiveness evaluation of the core framework.
  • the tasks in Figure 9(a) are the effect diagrams of random assignment;
  • Figure 9(b) is the effect of using the location-based task allocation algorithm, and
  • Figure 9(c) compares the two methods.
  • Figure 9(d) is a schematic diagram of selecting the super privacy protection mode.
  • Figure 10 is a comparison diagram of optimization time, in which Figure 10 (a) is the data format correction request interface, Figure 10 (b) is the correction prompt message received by the participants; Figure 10 (c) is the time consumption comparison of the two optimization methods picture.
  • the present invention mainly includes the core architecture of CrowdOS and three important mechanisms.
  • the main contributions and innovations of CrowdOS are as follows:
  • the system analyzes group intelligence tasks based on natural language processing related technologies, and performs fine-grained semantic analysis and modeling of tasks combined with discrete features. Accordingly, a bridge between the task and the system is established, that is, the task analysis and scheduling mechanism. Based on natural language interaction understanding, it solves the problem that the platform mentioned in the first challenge only has a summary function for task results or can only handle a single task through a template.
  • the operating system abstracts all kinds of resources (user resources, task resources, system resources) required for the execution of group intelligence tasks and defines them in software. And then established a system resource map. It provides the cornerstone for the unified and efficient management of various resources in the system.
  • the resource management mechanism solves the second challenge.
  • the system proposes an evaluation method and optimization mechanism, that is, the result quality optimization mechanism.
  • This mechanism is based on the idea of deep human-computer integration, using natural language feedback interaction and deep and shallow reasoning methods, which mainly solves the two types of quality problems of sparse results and high error rate of results.
  • This man-machine coordination task result optimization mechanism solves the third challenge.
  • a ubiquitous operating system oriented to group intelligence perception is an architecture diagram of the group intelligence perception ecosystem, showing the task execution process and life cycle.
  • the ubiquitous operating system architecture for group intelligence perception includes a) server clusters, b) smart terminals, c) sensing devices, d) sensors, e) communication networks, f) basic software layer, g) platform and application software, h) Participants and group intelligence tasks, where af is used as a software carrier or hardware infrastructure support.
  • the life cycle and flow of the ubiquitous operating system for group intelligence perception is shown in Figure 1.
  • the task publisher inputs the original task data through the smart terminal and submits it to the platform.
  • the platform After the platform captures the task, it analyzes the task and gives it to the Tasks are assigned a unique task ID; when the task enters the platform, the task is analyzed and the corresponding task feature vector is generated, and the feature is spliced with other known discrete features, and the feature is extracted through the task vector.
  • the feature includes but is not limited to the task Types, number of participants required, task execution location, required sensors and types of collected data; the ubiquitous operating system for group intelligence perception completes the user scheduling and task allocation process by performing task reasoning, association and matching operations; After the participant receives the task, he selects the task of interest to execute and uploads the collected perception data or design plan to the ubiquitous operating system for group intelligence perception; when the data enters the ubiquitous operating system for group intelligence perception , Abstract and software definition of tasks, users, and process resources.
  • the ubiquitous operating system for group intelligence perception selects the task middleware that needs to be used and summarizes the collected data; finally returns the results to the task
  • the publisher the publisher evaluates and feedbacks the task results, when the publisher receives the results, the life cycle of the group intelligence task ends.
  • the present invention designs the operating system CrowdOS to implement a ubiquitous operating system oriented to group intelligence perception.
  • the core architecture of CrowdOS and the three subsystem frameworks included in the core architecture are as follows :
  • FIG. 2 is a block diagram of the core architecture of CrowdOS.
  • the ubiquitous operating system for group intelligence perception is implemented by the operating system CrowdOS. It can be seen from Figure 2 that CrowdOS runs between the native operating system and upper-layer applications, including the perception end and the server end.
  • the software carrier of the sensing end includes two types of devices. The first type is a portable intelligent sensing device with human-computer interaction functions, such as smart phones, smart watches, etc.; the second type is a fixed type deployed in the physical world.
  • CrowdOS adopts a cloud-side-end deployment method; the sensing end is deployed on various terminal sensing devices to collect environment, business, society, For data information such as crowds, the server is deployed on cloud servers or edge servers to comprehensively manage system resources and data resources and respond to system operations in real time; the system software deployed on edge servers is usually tailored and lightweight. For example, the redundant modules are removed, and only the core data processing part, visual integrated processing and other functions are still deployed on the cloud server.
  • the publisher uploads the task to the CrowdOS-based system through the interactive function of the smart terminal, and the participant browses through the perception terminal and executes the tasks that have been published in the system; when the participant receives the task, the perception terminal Data collection begins; the system support layer obtains the current device status, then uniformly interface encapsulates the acquired sensor data and unifies the data transmission format, and then stores the data in the corresponding data structure through the network or Bluetooth transmission; A sensing device that does not require human-computer interaction. Once the device is activated by a task in the system and the device passes the task verification, the device will automatically collect and upload perception data according to predetermined rules;
  • the server-end provides comprehensive management services, which are deployed on server clusters, cloud servers or edge servers. It includes a total of eight modules and belongs to the core processing framework of the operating system.
  • the server After the data is transmitted to the server, the server understands and expresses the task through the task pool module (Task Pool module), analyzes, schedules, allocates, and fine-tunes the received tasks through the task pool module, and assigns the group intelligence task to Users in the platform; in the resource management module (Resource management), software defines various resources and completes the comprehensive management of devices, users, environment, and task process scheduling (Process); then Enter the data management center module (DM center), which provides classified storage and fast retrieval functions for massive amounts of heterogeneous data.
  • Task Pool module Task Pool module
  • Resource management software defines various resources and completes the comprehensive management of devices, users, environment, and task process scheduling (Process)
  • DM center data management center module
  • the joint storage and retrieval module stores and processes the data collected through the sensing end, and extracts Useful information is transmitted to the knowledge base module; for the collected data, the task result quality optimization (TRO) module is used to optimize the data quality. After the optimization is completed, the server will send the final result back to the sensory side through the network for task release In order to complete the flow of the entire task message between the perception end and the server end.
  • Storage and query stores and processes the data collected through the sensing end, and extracts Useful information is transmitted to the knowledge base module; for the collected data, the task result quality optimization (TRO) module is used to optimize the data quality. After the optimization is completed, the server will send the final result back to the sensory side through the network for task release In order to complete the flow of the entire task message between the perception end and the server end.
  • TRO task result quality optimization
  • system plug-in module provides users with a wealth of system features, including privacy protection, credit evaluation, and user incentives; in the internal and external interface module (I&E interface), the internal interface is to facilitate CrowdOS developers’ access to the operating system For maintenance and update functions, the external interface is provided to third-party application developers, so that they can call the software interface and develop their personalized swarm perception application system based on the CrowdOS architecture.
  • I&E interface internal and external interface module
  • the internal interface is to facilitate CrowdOS developers’ access to the operating system
  • the external interface is provided to third-party application developers, so that they can call the software interface and develop their personalized swarm perception application system based on the CrowdOS architecture.
  • CrowdOS abstracted various entities and virtual resources in the ubiquitous operating system for group intelligence, completed the software definition and expression, and expressed the entity and virtual resources with the relationship between symbols and symbols. ;
  • five dynamic Agents are constructed to generate system resource maps to manage the tasks and resources in the system.
  • Agents Including task agent Task-agent (TA), user agent User-agent (UA), device agent (DA), environment agent Environment-agent (EA) and process agent Process-agent (PA).
  • TA task agent Task-agent
  • U user agent User-agent
  • DA device agent
  • EA environment agent Environment-agent
  • PA process agent Process-agent
  • TA contains detailed information of each task, including but not limited to task type, execution time, location, and data format collected.
  • UA is an abstract description of the user in the system and records the user's information, including but not limited to release And the tasks performed, the credit rating;
  • DA is the description of the terminal device resources, recording the type of the device, the current status information;
  • EA abstracts the software and hardware environment resources of the current system itself, including but not limited to the current CPU, Memory, storage usage, and the number of users in the system and the total amount of available equipment;
  • PA manages all task processes in the current system, including but not limited to process status, priority, and scheduling strategy; Agents pass internally after obtaining task information
  • the semaphore defined in the structure completes the real-time interaction and update between each other.
  • Fig. 4 task analysis and scheduling framework diagram is a detailed design of the task pool part of Task Pool in Fig. 2, which solves the first challenge proposed by the present invention. It is divided into two steps: first, deep understanding of group intelligence tasks, and fine-grained extraction of commonalities and differences of tasks; second, the system needs to allocate tasks to participants reasonably to ensure the shortest time and lowest energy consumption conditions Complete the information gathering process.
  • the system when the group intelligence task enters the system, the system first performs semantic analysis and feature extraction on the task.
  • the system will perform natural language analysis on the received task. For tasks described in languages such as English, the system performs word segmentation, and then the system performs language-insensitive operations such as part-of-speech tagging, named entity recognition, and keyword extraction, and finally extracts key information of the task, including but not limited to the task execution method, location, Time, the number of participants; the system further splices the extracted task information and the discrete features obtained through regular click selection, and inputs the spliced features into a deep neural network for unified coding, and outputs a high-dimensional task The intermediate vector; finally, the vector is mapped to the task-agent of the task through decoding, and the conversion process from the task vector to the agent P1 marked below in Figure 4 is completed.
  • the B Agents generation part marked at the top of Figure 4 contains 5 Agents: Task_agent, Environment_agent, Process_agent, Device_agent, and User_agent. Take the structure of Task-agent as an example to expand the specific definition description; TA contains all the common and individual information of the task, It is obtained through the process of task analysis and presentation in the previous step; among them, taskID is the unique identifier of the task in the system; process_state indicates the current state of the task process, whether it is in the generation state, the execution state or the feedback state; this state assists Process_agent manages the task process. Prio represents the priority of the task, with a value of 0-15.
  • taskInfo is a structure that contains task detailed information, such as task time, location, and vector Representation, etc.
  • Classification represents the category to which the task belongs, such as data labeling, sensor information collection, questionnaire answering, etc.
  • Topic represents the topic of the task, which can be extracted from keyword information, such as audio collection, photo collection Etc.
  • deviceNum, deviceInfo, and deviceID respectively represent the number of devices participating in the task, detailed device information, and a list of device IDs
  • Sensing_Data is a pointer to the collected task data set, pointing to the address of the cube where the data is stored.
  • the task scheduling subframe includes a strategy library, a mapping model, and a strategy management module; the system first analyzes and infers the content of the task resource map, and maps the task ID to the strategy The specific strategies in the library, where commonly used task allocation functions and their numbers are stored in the strategy library, so as to complete the task allocation strategy selection process; then the system performs scheduling operations on devices and users according to the selected allocation strategy, and pushes tasks to the appropriate User.
  • the resource management framework is the specific implementation of the Resource Management module Resource Management in Figure 2.
  • the content of resource management includes but is not limited to users, sensing terminals, system environments, task processes, system software, task data, and knowledge bases.
  • the management objects of CrowdOS are abstracted into four categories. The following describes how the system manages the four types of objects. The following four parts are part of resource management, and the four parts of management are executed in order according to the different periods when tasks enter the system.
  • the sensing terminal accesses the system through the network, it will trigger a signal, and the device will automatically send current status information to the system, including but not limited to device type, remaining power, location information, and storage occupancy.
  • the system developed based on the CrowdOS framework passes Device-agent The agent captures and stores state information;
  • the system portrays the user (task participant and publisher) portrait through User-agent, and the user interacts with the system by relying on the sensing terminal, for example, to publish tasks through the smart phone application interactive interface.
  • User-agent saves the user's name, age, and tasks that have participated in it. At the same time, it generates user credit ratings, user preferences, areas of interest, and other personalized information based on the user's participation in tasks.
  • Environmental resource records server architecture and processing capabilities, for example, centralized, distributed, or edge deployment architecture, the number of CPUs in the system, system CPU occupancy, memory usage, available disk storage space, and system access. These resources are stored in the Environment-agent and updated regularly to ensure that the system obtains the latest data; the agent has an alarm function, and makes predictions based on the current system status and the increase or decrease of the task volume. If the system CPU utilization or storage is occupied If the rate reaches the rated threshold, the system will give an alarm.
  • Process-agent is a collection of task current stage status information, and the system assigns a process identification number to each task as the task process in the system The only sign of existence; this ID is stored in both Task-agent and Process-agent, and accompanies the entire life cycle of the task.
  • the PA class contains a wealth of task process information, including but not limited to TPID: process unique identifier; Process-state: the state of the current task in the system, there are seven switchable states; Process-strategy: process scheduling strategy , Including first-come, first-served (FCFS), round-robin (RB), task priority, highest response ratio priority (HRRN), feedback priority; process_prio represents the process priority, from 0-15, 0 represents the highest priority, in descending order .
  • TPID process unique identifier
  • Process-state the state of the current task in the system, there are seven switchable states
  • Process-strategy process scheduling strategy , Including first-come, first-served (FCFS), round-robin (RB), task priority, highest response ratio priority (HRRN), feedback priority
  • process_prio represents the process priority, from 0-15, 0 represents the highest priority, in descending order .
  • the task process state includes seven states: creation state, generation state, allocation state, execution state, processing state, feedback state and termination state; the flow relationship between the states is shown in Figure 5.
  • a user publishes a task in the system
  • the first is the task creation state; after system authentication and analysis, it enters the task generation state; after task scheduling and assignment, it enters the distribution state; after the release is completed, participants can execute the task and then enter the execution state; the data collected by the participants Upload to the system for processing and enter the processing state; after the data is visualized, the results are summarized to the task publisher and enter the feedback state; when the task result fails to pass the publisher’s acceptance verification, the system will temporarily stay in the feedback state, and then go through reasoning and Correction, the task process returns to the generation state, the allocation state or the processing state again, and then executes sequentially; after the task submission result is verified by the publisher, the task enters the termination state, and the entire task life cycle ends.
  • the text above the state in Figure 5 represents the subject that executes the state, which is
  • the task process scheduling algorithm in the system selects one of the following methods:
  • FCFS First-come, first-served
  • Circular method Generate task interrupts at a periodic interval, place the currently running process in the task ready queue, and select the next ready process to run based on FCFS;
  • the system manages two types of data resources. One is the data carried by the task itself, which is called raw data (RD). The second is the new data (ND) that participants upload to the system during the execution of the task.
  • RD usually includes text, images and data sets that need to be labeled to describe the task.
  • ND new data
  • Various perception data uploaded by ND participants including but not limited to text descriptions, sensor data, statistical charts, design documents, and tagged data.
  • the present invention uses data cube technology to manage and store task data, construct a multi-feature cube structure (MC), and retrieve unstructured data based on the constructed data cube;
  • MC multi-feature cube structure
  • the system distinguishes from existing knowledge that can improve system performance, update models, or improve the quality of task results.
  • Information or knowledge useful to users or third parties is not included in the scope of knowledge here;
  • the system provides corresponding mechanisms or algorithms to mine the discovered knowledge, including but not limited to deep learning algorithms, online update algorithms, and migration learning algorithms.
  • the knowledge base can extract knowledge based on the type, form, and abstraction level of the knowledge. Perform induction and database storage. Knowledge is not centrally stored in a certain management list or module in the system, it is distributed in various modules or lists of the system.
  • the knowledge base mainly records knowledge addresses and internal relationships between knowledge, and establishes a knowledge network based on these.
  • the knowledge in the system is divided into two categories, one is the existing knowledge, and the other is the knowledge newly mined from tasks or data.
  • Existing knowledge includes expert strategies, decision rules or network models defined in the system in advance. Including but not limited to the existing task allocation strategies in the strategy library, and the reasoning tree in the feedback mechanism.
  • New knowledge mining is to identify effective, novel, potentially useful and interpretable content, methods, and models from data sets or existing knowledge.
  • the new knowledge in CrowdOS is an extension of the original knowledge. It includes the improvement of decision-making methods, the addition of rules or the update of the network model, and does not include the knowledge of irregular and explosive expansion.
  • the knowledge base summarizes and stores this knowledge according to the type, form and level of knowledge. Knowledge is not centrally stored in a certain management list or module in the system, it is distributed in various modules or lists of the system.
  • the knowledge base records the knowledge address, the internal relationship between knowledge, and the knowledge network topology.
  • the result quality optimization module includes an interaction layer, a reasoning layer, and an execution layer; firstly, in the interaction layer, the publisher evaluates the task results by inputting evaluation information or clicking buttons on the man-machine interface, and analyzes the diversified evaluation content ; If the evaluation shows acceptance or satisfaction, the end instruction is executed, and the task is terminated; if the result is found to have quality problems through analysis, then enter the next layer; secondly, enter the reasoning layer, and perform key information extraction and depth on the publisher’s feedback content Analyze the operation, infer the possible causes of the quality problem, and then map the reason to the problem code library according to the inference model established by the system to find the corresponding error code; third, enter the execution layer, and the problem code will be coded with the corresponding internal operation After the correction is completed, the task enters a new process state. The task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results.
  • the entire optimization process is step by step Execute and form a closed loop
  • the result quality optimization framework is the TRO framework module in the server side of Figure 2.
  • TRO includes a deep feedback framework based on human-computer interaction (DFHMI) to imitate the process of human thinking about problems, and rely on analysis and reasoning to solve problems.
  • Task quality issues are divided into two categories. The first is that the number of results is sparse, and the second is that the error rate of results exceeds the standard.
  • the optimization method used in the present invention is based on the 5Agents in the system, which is convenient for the user to interact with the system at a deeper level. The specific implementation is described in detail in FIG. 6.
  • Figure 6 is the result quality optimization framework diagram, including the interaction layer, the reasoning layer and the execution layer.
  • the publisher evaluates the task result by inputting evaluation information or clicking the button on the human-machine interface.
  • the system is responsible for analyzing the diversified evaluation content; if the evaluation shows acceptance or satisfaction, the system will execute the end instruction.
  • the task is terminated; if the result of the analysis is found to have quality problems, it will enter the next layer; secondly, enter the reasoning layer, the system will perform key information extraction and in-depth analysis operations on the publisher’s feedback content to reason about the possible causes of the quality problems.
  • the reason is mapped to the problem code library, and the corresponding error code is found; third, enter the execution layer, where the system will map the problem code and the corresponding internal operation.
  • the system has already been Most of the mapping mechanisms are defined. These operations are implemented by modifying the values in the Agent, and other types of operations are also included.
  • the task will enter a new process state.
  • the task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results.
  • the entire optimization process is executed step by step and forms a closed loop until the publisher is satisfied with the task result.
  • RSMT mapping table
  • the initial exploration is based on the decision tree model.
  • the data sparseness problem is regarded as the root node of the tree.
  • the first layer node represents the shallow cause, and the new shallow cause can be added as a child node of the root node. All the shallow causes are brothers to each other. node.
  • the deep cause is usually established on the basis of the shallow cause, which is the extension of the node in the vertical direction, which is called the deep node.
  • the system regularly updates the tree such as branching and pruning to ensure that the decision tree is maintained within a certain scale.
  • the present invention develops WeSense use cases on which users can publish and perform various sensing tasks, such as road congestion information collection, air quality status monitoring, product price research, etc.
  • Ev1 and Ev2 will be evaluated based on two development methods: M1 (independent development) and M2 (development based on the CrowdOS interface).
  • Ev3 is an evaluation of the result quality optimization mechanism (TRO)
  • Ev4 is the overall performance and stress test of WeSense supported by CrowdOS.
  • the experimental environment is set up in advance and related software is installed.
  • We hired 9 volunteers who are familiar with the Java programming language ( ⁇ num 9), and gave them two weeks to develop applications. First, it took 25 minutes (timec1) to introduce the functions and all APIs of CrowdOS.
  • GA members use M1
  • GB members use M2.
  • E v1 Comparison of M 1 and M 2 development models, comparing the time consumed to complete F ⁇ f i ⁇ , as well as the completeness and correctness of the completed tasks.
  • E v2 Compare the effect and time consumption of F ⁇ f i ⁇ between the two groups of G A and G B testers.
  • E v3 Compare the difference in optimization effect and time consumption between using the TRO framework and other optimization methods.
  • Figure 7(a) is the homepage, showing and searching for group intelligence tasks;
  • Figure 7(b) is the task detail page, you can click to view the task of interest;
  • Figure 7(c) is the task submission Page: Submit the completed result data.
  • Figure 8 is the analysis and comparison of the development cycle, that is, the efficiency evaluation.
  • Figure 8(a) shows the time required to complete the f 1-3 test with G A and G B.
  • Figure 8(b) shows the time consumption comparison of all tests based on M 1 and M 2.
  • the overall development efficiency (DE) of f 1 ⁇ f 5 is increased by 310%.
  • the shortened development time reflects the availability. It can be seen from Fig. 8(a) that compared with using M 1 , the average time consumption of f 2 and f 3 using M 2 is reduced by 65.4% and 88.7%. In addition, with the expansion of the algorithm library, the advantages of CrowdOS have been highlighted. Using it can not only greatly reduce the development time of each functional module, but also improve the overall visualization effect and program readability.
  • T iB T i ⁇ B +T i ⁇ B
  • T i ⁇ B the time of interface operation (T i ⁇ B ⁇ 6min)
  • the interface effect is shown in Figure 10(a).
  • T i ⁇ B is the time each participant spends on correcting the format and resubmitting the data, that is, T i ⁇ B ⁇ V . If there is no TRO framework, the time required for the publisher to correct all data formats is T iA .
  • T iA T i ⁇ A + T i ⁇ A , where T i ⁇ A is the preparation time before correcting the data format (T i ⁇ A ⁇ 30 min), T i ⁇ A is the time spent processing the data format, T i ⁇ A ⁇ n*V.
  • the formula (3) gives the time spent to deal with the data format problem through two optimization methods.
  • Figure 10(c) shows how the distance between the A and B curves varies with the amount of data D(n*V). That is, the comparison of optimization methods and time consumption.
  • Figure 10(a) is the data format correction request interface;
  • Figure 10(b) is the correction prompt message received by the participants;
  • Figure 10(c) compares the two optimization methods as the number of participants increases. the time consumption.
  • the correlation between task results and task requirements can be reflected in many ways, for example, when the requirements are provided in the form of video, but participants submit images; or the actual location of the uploaded data does not match the location in the task requirements .
  • the optimization mechanism will re-screen users with high creditworthiness, update task features based on specific information (such as geographic location), and then re-push the task and feedback information to the original participants, as shown in Figure 10(b).
  • the use of the TRO framework not only avoids the energy consumption caused by a large amount of processing, but is also suitable for various types of tasks.
  • the framework uses the idea of segmentation and integration to correctly transfer the work of different stages to humans or machines.
  • the time consumption of TRO is relatively stable and does not increase significantly as the number of participants increases.
  • many optimization problems are more suitable to be solved by TRO, which can greatly reduce resource consumption compared with pure machine optimization.
  • Performance and load testing Tasks of different scales were loaded sequentially, and the system response time, CPU and memory usage rate, and energy consumption of the sensor were measured. Run each test ten times, and use AndroidStudio's profile performance analyzer to monitor the data in real time while the application is running. The result is shown in Figure 11. Although the number of tasks has increased, the system response time is basically within 0.22s, and the CPU and memory usage rates also remain within the range of 3%-6% and 0.87%-1.14%. The energy consumption is basically kept below the minimum level (L1: light), which indicates that the system is functioning well.
  • L1 minimum level
  • Stability and stress testing Combines two test methods. First, the server runs continuously for 7x24 hours. During this period, the number and content of tasks are updated via mobile devices. Continuously observe and record the output logs of the sensor side and the server side, and there is no abnormal situation such as crashes or software errors. Secondly, after setting up the test environment, use Android SDKMonkey software to conduct stability and stress tests on WeSense. For example, send Monkey-pcom.hills.WeSense-v-v1000 to request execution of 1000 random command events (RCE), such as Map key, Home key. Record the number of occurrences of CRASH and ANR (application not responding). CRASH refers to the situation where the program stops or exits abnormally when an application error occurs.
  • RCE random command events
  • ANR means that when the Android system detects that the application does not respond to input events within 5 seconds or does not perform broadcast within 10 seconds, it will trigger an unresponsive prompt.
  • the test results are shown in Table 2. Combining the above two test results, we can see that the system can operate stably and efficiently under different pressure conditions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Provided by the present invention is a ubiquitous operating system, specifically CrowdOS, that is oriented towards group intelligence perception. By means of the in-depth analysis of the complex environment and diversified features of group intelligence tasks, a set of comprehensive processing mechanisms and core functional components are designed, comprising three core mechanisms, i.e. task semantic analysis and user scheduling, system resource management, and in-depth feedback interaction of task results. The present invention uses CrowdOS to solve the problems of the lack of a unified system structure of existing mobile group intelligence perception or a crowdsourcing platform and the incompatibility of algorithms or modules in related research. The task analysis and scheduling mechanism establishes a bridge between tasks and OS kernels by means of a task resource map, and adaptively selects a reasonable allocation strategy for heterogeneous tasks. The resource management mechanism abstracts heterogeneous physical and virtual resources in the system to provide unified software definition and management. A result quality optimization mechanism quantifies and optimizes the quality of the results.

Description

一种面向群智感知的泛在操作系统A ubiquitous operating system oriented to group intelligence perception 技术领域Technical field
本发明涉及移动计算和泛在操作系统领域以及群智感知领域,尤其涉及基于系统The present invention relates to the field of mobile computing and ubiquitous operating systems and the field of group intelligence perception, in particular to system-based
底层架构的操作系统。The operating system of the underlying architecture.
背景技术Background technique
随着众包和群智应用技术的兴起,涌现出了大量基于群智思想的平台和应用软件,如Amazon Mechanical Turk[文献“M.Buhrmester,T.Kwang,and S.D.Gosling,“Amazon’s mechan-ical turk:A new source of inexpensive,yet high-quality,data?”Perspectives on psychological science,vol.6,no.1,pp.3–5,2011.”中记载],CrowdFlower,美食及旅游平台和情报分析平台。另外还有许多众包应用程序如做水质检测、空气质量检测、交通拥堵情况调研的,均通过群智思想解决不同领域的难题。对于依靠众包思想发展起来的应用和技术归纳为第一代群智技术,它的特点包括:通过互联网平台发布任务,利用问题切分的思想来解决大规模问题。这些平台本身作为任务发布和结果收集的媒介,不包含针对任务本身的分析和评估,且不优化平台收集到的结果质量。With the rise of crowdsourcing and group intelligence application technologies, a large number of platforms and application software based on group intelligence ideas have emerged, such as Amazon Mechanical Turk [document "M.Buhrmester, T.Kwang, and SDGosling,"Amazon's mechanical-ical turk: A new source of expensive, yet high-quality, data? "Perspectives on psychological science, vol. 6, no. 1, pp. 3–5, 2011."], CrowdFlower, a food and tourism platform, and an intelligence analysis platform. In addition, there are many crowdsourced applications, such as water quality testing, air quality testing, and traffic congestion surveys, all of which solve problems in different fields through the wisdom of the group. The applications and technologies developed based on crowdsourcing ideas are summarized as the first-generation group intelligence technology. Its characteristics include: publishing tasks through the Internet platform and using the idea of problem segmentation to solve large-scale problems. These platforms themselves serve as the media for task release and result collection, do not include analysis and evaluation of the task itself, and do not optimize the quality of the results collected by the platform.
随着移动终端及便携传感设备等发展起来的群智应用和相关技术,归纳为第二代群智技术。这些技术被应用于环境监测、公共设施监测等领域,例如文献“R.K.Ganti,F.Ye,and H.Lei,“Mo bile crowdsensing:current sta te and f uture challenges,”IEEECommunications Magazine,vol.49,no.11,pp.32–39,2011.”,以及文献“B.Guo,Z.Wang,Z.Yu,Y.Wang,N.Y.Yen,R.Huang,and X.Zhou,“Mobile crowd sensing and computing:The review of an emerging human-powered sensing paradigm,”ACM Computing Surveys(CSUR),vol.48,no.1,p.7,2015.”。较为有名系统的包括Common Sense、Ear-Phone、Chimera、Creekwatch、和PhotoCity等。另外,相关的群智感知技术也被广泛研究。例如,Wang等研究了群智应用中的多任务协作分配问题。郭等提出了优化感测数据的挑战。F.Restuccia等人研究了人群感知数据的质量改进方法。另外,众包工作者的激励机制和隐私保护已经成为一个经过深入研究的问题。这些应用和技术大多用于科学研究,以采集某种传感器数据为目标。所开发的软件只能完成单一的任务且不可复用,不可迁移到其他任务上。这些技术也大多是在理想环境下基于诸多假设条件进行的,并且只完成了简单的仿真实验。研究并未考虑与实际平台的结合。综合理论分析和实际体验,本发明深入思考了第一代及第二代群智技术在发展和推广时所面临的困难,期望能够研究统一的架构来集中解决这些困难。With the development of mobile terminals and portable sensing devices, the group intelligence application and related technologies are summarized as the second-generation group intelligence technology. These technologies are used in environmental monitoring, public facility monitoring and other fields, such as the literature "RKGanti, F. Ye, and H. Lei, "Mo bile crowdsensing: current stag and future challenges," IEEE Communications Magazine, vol. 49, no.11,pp.32–39,2011.", and the literature "B.Guo,Z.Wang,Z.Yu,Y.Wang,NYYen,R.Huang,and X.Zhou,"Mobile crowd sensing and computing: The review of an emerging human-powered sensing paradigm, "ACM Computing Surveys (CSUR), vol. 48, no. 1, p. 7, 2015.". The more well-known systems include Common Sense, Ear-Phone, Chimera, Creekwatch, and PhotoCity. In addition, related group intelligence perception technology has also been extensively studied. For example, Wang et al. studied the problem of multi-task cooperative allocation in the application of group intelligence. Guo et al. put forward the challenge of optimizing the sensing data. F. Restuccia et al. studied the quality improvement methods of crowd perception data. In addition, the incentive mechanism and privacy protection of crowdsourcing workers have become a subject of in-depth research. These applications and technologies are mostly used in scientific research, with the goal of collecting some kind of sensor data. The developed software can only complete a single task and cannot be reused, and cannot be migrated to other tasks. Most of these technologies are based on many assumptions in an ideal environment, and only simple simulation experiments have been completed. The study did not consider the combination with the actual platform. Comprehensive theoretical analysis and practical experience, the present invention deeply considers the difficulties faced by the development and promotion of the first and second generation of smart technologies, and hopes to study a unified architecture to focus on solving these difficulties.
当前平台面临的困难和挑战包括:The difficulties and challenges facing the current platform include:
1)缺少能够处理多种类型任务的框架,且该框架需要能对任务进行统一的和深层的理解。众包平台通常只是作为集中发布和收集任务的互联网公告栏,缺少区别对待不同任务的功能。当前的群智感知应用大多使用定制的软件以及特定感知设备来执行任务,应用和任务是一对一绑定的关系。因此,应用软件普遍缺乏普适性和扩展性,难以被迁移到其它类型任务上。1) There is a lack of a framework that can handle multiple types of tasks, and the framework requires a unified and in-depth understanding of tasks. The crowdsourcing platform is usually just an Internet bulletin board for centralized publishing and collecting tasks, and lacks the function of treating different tasks differently. The current group intelligence applications mostly use customized software and specific sensing devices to perform tasks, and applications and tasks have a one-to-one binding relationship. Therefore, application software generally lacks universality and scalability, and it is difficult to migrate to other types of tasks.
2)缺乏对群智系统中各类资源的抽象和统一管理。系统无法对人、任务、设备资源、软件资源,以及产生的知识数据等资源进行联合分析和调度。当前技术研究大多是在理想环境下为解决特定问题进行的,基于诸多条件假设为前提。由于研究的假设条件及场景设定都不同,因此技术之间是互相隔离的。由于这些分散的研究之间缺少沟通桥梁,因此各学者提出的局部性方法也很难推广到实际应用中。2) Lack of abstraction and unified management of various resources in the group intelligence system. The system cannot perform joint analysis and scheduling of resources such as people, tasks, equipment resources, software resources, and the knowledge data generated. Most of the current technical researches are carried out in an ideal environment to solve specific problems, based on many conditional assumptions. Because the research assumptions and scenario settings are different, the technologies are isolated from each other. Due to the lack of communication bridges between these scattered studies, the local methods proposed by various scholars are also difficult to generalize to practical applications.
3)缺少结果质量评估和优化方法。众包平台通常只是对任务结果数据做汇总,没有进一步评估分析。群智应用中可能预置一些简单的结果筛选方法,但由于这些方法通常是针对特定类型的任务或数据,难以推广到其它任务或数据类型上且不能支持复杂的数据处理任务(如数据语义理解等)。还有一些优化方法是在任务发布者拿到结果之后进行的,例如用户对数据做 清洗和筛选等,该操作与发布平台或应用无关,不能降低群智感知的数据汇聚成本。3) Lack of results quality evaluation and optimization methods. The crowdsourcing platform usually only summarizes the task result data without further evaluation and analysis. Some simple results filtering methods may be preset in the Qunzhi application, but because these methods are usually for specific types of tasks or data, it is difficult to generalize to other tasks or data types and cannot support complex data processing tasks (such as data semantic understanding). Wait). There are also some optimization methods that are performed after the task publisher gets the results. For example, the user cleans and filters the data. This operation has nothing to do with the publishing platform or application, and cannot reduce the data aggregation cost of the group intelligence perception.
发明内容Summary of the invention
为了克服现有技术的不足,本发明提供一种面向群智感知的泛在操作系统,具体为一个面向群智的泛在操作系统CrowdOS,通过对群智任务的复杂环境和多样化特征的深入分析,进而基于操作系统框架设计了一套综合处理机制和核心功能组件。作为群智应用的孵化器和加速器,CrowdOS作为中间件运行在异构设备的原生操作系统和应用层之间。本发明重点包含CrowdOS核心架构和它的三个核心机制,即:任务语义解析及用户调度、系统资源管理和任务结果深度反馈交互。In order to overcome the shortcomings of the prior art, the present invention provides a ubiquitous operating system oriented to group intelligence perception, specifically a ubiquitous operating system oriented to group intelligence, CrowdOS, through the in-depth understanding of the complex environment and diversified characteristics of group intelligence tasks Analysis, and then designed a set of comprehensive processing mechanism and core functional components based on the operating system framework. As an incubator and accelerator for Qunzhi applications, CrowdOS runs as a middleware between the native operating system and the application layer of heterogeneous devices. The present invention focuses on the core architecture of CrowdOS and its three core mechanisms, namely: task semantic analysis and user scheduling, system resource management and deep feedback interaction of task results.
本发明解决其技术问题所采用的技术方案是:The technical solutions adopted by the present invention to solve its technical problems are:
一种面向群智感知的泛在操作系统,任务发布者通过智能终端输入原始任务数据并提交至平台,平台捕获到任务后,对任务进行解析,并给该任务分配唯一的任务ID;当任务进入平台之后,进行任务解析并生成对应的任务特征向量,并和其他已知离散特征做特征拼接,通过任务向量提取出特性,所述特性包含但不限于任务种类、需要的参与者人数,任务执行地点,需用到的传感器和采集数据的类型;面向群智感知的泛在操作系统通过执行任务推理,关联及匹配操作,完成用户调度和任务分配过程;参与者接收到任务之后,选择感兴趣的任务执行并将采集到的感知数据或设计方案上传到面向群智感知的泛在操作系统中;当数据进入到面向群智感知的泛在操作系统后,对任务、用户、进程资源进行抽象和软件定义,根据任务特征信息描述,面向群智感知的泛在操作系统选择需要使用的任务中间件并对采集的数据进行汇总;最后将结果返回给任务发布者,发布者对任务结果进行评价和反馈,当发布者接收结果后,群智任务的生命周期结束。A ubiquitous operating system for group intelligence perception. The task publisher inputs the original task data through the smart terminal and submits it to the platform. After the task is captured by the platform, the task is analyzed and a unique task ID is assigned to the task; After entering the platform, perform task analysis and generate the corresponding task feature vector, and perform feature splicing with other known discrete features, and extract the characteristics through the task vector. The characteristics include but are not limited to the task type, the number of participants required, and the task The execution location, the type of sensors and data collected; the ubiquitous operating system for group intelligence perception completes the process of user scheduling and task allocation by performing task inference, association and matching operations; after the participants receive the task, they have a sense of choice The task of interest is executed and the collected perception data or design plan is uploaded to the ubiquitous operating system for group intelligence perception; when the data enters the ubiquitous operating system for group intelligence perception, the task, user, and process resources are performed Abstract and software definition, according to the description of task feature information, the ubiquitous operating system for group intelligence selects the task middleware to be used and summarizes the collected data; finally the results are returned to the task publisher, and the publisher performs the task results Evaluation and feedback, when the publisher receives the results, the life cycle of the group intelligence task ends.
所述面向群智感知的泛在操作系统采用操作系统CrowdOS实现,CrowdOS运行在原生操作系统和上层应用之间,包含感知端和服务端,其中感知端的软件载体包含两类设备,第一类是具备人机交互功能的便携式智能感知设备,第二类是部署在物理世界中的固定传感器;CrowdOS采用云-边-端的部署方式;感知端部署在各类终端感知设备上,收集数据信息,服务端部署在云服务器或者边缘服务器上,对系统资源及数据资源进行综合管理并实时响应系统操作;The ubiquitous operating system oriented to group intelligence perception is implemented by the operating system CrowdOS. CrowdOS runs between the native operating system and upper-layer applications, and includes the sensing end and the server. The software carrier of the sensing end includes two types of equipment, the first type is Portable intelligent sensing devices with human-computer interaction functions. The second category is fixed sensors deployed in the physical world; CrowdOS adopts a cloud-side-to-end deployment method; the sensing end is deployed on various terminal sensing devices to collect data, information and services The end is deployed on a cloud server or edge server to comprehensively manage system resources and data resources and respond to system operations in real time;
在感知端的功能层部分,首先发布者通过智能终端的交互功能将任务上传到基于CrowdOS的系统中,参与者通过感知端浏览并执行已发布到系统中的任务;当参与者接收任务,感知端则开始进行数据采集;系统支持层获取当前设备状态,然后对获取到的传感器数据统一进行接口封装并统一数据传输格式,再通过网络或蓝牙传输方式将数据存入对应的数据结构体中;对于无需人机交互的传感设备,一旦该设备被系统中的任务激活且设备通过了任务验证,则该设备开始自动按照预定规则收集并上传感知数据;In the functional layer part of the perception terminal, first the publisher uploads the task to the CrowdOS-based system through the interactive function of the smart terminal, and the participant browses through the perception terminal and executes the tasks that have been published in the system; when the participant receives the task, the perception terminal Data collection begins; the system support layer obtains the current device status, then uniformly interface encapsulates the acquired sensor data and unifies the data transmission format, and then stores the data in the corresponding data structure through the network or Bluetooth transmission; A sensing device that does not require human-computer interaction. Once the device is activated by a task in the system and the device passes the task verification, the device will automatically collect and upload perception data according to predetermined rules;
服务端(Server-end)提供了综合管理服务,部署在服务器集群、云服务器或者边缘服务器之上,共包括任务池模块(Taskpool模块)、资源管理模块(Resource management)、数据管理中心模块(DM center)、内外部接口模块(I&E interface)、知识库模块(Knowledge base)、系统插件模块(System Plugin)、任务结果质量优化(TRO)和联合存储和检索模块(Storage and query)八个模块;首先当任务发布者从感知端将任务数据传输到服务端后,服务端通过任务池模块对任务进行理解和表示,通过任务池模块对接收到的任务进行解析、调度、分配以及微调,并将群智任务分配给平台中的用户;在资源管理模块中,对各种资源进行软件定义并完成设备、用户、环境以及任务进程调度的综合管理;然后进入数据管理中心模块,该模块提供了海量异构数据的分类存储以及快速检索功能,联合存储和检索模块对通过感知端采集到的数据进行存储和处理,萃取有用的信息传输给知识库模块;对于采集完成的数据,通过任务结果质量优化模块进行数据质量优化,优化完成后服务端再将最终结果通网络反馈给感知端的任务发布者,从而完成整个任务消息在感知端和服务端之间的流通;系统插件模块为用户提供隐私保护、信用评估及用户激励;内外部接口模块中,内部接口用于对操作系统进行维护和更新,外部接口 提供给第三方应用,用于调用软件接口。The server-end (Server-end) provides comprehensive management services, which are deployed on server clusters, cloud servers or edge servers, including task pool modules (Taskpool modules), resource management modules (Resource management), and data management center modules (DM). center), internal and external interface module (I&E interface), knowledge base module (Knowledge base), system plug-in module (System Plugin), task result quality optimization (TRO) and joint storage and retrieval module (Storage and query) eight modules; First, after the task publisher transmits the task data from the sensing end to the server, the server understands and expresses the task through the task pool module, and parses, schedules, allocates, and fine-tunes the received tasks through the task pool module. Qunzhi tasks are allocated to users in the platform; in the resource management module, various resources are defined by software and the comprehensive management of equipment, users, environment, and task process scheduling is completed; then enter the data management center module, which provides a large number of Classification and storage of heterogeneous data and fast retrieval functions. The combined storage and retrieval module stores and processes the data collected through the sensing terminal, extracts useful information and transmits it to the knowledge base module; for the collected data, the quality of the task results is optimized The module optimizes data quality. After the optimization is completed, the server will feed back the final result to the task publisher on the sensing side through the network, thereby completing the circulation of the entire task message between the sensing side and the server; the system plug-in module provides users with privacy protection, Credit evaluation and user incentives; in the internal and external interface modules, the internal interface is used to maintain and update the operating system, and the external interface is provided to third-party applications for invoking the software interface.
在CrowdOS中,在任务进入后构建了五个动态Agent生成系统资源图谱,来管理系统中的任务和资源,Agents包括任务智能体Task-agent(TA),用户智能体User-agent(UA),设备智能体Device-agent(DA),环境智能体Environment-agent(EA)和进程智能体Process-agent(PA);其中TA包含每个任务的详细信息,包括但不限于任务种类,执行时间,地点,收集的数据格式,UA是对系统中用户的抽象描述,记录此用户的信息,包括但不限于发布和执行过的任务,信用等级;DA是对终端设备资源的描述,记录该设备的类型,当前状态信息;EA抽象当前系统本身所处的软硬件环境资源,包括但不限于当前CPU、内存、存储使用情况以及系统中用户数目及可用设备总量;PA管理当前系统中存在的所有任务进程,包括但不限于进程状态、优先级、调度策略;Agents在获取到任务信息之后通过内部结构中定义的信号量完成互相之间的实时交互和更新。In CrowdOS, five dynamic Agents are constructed to generate system resource maps after the task enters to manage the tasks and resources in the system. Agents include task agent Task-agent (TA), user agent User-agent (UA), Device-agent (DA), Environment-agent (EA) and Process-agent (PA); where TA contains detailed information about each task, including but not limited to task type, execution time, Location, collected data format, UA is an abstract description of the user in the system, and records the user’s information, including but not limited to the tasks posted and performed, and credit rating; DA is the description of the terminal equipment resources, recording the device’s information Type, current status information; EA abstracts the hardware and software environment resources of the current system itself, including but not limited to the current CPU, memory, storage usage, the number of users in the system and the total amount of available equipment; PA manages all existing systems in the current system Task process, including but not limited to process status, priority, scheduling strategy; Agents complete mutual real-time interaction and update through the semaphore defined in the internal structure after obtaining task information.
所述任务池中,任务信息萃取部分,当群智任务进入系统,首先对任务进行语义解析和特征提取,将对接收的任务进行自然语言分析,对于通过语言描述的任务,系统进行分词处理,然后系统执行不区分语种的操作,最终提取出任务关键信息,包括但不限于任务的执行方式、地点、时间、参与者数目,对提取出的任务信息和通过规则点击选取而获取到的离散特征进行拼接,将拼接完的特征输入到一个深度神经网络中进行统一编码,输出一个高维的任务中间向量;最后通过解码将向量映射到该任务的Task-agent,完成任务向量到agents的转换过程。In the task pool, in the task information extraction part, when the group intelligence task enters the system, the task is firstly analyzed by semantic analysis and feature extraction, and natural language analysis is performed on the received task. For tasks described by language, the system performs word segmentation processing. Then the system performs operations that do not distinguish between languages, and finally extracts mission-critical information, including but not limited to the task execution method, location, time, number of participants, the extracted task information and the discrete features obtained through rule click selection For splicing, input the spliced features into a deep neural network for unified encoding, and output a high-dimensional task intermediate vector; finally, map the vector to the task-agent of the task through decoding, and complete the conversion process from task vector to agents .
所述任务池中,Agents生成部分包含5个Agents:Task_agent、Environment_agent、Process_agent、Device_agent、User_agent,通过任务解析和表示过程得到的;其中,taskID是任务在系统中的唯一标识符;process_state表示了该任务进程的当前状态,是处于生成态、执行态还是反馈态等;该状态协助Process_agent进行任务进程管理,Prio代表了该任务的优先级,取值为0-15,系统根据优先级顺序进行任务进程调度;taskInfo是一个结构体,包含任务详细信息,如任务时间、地点、向量表示等;Classification代表了该任务所属的类别,如数据标注类、传感信息采集类、问卷回答类等;Topic表示了该任务的主题,可以从关键词信息中提取,如音频采集、照片收集等;deviceNum、deviceInfo、deviceID分别代表了参与执行该任务的设备数目、设备详细信息以及设备ID列表;Sensing_Data是收集到的任务数据集指针,指向存储数据的立方体地址。In the task pool, the Agents generation part contains 5 Agents: Task_agent, Environment_agent, Process_agent, Device_agent, User_agent, which are obtained through the process of task analysis and representation; among them, taskID is the unique identifier of the task in the system; process_state represents the The current state of the task process, whether it is in the generation state, the execution state or the feedback state, etc.; this state assists Process_agent in task process management, Prio represents the priority of the task, the value is 0-15, and the system performs the task according to the priority order Process scheduling; taskInfo is a structure that contains detailed task information, such as task time, location, vector representation, etc.; Classification represents the category to which the task belongs, such as data labeling, sensor information collection, questionnaire answering, etc.; Topic Represents the subject of the task, which can be extracted from keyword information, such as audio collection, photo collection, etc.; deviceNum, deviceInfo, and deviceID respectively represent the number of devices participating in the task, device detailed information, and device ID list; Sensing_Data is the collection The pointer to the task data set that points to the address of the cube where the data is stored.
所述任务池中,调度和分配部分,任务调度子框架中包含了策略库、映射模型及策略管理模块;首先对任务资源图谱的内容进行分析和推理,将任务ID映射到策略库中,策略库中存放任务分配函数及其编号,从而完成任务分配策略选取过程;然后根据选取的分配策略对设备和用户执行调度操作,并将任务推送给合适的用户。In the task pool, the scheduling and allocation part, the task scheduling sub-framework includes a strategy library, a mapping model, and a strategy management module; first, the content of the task resource map is analyzed and reasoned, and the task ID is mapped to the strategy library. The task allocation function and its number are stored in the library, so as to complete the task allocation strategy selection process; then according to the selected allocation strategy, perform scheduling operations on the devices and users, and push tasks to appropriate users.
所述资源管理模块包括但不限于用户、感知终端、系统环境、任务进程、系统软件、任务数据以及知识库,将CrowdOS的管理对象抽象为4类,根据任务进入系统的不同时期顺序执行,具体如下:The resource management module includes, but is not limited to, users, perception terminals, system environments, task processes, system software, task data, and knowledge bases. The management objects of CrowdOS are abstracted into 4 categories, and they are executed sequentially according to the different periods when tasks enter the system. as follows:
1)设备、用户及环境管理[0020]当感知端通过网络接入时触发信号,设备自动向系统发送当前的状态信息,包括但不限于设备类型、剩余电量、位置信息、存储占用率,基于CrowdOS框架开发的系统通过Device-agent智能体捕获和存储状态信息;1) Device, user and environment management [0020] When the sensing terminal is connected through the network, the signal is triggered, and the device automatically sends the current status information to the system, including but not limited to device type, remaining power, location information, storage occupancy rate, based on The system developed by the CrowdOS framework captures and stores state information through the Device-agent agent;
系统通过User-agent描绘用户画像,用户依托感知端与系统进行交互,Useragent保存用户姓名,年龄,参与过的任务,同时会根据用户参与任务的情况生成用户信用等级,用户偏好,兴趣领域及其他个性化信息;The system portrays the user portrait through the User-agent. The user interacts with the system by relying on the perception end. The Useragent saves the user's name, age, and tasks that have participated in it. At the same time, it generates user credit ratings, user preferences, areas of interest, and others based on the user's participation in tasks. Personalized information;
环境资源记录服务器架构和处理能力,资源被存储于Environment-agent中并且定时更新,该agent具备报警功能,根据当前系统状态和任务量增减情况做预判,如果系统CPU利用率或存储占用率达到了额定阈值,系统会给出警报。Environment resource records server architecture and processing capabilities. Resources are stored in the Environment-agent and updated regularly. The agent has an alarm function and makes predictions based on the current system status and the increase or decrease of the task volume. If the system CPU utilization rate or storage occupancy rate is When the rated threshold is reached, the system will give an alarm.
2)任务进程调度管理2) Task process scheduling management
Process-agent是任务当前阶段状态信息的集合,给每个任务分配进程识别号作为系统中任务进程存在的唯一标志;ID同时存储于Task-agent及Process-agent中,伴随任务的整个生命周期;PA类里面包含任务进程信息,包含但不限于TPID:进程唯一标识符;process_state:当前任务在系统中所处的状态,有七种可切换状态;process_strategy:进程调度策略,包含先到先服务(FCFS)、循环法(RB)、任务优先级、最高响应比优先(HRRN)、反馈优先;process_prio代表进程优先级,从0-15,0代表最高优先级,依次递减;Process-agent is a collection of the status information of the current stage of the task. A process identification number is assigned to each task as the only sign of the existence of the task process in the system; the ID is stored in both the Task-agent and the Process-agent and accompanies the entire life cycle of the task; The PA class contains task process information, including but not limited to TPID: process unique identifier; process_state: the current state of the task in the system, there are seven switchable states; process_strategy: process scheduling strategy, including first come, first served ( FCFS), round-robin method (RB), task priority, highest response ratio priority (HRRN), feedback priority; process_prio represents the process priority, from 0-15, 0 represents the highest priority, in descending order;
任务进程状态包含创建态、生成态、分配态、执行态、处理态、反馈态及终止态七种状态;状态之间的流转关系为:首先,当用户在系统中发布任务时首先是任务创建态;通过系统认证和解析后进入任务生成态;通过任务调度和分配后进入分配态;发布完成后参与者可对任务进行执行,进而进入执行态;参与者将采集到的数据上传到系统中进行处理,进入处理态;经过数据可视化呈现把结果汇总给任务发布者,进入反馈态;当任务结果未通过发布者接收验证时,系统会暂时停留在反馈态,进而经过推理和修正,任务进程再次回到生成态、分配态或处理态,然后顺序执行;通过发布者验证任务提交结果后,则任务进入终止态,整个任务生命周期结束;The task process state includes seven states: creation state, generation state, allocation state, execution state, processing state, feedback state and termination state; the flow relationship between states is: First, when a user publishes a task in the system, the first is task creation State; enter the task generation state after passing system authentication and analysis; enter the distribution state after task scheduling and assignment; after the release is completed, participants can execute the task, and then enter the execution state; participants upload the collected data to the system After processing, enter the processing state; after data visualization, the results are summarized to the task publisher and enter the feedback state; when the task result fails the publisher’s acceptance verification, the system will temporarily stay in the feedback state, and then after reasoning and correction, the task progresses Return to the generation state, the allocation state or the processing state again, and then execute sequentially; after the publisher verifies the task submission result, the task enters the termination state, and the entire task life cycle ends;
系统中的任务进程调度算法从以下方法中任选一个:The task process scheduling algorithm in the system can choose one of the following methods:
1)先到先服务(FCFS):优先处理先进入系统的任务,为其提供资源和服务;1) First-come, first-served (FCFS): Prioritize tasks that enter the system first, and provide resources and services for them;
2)循环法:以一个周期性间隔产生任务中断,将当前正在运行的进程置于任务就绪队列,基于FCFS选择下一个就绪进程运行;2) Circular method: Generate task interrupts at a periodic interval, place the currently running process in the task ready queue, and select the next ready process to run based on FCFS;
3)任务优先级:优先处理高级别的任务,同一优先级的任务按照FCFS原则;3) Task priority: Prioritize high-level tasks, and tasks with the same priority shall follow the FCFS principle;
4)最高响应比优先(HRRN),R=(w+s)/s,其中R表示响应比,w表示已经等待的时间,s表示期待被服务的时间;4) The highest response ratio priority (HRRN), R=(w+s)/s, where R represents the response ratio, w represents the waiting time, and s represents the time expected to be served;
5)反馈优先:对于进入反馈态的任务,在原始优先级基础上上调两级,优先使用系统资;5) Feedback priority: For tasks that enter the feedback state, two levels will be raised on the basis of the original priority, and system resources will be used first;
3)异构多模态数据资源管理3) Heterogeneous multi-modal data resource management
对于多模态数据的管理,步骤如下:For the management of multi-modal data, the steps are as follows:
1)收集和存储数据;1) Collect and store data;
2)构建基于群智的非结构化数据的检索方法,一旦数据准备完成,开始构建数据堆栈;2) Construct an unstructured data retrieval method based on Qunzhi. Once the data preparation is completed, start to build the data stack;
3)采用数据立方体技术管理和存储任务数据,构造多特征立方体结构(MC),依据构建的数据立方体来检索非结构化数据;3) Using data cube technology to manage and store task data, construct a multi-character cube structure (MC), and retrieve unstructured data based on the constructed data cube;
4)随着系统中任务量的增长,已完成的任务数据和中间数据被定期清理,而经过深层分析的数据则会被转移到知识库中进行管理;4) As the amount of tasks in the system grows, the completed task data and intermediate data are regularly cleaned up, and the data that has undergone in-depth analysis will be transferred to the knowledge base for management;
4)知识库管理4) Knowledge base management
OS的知识分为两类:现有知识(EK)和新知识(NK);NK是从任务或数据中提取的,这有助于改善系统机制或更新模型。知识管理的过程如下:OS knowledge is divided into two categories: existing knowledge (EK) and new knowledge (NK); NK is extracted from tasks or data, which helps to improve system mechanisms or update models. The process of knowledge management is as follows:
首先,系统从现有知识中分辨出能够改善系统性能、更新模型或提升任务结果质量的知识,对用户或第三方有用的信息或知识不包括在此处的知识范围内;其次,对于待被发现的知识,系统提供对应的的机制或算法对其进行挖掘,包括但不限于深度学习算法、在线更新算法、迁移学习算法;第三,知识库根据知识的种类,形式和抽象层级,对知识进行归纳和数据库存储。知识并不是集中存储于系统中的某个管理列表或模块中,它分布式存在于系统的各个模块或列表中。知识库主要记录了知识地址,知识之间内在的关系,并根据这些建立起知识网络。First, the system distinguishes from existing knowledge that can improve system performance, update models, or improve the quality of task results. Information or knowledge useful to users or third parties is not included in the scope of knowledge here; The system provides corresponding mechanisms or algorithms to mine the discovered knowledge, including but not limited to deep learning algorithms, online update algorithms, and migration learning algorithms. Third, the knowledge base can extract knowledge based on the type, form, and abstraction level of the knowledge. Perform induction and database storage. Knowledge is not centrally stored in a certain management list or module in the system, it is distributed in various modules or lists of the system. The knowledge base mainly records knowledge addresses and internal relationships between knowledge, and establishes a knowledge network based on these.
所述结果质量优化模块,包含交互层,推理层以及执行层;首先,在交互层中发布者通过输入评价信息或点击人机界面上的按钮来评估任务结果,对多样化的评价内容进行分析;如果评价显示接受或者满意,执行结束指令,该任务进行被终;若通过分析发现结果存在质量问题则进入下一层;其次,进入推理层,对发布者的反馈内容执行关键信息抽取和深度分析操作,推理出现质量问题的可能原因,然后根据系统建立的推理模型,将原因映射到问题编码库中, 找到对应的错误编码;第三,进入执行层,会将问题编码与对应的内部操作进行映射;修正完毕后,任务进入新的进程状态,通过各种途径修正的任务结果,将再次进入交互层,反馈给发布者,等待交互层给出新的评估结果,整个优化过程按步骤依次执行并形成一个闭环,直到发布者对任务结果满意则终止。The result quality optimization module includes an interaction layer, a reasoning layer, and an execution layer; firstly, in the interaction layer, the publisher evaluates the task results by inputting evaluation information or clicking buttons on the man-machine interface, and analyzes the diversified evaluation content ; If the evaluation shows acceptance or satisfaction, the end instruction is executed, and the task is terminated; if the result is found to have quality problems through analysis, then enter the next layer; secondly, enter the reasoning layer, and perform key information extraction and depth on the publisher’s feedback content Analyze the operation, infer the possible causes of the quality problem, and then map the reason to the problem code library according to the inference model established by the system, and find the corresponding error code; third, enter the execution layer, and the problem code will be coded with the corresponding internal operation After the correction is completed, the task enters a new process state. The task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results. The entire optimization process is step by step Execute and form a closed loop, and terminate until the publisher is satisfied with the task result.
本发明的有益效果在于提出了一种面向群智感知的泛在操作系统,利用CrowdOS解决现有移动群智感知或者众包平台缺乏统一体系结构以及相关研究中的算法或模块不兼容的问题。对架构的组成和各模块包含的内容及之间的关系进行了详细描述。另外,对于CrowdOS内核体系结构中的三个核心机制的实现思路进行了详细说明:其中,任务解析及调度机制通过任务资源图谱在任务和OS内核之间建立了桥梁,然后针对异构任务自适应地选择合理的分配策略;资源管理机制在系统中抽象异构的物理和虚拟资源,并为它们提供统一的软件定义和管理;结果质量优化机制旨在通过质量评估和浅层深度推断机制以及整合特定质量问题的策略来量化和优化结果的质量。The beneficial effect of the present invention is to propose a ubiquitous operating system oriented to group intelligence perception, and use CrowdOS to solve the problems of the existing mobile group intelligence perception or the lack of a unified architecture of the crowdsourcing platform and the incompatibility of algorithms or modules in related research. The composition of the architecture and the contents of each module and the relationship between them are described in detail. In addition, the implementation ideas of the three core mechanisms in the CrowdOS kernel architecture are explained in detail: Among them, the task analysis and scheduling mechanism establishes a bridge between the task and the OS kernel through the task resource graph, and then adapts to heterogeneous tasks. Choose a reasonable allocation strategy; the resource management mechanism abstracts heterogeneous physical and virtual resources in the system, and provides them with unified software definition and management; the result quality optimization mechanism aims at quality evaluation and shallow in-depth inference mechanisms and integration Strategies for specific quality issues to quantify and optimize the quality of the results.
通过基于该架构来开发应用实例的方式,评估了CrowdOS的正确性和内核模块的有效性以及总体开发效率,比较了使用前后结果的优化速度和能耗。主要从四个方面评估CrowdOS以及基于该架构开发的应用实例WeSense:正确性和效率(Ev 1),有效性和可用性(Ev 2),优化结果质量评估(E v 3),性能,负载和压力测试(Ev 4)。 By developing application examples based on this architecture, the correctness of CrowdOS, the effectiveness of the kernel module and the overall development efficiency are evaluated, and the optimization speed and energy consumption of the results before and after use are compared. CrowdOS and WeSense, an application example developed based on the architecture, are mainly evaluated from four aspects: correctness and efficiency (Ev 1 ), effectiveness and availability (Ev 2 ), optimization result quality evaluation (E v 3 ), performance, load and pressure Test (Ev 4 ).
附图说明Description of the drawings
图1为群智感知生态系统框图。Figure 1 is a block diagram of the group intelligence perception ecosystem.
图2为CrowdOS核心架构框图。Figure 2 is a block diagram of the core architecture of CrowdOS.
图3是系统资源图谱。Figure 3 is a system resource map.
图4为任务解析及调度框架图。Figure 4 is a framework diagram of task analysis and scheduling.
图5是任务进程状态切换图。Figure 5 is a state switching diagram of the task process.
图6是结果质量优化框架图。Figure 6 is a framework diagram of result quality optimization.
图7是WeSense可用性评估。图7(a)为主页,群智任务的显示和搜索;图7(b)为任务详细页面;图7(c)为任务提交页面。Figure 7 is the WeSense usability evaluation. Figure 7(a) is the homepage, showing and searching of the group intelligence tasks; Figure 7(b) is the task detail page; Figure 7(c) is the task submission page.
图8是开发效率评估图,其中图8(a)为用GA和GB完成f1-3测试所需的时间,图8(b)为基于M1和M2的所有测试的时间消耗比较示意图。Figure 8 is a development efficiency evaluation diagram, where Figure 8 (a) is the time required to complete the f1-3 test with GA and GB, and Figure 8 (b) is a comparison diagram of the time consumption of all tests based on M1 and M2.
图9是核心框架有效性评估,图9(a)中的任务是随机分配的效果图;图9(b)是使用基于位置的任务分配算法的效果,图9(c)是比较两种方法的效果;图9(d)是选择超级隐私保护模式示意图。Figure 9 is the effectiveness evaluation of the core framework. The tasks in Figure 9(a) are the effect diagrams of random assignment; Figure 9(b) is the effect of using the location-based task allocation algorithm, and Figure 9(c) compares the two methods. Figure 9(d) is a schematic diagram of selecting the super privacy protection mode.
图10是优化时间对比图,其中图10(a)是数据格式校正请求界面,图10(b)是参与者收到的纠正提示消息;图10(c)则两种优化方法的时间消耗对比图。Figure 10 is a comparison diagram of optimization time, in which Figure 10 (a) is the data format correction request interface, Figure 10 (b) is the correction prompt message received by the participants; Figure 10 (c) is the time consumption comparison of the two optimization methods picture.
图11性能及压力测试对比图。Figure 11 Performance and stress test comparison chart.
具体实施方式Detailed ways
下面结合附图和实施例对本发明进一步说明。The present invention will be further described below in conjunction with the drawings and embodiments.
本发明主要包含CrowdOS的核心架构和三个重要机制。CrowdOS的主要贡献和创新点如下:The present invention mainly includes the core architecture of CrowdOS and three important mechanisms. The main contributions and innovations of CrowdOS are as follows:
第一,设计了群智感知操作系统框架,并详细介绍了CrowdOS的核心架构。First, the framework of the group intelligence perception operating system is designed, and the core architecture of CrowdOS is introduced in detail.
第二,系统基于自然语言处理相关技术分析了群智任务,结合离散特征对任务进行了细粒度的语义分析和建模。据此建立起了任务与系统沟通的桥梁,即任务解析及调度机制。它以自然语言交互理解作为基础,解决了第一个挑战中提到的平台对任务结果只有汇总功能或只能通过模板处理单一任务的问题。Second, the system analyzes group intelligence tasks based on natural language processing related technologies, and performs fine-grained semantic analysis and modeling of tasks combined with discrete features. Accordingly, a bridge between the task and the system is established, that is, the task analysis and scheduling mechanism. Based on natural language interaction understanding, it solves the problem that the platform mentioned in the first challenge only has a summary function for task results or can only handle a single task through a template.
第三,操作系统抽象出了群智任务执行所需要的各类资源(用户资源,任务资源,系统资源)并对其进行了软件定义。进而建立起了系统资源图谱。为系统中各类资源的统一高效管理提供了基石。资源管理机制解决了第二个挑战。Third, the operating system abstracts all kinds of resources (user resources, task resources, system resources) required for the execution of group intelligence tasks and defines them in software. And then established a system resource map. It provides the cornerstone for the unified and efficient management of various resources in the system. The resource management mechanism solves the second challenge.
第四,针对任务结果质量问题,系统提出了评估方法和优化机制,即结果质量优化机制。该机制基于深度人机融合的思想,使用自然语言反馈交互并深浅层推理等方法,主要解决了结果稀疏和结果错误率高两类质量问题。该人机协同的任务结果优化机制解决了第三个挑战。Fourth, in view of the task result quality problem, the system proposes an evaluation method and optimization mechanism, that is, the result quality optimization mechanism. This mechanism is based on the idea of deep human-computer integration, using natural language feedback interaction and deep and shallow reasoning methods, which mainly solves the two types of quality problems of sparse results and high error rate of results. This man-machine coordination task result optimization mechanism solves the third challenge.
一种面向群智感知的泛在操作系统,如图1所示,为群智感知生态系统架构图,展示了任务执行过程和生命周期。面向群智感知的泛在操作系统架构包括a)服务器集群、b)智能终端、c)感知设备、d)传感器、e)通信网络、f)基础软件层、g)平台及应用软件、h)参与者及群智任务,其中a-f作为软件的载体或硬件基础设施支撑。面向群智感知的泛在操作系统中的生存期和流动如图1所示,任务发布者通过智能终端输入原始任务数据并提交至平台,平台捕获到任务后,对任务进行解析,并给该任务分配唯一的任务ID;当任务进入平台之后,进行任务解析并生成对应的任务特征向量,并和其他已知离散特征做特征拼接,通过任务向量提取出特性,所述特性包含但不限于任务种类、需要的参与者人数,任务执行地点,需用到的传感器和采集数据的类型;面向群智感知的泛在操作系统通过执行任务推理,关联及匹配操作,完成用户调度和任务分配过程;参与者接收到任务之后,选择感兴趣的任务执行并将采集到的感知数据或设计方案上传到面向群智感知的泛在操作系统中;当数据进入到面向群智感知的泛在操作系统后,对任务、用户、进程资源进行抽象和软件定义,根据任务特征信息描述,面向群智感知的泛在操作系统选择需要使用的任务中间件并对采集的数据进行汇总;最后将结果返回给任务发布者,发布者对任务结果进行评价和反馈,当发布者接收结果后,群智任务的生命周期结束。A ubiquitous operating system oriented to group intelligence perception, as shown in Figure 1, is an architecture diagram of the group intelligence perception ecosystem, showing the task execution process and life cycle. The ubiquitous operating system architecture for group intelligence perception includes a) server clusters, b) smart terminals, c) sensing devices, d) sensors, e) communication networks, f) basic software layer, g) platform and application software, h) Participants and group intelligence tasks, where af is used as a software carrier or hardware infrastructure support. The life cycle and flow of the ubiquitous operating system for group intelligence perception is shown in Figure 1. The task publisher inputs the original task data through the smart terminal and submits it to the platform. After the platform captures the task, it analyzes the task and gives it to the Tasks are assigned a unique task ID; when the task enters the platform, the task is analyzed and the corresponding task feature vector is generated, and the feature is spliced with other known discrete features, and the feature is extracted through the task vector. The feature includes but is not limited to the task Types, number of participants required, task execution location, required sensors and types of collected data; the ubiquitous operating system for group intelligence perception completes the user scheduling and task allocation process by performing task reasoning, association and matching operations; After the participant receives the task, he selects the task of interest to execute and uploads the collected perception data or design plan to the ubiquitous operating system for group intelligence perception; when the data enters the ubiquitous operating system for group intelligence perception , Abstract and software definition of tasks, users, and process resources. According to the description of task characteristic information, the ubiquitous operating system for group intelligence perception selects the task middleware that needs to be used and summarizes the collected data; finally returns the results to the task The publisher, the publisher evaluates and feedbacks the task results, when the publisher receives the results, the life cycle of the group intelligence task ends.
为了解决各类群智任务在群智生态系统中的流通和执行问题,本发明设计操作系统CrowdOS实现面向群智感知的泛在操作系统,CrowdOS核心架构和核心架构中包含的三个子系统框架如下:In order to solve the problem of circulation and execution of various group intelligence tasks in the group intelligence ecosystem, the present invention designs the operating system CrowdOS to implement a ubiquitous operating system oriented to group intelligence perception. The core architecture of CrowdOS and the three subsystem frameworks included in the core architecture are as follows :
图2是CrowdOS核心架构框图,所述面向群智感知的泛在操作系统采用操作系统CrowdOS实现,从图2中可以看出CrowdOS运行在原生操作系统和上层应用之间包含感知端和服务端,其中感知端(Sensing-end)的软件载体包含两类设备,第一类是具备人机交互功能的便携式智能感知设备,如智能手机、智能手表等;第二类是部署在物理世界中的固定传感器,它们无需直接与人交互,如汽车传感器、水质检测传感器、空气质量传感器等;CrowdOS采用云-边-端的部署方式;感知端部署在各类终端感知设备上,收集环境、商业、社会、人群等数据信息,服务端部署在云服务器或者边缘服务器上,对系统资源及数据资源进行综合管理并实时响应系统操作;其中部署在边缘服务器的系统软件通常是经过裁剪的,轻量级的,例如将冗余模块去除,只保留最核心的数据处理部分、可视化综合处理等功能依旧部署在云服务器上。Figure 2 is a block diagram of the core architecture of CrowdOS. The ubiquitous operating system for group intelligence perception is implemented by the operating system CrowdOS. It can be seen from Figure 2 that CrowdOS runs between the native operating system and upper-layer applications, including the perception end and the server end. Among them, the software carrier of the sensing end (Sensing-end) includes two types of devices. The first type is a portable intelligent sensing device with human-computer interaction functions, such as smart phones, smart watches, etc.; the second type is a fixed type deployed in the physical world. Sensors, they do not need to directly interact with people, such as car sensors, water quality detection sensors, air quality sensors, etc.; CrowdOS adopts a cloud-side-end deployment method; the sensing end is deployed on various terminal sensing devices to collect environment, business, society, For data information such as crowds, the server is deployed on cloud servers or edge servers to comprehensively manage system resources and data resources and respond to system operations in real time; the system software deployed on edge servers is usually tailored and lightweight. For example, the redundant modules are removed, and only the core data processing part, visual integrated processing and other functions are still deployed on the cloud server.
在感知端的功能层部分,首先发布者通过智能终端的交互功能将任务上传到基于CrowdOS的系统中,参与者通过感知端浏览并执行已发布到系统中的任务;当参与者接收任务,感知端则开始进行数据采集;系统支持层获取当前设备状态,然后对获取到的传感器数据统一进行接口封装并统一数据传输格式,再通过网络或蓝牙传输方式将数据存入对应的数据结构体中;对于无需人机交互的传感设备,一旦该设备被系统中的任务激活且设备通过了任务验证,则该设备开始自动按照预定规则收集并上传感知数据;In the functional layer part of the perception terminal, first the publisher uploads the task to the CrowdOS-based system through the interactive function of the smart terminal, and the participant browses through the perception terminal and executes the tasks that have been published in the system; when the participant receives the task, the perception terminal Data collection begins; the system support layer obtains the current device status, then uniformly interface encapsulates the acquired sensor data and unifies the data transmission format, and then stores the data in the corresponding data structure through the network or Bluetooth transmission; A sensing device that does not require human-computer interaction. Once the device is activated by a task in the system and the device passes the task verification, the device will automatically collect and upload perception data according to predetermined rules;
服务端(Server-end)提供了综合管理服务,部署在服务器集群,云服务器或者边缘服务器之上,共包括八个模块,属于操作系统的核心处理框架;首先当任务发布者从感知端将任务数据传输到服务端后,服务端通过任务池模块(Task Pool模块)对任务进行理解和表示,通过任务池模块对接收到的任务进行解析、调度、分配以及微调,并将群智任务分配给平台中的用户;在资源管理模块(Resource management)中,对各种资源进行软件定义并完成设备(Devices)、用户(Users)、环境(Environment)以及任务进程调度(Process)的综合管理;然后进入数据管理中心模块(DM center),该模块提供了海量异构数据的分类存储以及快速检索功能,联合存储和检索模块(Storage and query)对通过感知端采集到的数据进行存储和处理,萃取有用的信息传输给知识库模块(Knowledge base);对于采集完成的数据,通过任务结果质 量优化(TRO)模块进行数据质量优化,优化完成后服务端再将最终结果通网络反馈给感知端的任务发布者,从而完成整个任务消息在感知端和服务端之间的流通。另外,系统插件模块(System Plugin)为用户提供了丰富的系统特色功能,包括隐私保护、信用评估及用户激励;内外部接口模块(I&E interface)中,内部接口是为了方便CrowdOS开发人员对操作系统进行维护和更新功能,外部接口则是提供给第三方应用的开发者,便于他们调用软件接口进而基于CrowdOS架构开发他们的个性化群智感知应用系统。The server-end provides comprehensive management services, which are deployed on server clusters, cloud servers or edge servers. It includes a total of eight modules and belongs to the core processing framework of the operating system. After the data is transmitted to the server, the server understands and expresses the task through the task pool module (Task Pool module), analyzes, schedules, allocates, and fine-tunes the received tasks through the task pool module, and assigns the group intelligence task to Users in the platform; in the resource management module (Resource management), software defines various resources and completes the comprehensive management of devices, users, environment, and task process scheduling (Process); then Enter the data management center module (DM center), which provides classified storage and fast retrieval functions for massive amounts of heterogeneous data. The joint storage and retrieval module (Storage and query) stores and processes the data collected through the sensing end, and extracts Useful information is transmitted to the knowledge base module; for the collected data, the task result quality optimization (TRO) module is used to optimize the data quality. After the optimization is completed, the server will send the final result back to the sensory side through the network for task release In order to complete the flow of the entire task message between the perception end and the server end. In addition, the system plug-in module (System Plugin) provides users with a wealth of system features, including privacy protection, credit evaluation, and user incentives; in the internal and external interface module (I&E interface), the internal interface is to facilitate CrowdOS developers’ access to the operating system For maintenance and update functions, the external interface is provided to third-party application developers, so that they can call the software interface and develop their personalized swarm perception application system based on the CrowdOS architecture.
在CrowdOS架构实现过程中,CrowdOS对面向群智感知的泛在操作系统中的各类实体及虚拟资源进行抽象,完成了软件定义和表达,用符号和符号之间的关系表示了实体及虚拟资源;在任务进入系统后构建了五个动态Agent生成系统资源图谱,来管理系统中的任务和资源,Agent的构建和资源图谱的生成管理在图2中的任务池及资源管理模块中完成,Agents包括任务智能体Task-agent(TA),用户智能体User-agent(UA),设备智能体Deviceagent(DA),环境智能体Environment-agent(EA)和进程智能体Process-agent(PA)。In the process of implementing the CrowdOS architecture, CrowdOS abstracted various entities and virtual resources in the ubiquitous operating system for group intelligence, completed the software definition and expression, and expressed the entity and virtual resources with the relationship between symbols and symbols. ; After the task enters the system, five dynamic Agents are constructed to generate system resource maps to manage the tasks and resources in the system. The construction of Agents and the generation and management of resource maps are completed in the task pool and resource management module in Figure 2. Agents Including task agent Task-agent (TA), user agent User-agent (UA), device agent (DA), environment agent Environment-agent (EA) and process agent Process-agent (PA).
如图3所示,通过五个agents定义了系统中的所有资源。其中TA包含了每个任务的详细信息,包括但不限于任务种类,执行时间,地点,收集的数据格式,UA是对系统中用户的抽象描述,记录了此用户的信息,包括但不限于发布和执行过的任务,信用等级;DA是对终端设备资源的描述,记录了该设备的类型,当前状态信息;EA抽象了当前系统本身所处的软硬件环境资源,包括但不限于当前CPU、内存、存储使用情况以及系统中用户数目及可用设备总量;PA管理了当前系统中存在的所有任务进程,包括但不限于进程状态、优先级、调度策略;Agents在获取到任务信息之后通过内部结构中定义的信号量完成互相之间的实时交互和更新。As shown in Figure 3, all resources in the system are defined by five agents. TA contains detailed information of each task, including but not limited to task type, execution time, location, and data format collected. UA is an abstract description of the user in the system and records the user's information, including but not limited to release And the tasks performed, the credit rating; DA is the description of the terminal device resources, recording the type of the device, the current status information; EA abstracts the software and hardware environment resources of the current system itself, including but not limited to the current CPU, Memory, storage usage, and the number of users in the system and the total amount of available equipment; PA manages all task processes in the current system, including but not limited to process status, priority, and scheduling strategy; Agents pass internally after obtaining task information The semaphore defined in the structure completes the real-time interaction and update between each other.
图4任务解析及调度框架图是对图2中Task Pool任务池部分的详细设计,解决了本发明提出的第一个挑战。分为两个步骤:第一,深入理解群智任务,细粒度提取出任务的共性及差异性;第二,系统需将任务合理分配给参与者,确保在最短时间、最低的能源消耗条件下完成信息收集过程。Fig. 4 task analysis and scheduling framework diagram is a detailed design of the task pool part of Task Pool in Fig. 2, which solves the first challenge proposed by the present invention. It is divided into two steps: first, deep understanding of group intelligence tasks, and fine-grained extraction of commonalities and differences of tasks; second, the system needs to allocate tasks to participants reasonably to ensure the shortest time and lowest energy consumption conditions Complete the information gathering process.
如图4所示,上方标注的A任务信息萃取部分所示,当群智任务进入系统,系统首先对任务进行语义解析和特征提取,系统将对接收的任务进行自然语言分析,对于通过中文、英文等语言描述的任务,系统进行分词处理,然后系统执行词性标注、命名实体识别、关键词提取这些不区分语种的操作,最终提取出任务关键信息,包括但不限于任务的执行方式、地点、时间、参与者数目;系统进一步对提取出的任务信息和通过规则点击选取而获取到的离散特征进行拼接,将拼接完的特征输入到一个深度神经网络中进行统一编码,输出一个高维的任务中间向量;最后通过解码将向量映射到该任务的Task-agent,完成图4下方标注的P1,任务向量到agents的转换过程。As shown in Figure 4, as shown in the A task information extraction part marked above, when the group intelligence task enters the system, the system first performs semantic analysis and feature extraction on the task. The system will perform natural language analysis on the received task. For tasks described in languages such as English, the system performs word segmentation, and then the system performs language-insensitive operations such as part-of-speech tagging, named entity recognition, and keyword extraction, and finally extracts key information of the task, including but not limited to the task execution method, location, Time, the number of participants; the system further splices the extracted task information and the discrete features obtained through regular click selection, and inputs the spliced features into a deep neural network for unified coding, and outputs a high-dimensional task The intermediate vector; finally, the vector is mapped to the task-agent of the task through decoding, and the conversion process from the task vector to the agent P1 marked below in Figure 4 is completed.
如图4上方标注的B Agents生成部分,包含5个Agents:Task_agent、Environment_agent、Process_agent、Device_agent、User_agent,以Task-agent的结构为例展开具体定义的说明;TA包含任务全部的共性及个性信息,是通过上一步任务解析和表示过程得到的;其中,taskID是任务在系统中的唯一标识符;process_state表示了该任务进程的当前状态,是处于生成态、执行态还是反馈态等;该状态协助Process_agent进行任务进程管理,Prio代表了该任务的优先级,取值为0-15,系统根据优先级顺序进行任务进程调度;taskInfo是一个结构体,包含任务详细信息,如任务时间、地点、向量表示等;Classification代表了该任务所属的类别,如数据标注类、传感信息采集类、问卷回答类等;Topic表示了该任务的主题,可以从关键词信息中提取,如音频采集、照片收集等;deviceNum、deviceInfo、deviceID分别代表了参与执行该任务的设备数目、设备详细信息以及设备ID列表;Sensing_Data是收集到的任务数据集指针,指向存储数据的立方体地址。The B Agents generation part marked at the top of Figure 4 contains 5 Agents: Task_agent, Environment_agent, Process_agent, Device_agent, and User_agent. Take the structure of Task-agent as an example to expand the specific definition description; TA contains all the common and individual information of the task, It is obtained through the process of task analysis and presentation in the previous step; among them, taskID is the unique identifier of the task in the system; process_state indicates the current state of the task process, whether it is in the generation state, the execution state or the feedback state; this state assists Process_agent manages the task process. Prio represents the priority of the task, with a value of 0-15. The system schedules the task process according to the priority order; taskInfo is a structure that contains task detailed information, such as task time, location, and vector Representation, etc.; Classification represents the category to which the task belongs, such as data labeling, sensor information collection, questionnaire answering, etc.; Topic represents the topic of the task, which can be extracted from keyword information, such as audio collection, photo collection Etc.; deviceNum, deviceInfo, and deviceID respectively represent the number of devices participating in the task, detailed device information, and a list of device IDs; Sensing_Data is a pointer to the collected task data set, pointing to the address of the cube where the data is stored.
如图4上方标注的C调度和分配部分所示,任务调度子框架中包含了策略库、映射模型及策略管理模块;系统首先对任务资源图谱的内容进行分析和推理,将任务ID映射到策略库中的 具体策略,其中策略库中存放了常用的任务分配函数及其编号,从而完成任务分配策略选取过程;然后系统根据选取的分配策略对设备和用户执行调度操作,并将任务推送给合适的用户。As shown in the C scheduling and allocation part marked at the top of Figure 4, the task scheduling subframe includes a strategy library, a mapping model, and a strategy management module; the system first analyzes and infers the content of the task resource map, and maps the task ID to the strategy The specific strategies in the library, where commonly used task allocation functions and their numbers are stored in the strategy library, so as to complete the task allocation strategy selection process; then the system performs scheduling operations on devices and users according to the selected allocation strategy, and pushes tasks to the appropriate User.
资源管理框架是图2中资源管理模块Resource Management的具体实现,资源管理的内容包括但不限于用户、感知终端、系统环境、任务进程、系统软件、任务数据以及知识库。将CrowdOS的管理对象抽象为4类,下面说明系统是如何对四类对象进行管理,以下四部分都属于资源管理的一部分,且管理的四部分内容根据任务进入系统的不同时期顺序执行。The resource management framework is the specific implementation of the Resource Management module Resource Management in Figure 2. The content of resource management includes but is not limited to users, sensing terminals, system environments, task processes, system software, task data, and knowledge bases. The management objects of CrowdOS are abstracted into four categories. The following describes how the system manages the four types of objects. The following four parts are part of resource management, and the four parts of management are executed in order according to the different periods when tasks enter the system.
1)设备、用户及环境管理1) Equipment, user and environmental management
当感知端通过网络接入系统时会触发信号,设备自动向系统发送当前的状态信息,包括但不限于设备类型、剩余电量、位置信息、存储占用率,基于CrowdOS框架开发的系统通过Device-agent智能体捕获和存储状态信息;When the sensing terminal accesses the system through the network, it will trigger a signal, and the device will automatically send current status information to the system, including but not limited to device type, remaining power, location information, and storage occupancy. The system developed based on the CrowdOS framework passes Device-agent The agent captures and stores state information;
系统通过User-agent描绘用户(任务参与者和发布者)画像,用户依托感知端与系统进行交互,例如通过智能手机应用交互界面来发布任务。User-agent保存用户姓名,年龄,参与过的任务,同时会根据用户参与任务的情况生成用户信用等级,用户偏好,兴趣领域及其他个性化信息。The system portrays the user (task participant and publisher) portrait through User-agent, and the user interacts with the system by relying on the sensing terminal, for example, to publish tasks through the smart phone application interactive interface. User-agent saves the user's name, age, and tasks that have participated in it. At the same time, it generates user credit ratings, user preferences, areas of interest, and other personalized information based on the user's participation in tasks.
环境资源记录服务器架构和处理能力,例如,集中式、分布式或边缘式部署架构,系统中CPU数目,系统CPU占用率,内存使用率,可用磁盘存储空间,系统访问量。这些资源被存储于Environment-agent中并且定时更新,以确保系统获取到最新的数据;该agent具备报警功能,根据当前系统状态和任务量增减情况做预判,如果系统CPU利用率或存储占用率达到了额定阈值,系统会给出警报。Environmental resource records server architecture and processing capabilities, for example, centralized, distributed, or edge deployment architecture, the number of CPUs in the system, system CPU occupancy, memory usage, available disk storage space, and system access. These resources are stored in the Environment-agent and updated regularly to ensure that the system obtains the latest data; the agent has an alarm function, and makes predictions based on the current system status and the increase or decrease of the task volume. If the system CPU utilization or storage is occupied If the rate reaches the rated threshold, the system will give an alarm.
2)任务进程调度管理2) Task process scheduling management
该部分是对图2中资源管理模块中进程(Process)管理的详细介绍;Process-agent(PA)是任务当前阶段状态信息的集合,系统给每个任务分配了进程识别号作为系统中任务进程存在的唯一标志;这个ID同时存储于Task-agent及Process-agent中,伴随任务的整个生命周期。PA类里面包含了丰富的任务进程信息,包含但不限于TPID:进程唯一标识符;Process-state:当前任务在系统中所处的状态,有七种可切换状态;Process-strategy:进程调度策略,包含先到先服务(FCFS)、循环法(RB)、任务优先级、最高响应比优先(HRRN)、反馈优先;process_prio代表进程优先级,从0-15,0代表最高优先级,依次递减。This part is a detailed introduction to the process (Process) management in the resource management module in Figure 2; Process-agent (PA) is a collection of task current stage status information, and the system assigns a process identification number to each task as the task process in the system The only sign of existence; this ID is stored in both Task-agent and Process-agent, and accompanies the entire life cycle of the task. The PA class contains a wealth of task process information, including but not limited to TPID: process unique identifier; Process-state: the state of the current task in the system, there are seven switchable states; Process-strategy: process scheduling strategy , Including first-come, first-served (FCFS), round-robin (RB), task priority, highest response ratio priority (HRRN), feedback priority; process_prio represents the process priority, from 0-15, 0 represents the highest priority, in descending order .
任务进程状态包含创建态、生成态、分配态、执行态、处理态、反馈态及终止态七种状态;状态之间的流转关系如图5所示,首先,当用户在系统中发布任务时首先是任务创建态;通过系统认证和解析后进入任务生成态;通过任务调度和分配后进入分配态;发布完成后参与者可对任务进行执行,进而进入执行态;参与者将采集到的数据上传到系统中进行处理,进入处理态;经过数据可视化呈现把结果汇总给任务发布者,进入反馈态;当任务结果未通过发布者接收验证时,系统会暂时停留在反馈态,进而经过推理和修正,任务进程再次回到生成态、分配态或处理态,然后顺序执行;通过发布者验证任务提交结果后,则任务进入终止态,整个任务生命周期结束。图5中状态上方的文字代表执行该状态的主体,是发布者,系统或者参与者。箭头上面的文字表示从一个状态转到另一个状态需要的操作。The task process state includes seven states: creation state, generation state, allocation state, execution state, processing state, feedback state and termination state; the flow relationship between the states is shown in Figure 5. First, when a user publishes a task in the system The first is the task creation state; after system authentication and analysis, it enters the task generation state; after task scheduling and assignment, it enters the distribution state; after the release is completed, participants can execute the task and then enter the execution state; the data collected by the participants Upload to the system for processing and enter the processing state; after the data is visualized, the results are summarized to the task publisher and enter the feedback state; when the task result fails to pass the publisher’s acceptance verification, the system will temporarily stay in the feedback state, and then go through reasoning and Correction, the task process returns to the generation state, the allocation state or the processing state again, and then executes sequentially; after the task submission result is verified by the publisher, the task enters the termination state, and the entire task life cycle ends. The text above the state in Figure 5 represents the subject that executes the state, which is the publisher, system, or participant. The text above the arrow indicates the operations required to go from one state to another.
系统中的任务进程调度算法从以下方法中选择一个:The task process scheduling algorithm in the system selects one of the following methods:
1)先到先服务(FCFS):优先处理先进入系统的任务,为其提供资源和服务;1) First-come, first-served (FCFS): Prioritize tasks that enter the system first, and provide resources and services for them;
2)循环法:以一个周期性间隔产生任务中断,将当前正在运行的进程置于任务就绪队列,基于FCFS选择下一个就绪进程运行;2) Circular method: Generate task interrupts at a periodic interval, place the currently running process in the task ready queue, and select the next ready process to run based on FCFS;
3)任务优先级:优先处理高级别的任务,同一优先级的任务按照FCFS原则;3) Task priority: Prioritize high-level tasks, and tasks with the same priority shall follow the FCFS principle;
4)最高响应比优先(HRRN),R=(w+s)/s,其中R表示响应比,w表示已经等待的时间,s表示期待被服务的时间;4) The highest response ratio priority (HRRN), R=(w+s)/s, where R represents the response ratio, w represents the waiting time, and s represents the time expected to be served;
5)反馈优先:对于进入反馈态的任务,在原始优先级基础上上调两级,优先使用系统资 源。调度算法的选择根据智能体中的调度算法标志位确定。5) Feedback priority: For tasks that enter the feedback state, the original priority is adjusted up to two levels, and system resources are used first. The selection of the scheduling algorithm is determined according to the scheduling algorithm flag in the agent.
3)异构多模态数据资源管理3) Heterogeneous multi-modal data resource management
系统管理两类数据资源,一是任务本身携带的数据,称之为原始数据(RD)。二是参与者在执行任务过程中上传到系统的新数据(ND)。RD通常包括描述任务所用到的文本,图像以及需要被打标签的数据集。ND参与者上传的各类感知数据,包括但不限于文本描述、传感器数据、统计图表、设计文档以及打完标签的数据。The system manages two types of data resources. One is the data carried by the task itself, which is called raw data (RD). The second is the new data (ND) that participants upload to the system during the execution of the task. RD usually includes text, images and data sets that need to be labeled to describe the task. Various perception data uploaded by ND participants, including but not limited to text descriptions, sensor data, statistical charts, design documents, and tagged data.
对于多模态数据的管理,步骤如下:For the management of multi-modal data, the steps are as follows:
1)需收集和存储数据;1) Data needs to be collected and stored;
2)构建基于群智的非结构化数据的检索方法,一旦数据准备完成,系统则可以开始构建数据堆栈;2) Construct an unstructured data retrieval method based on Qunzhi. Once the data preparation is completed, the system can start to build the data stack;
3)本发明采用数据立方体技术来管理和存储任务数据,构造多特征立方体结构(MC),依据构建的数据立方体来检索非结构化数据;3) The present invention uses data cube technology to manage and store task data, construct a multi-feature cube structure (MC), and retrieve unstructured data based on the constructed data cube;
4)随着系统中任务量的增长,部分已完成的任务数据和中间数据会被定期清理,而一些经过深层分析的数据则会被转移到知识库中进行管理。4) As the amount of tasks in the system grows, part of the completed task data and intermediate data will be regularly cleaned up, and some data that has undergone in-depth analysis will be transferred to the knowledge base for management.
4)知识库管理4) Knowledge base management
首先,系统从现有知识中分辨出能够改善系统性能、更新模型或提升任务结果质量的知识,对用户或第三方有用的信息或知识不包括在此处的知识范围内;其次,对于待被发现的知识,系统提供对应的的机制或算法对其进行挖掘,包括但不限于深度学习算法、在线更新算法、迁移学习算法;第三,知识库根据知识的种类,形式和抽象层级,对知识进行归纳和数据库存储。知识并不是集中存储于系统中的某个管理列表或模块中,它分布式存在于系统的各个模块或列表中。知识库主要记录了知识地址,知识之间内在的关系,并根据这些建立起知识网络。First, the system distinguishes from existing knowledge that can improve system performance, update models, or improve the quality of task results. Information or knowledge useful to users or third parties is not included in the scope of knowledge here; The system provides corresponding mechanisms or algorithms to mine the discovered knowledge, including but not limited to deep learning algorithms, online update algorithms, and migration learning algorithms. Third, the knowledge base can extract knowledge based on the type, form, and abstraction level of the knowledge. Perform induction and database storage. Knowledge is not centrally stored in a certain management list or module in the system, it is distributed in various modules or lists of the system. The knowledge base mainly records knowledge addresses and internal relationships between knowledge, and establishes a knowledge network based on these.
系统中的知识分为两类,一类是现存的知识,另一类是从任务或数据中新挖掘的知识。现有知识包括事先在系统中定义好的专家策略、决策规则或网络模型。包括但不限于策略库中已存在的任务分配策略,反馈机制中的的推理树。新知识挖掘则是从数据集或现有知识中识别出有效的、新颖的、潜在有用的以及可解释的内容,方法,模型。CrowdOS中的新知识是在原始知识基础上的扩展。包含决策方法的改进,规则的增加或者网络模型的更新,不包含无规则爆炸式扩张的知识。The knowledge in the system is divided into two categories, one is the existing knowledge, and the other is the knowledge newly mined from tasks or data. Existing knowledge includes expert strategies, decision rules or network models defined in the system in advance. Including but not limited to the existing task allocation strategies in the strategy library, and the reasoning tree in the feedback mechanism. New knowledge mining is to identify effective, novel, potentially useful and interpretable content, methods, and models from data sets or existing knowledge. The new knowledge in CrowdOS is an extension of the original knowledge. It includes the improvement of decision-making methods, the addition of rules or the update of the network model, and does not include the knowledge of irregular and explosive expansion.
知识库根据知识的种类,形式和层级,对这些知识进行归纳和存储。知识并未集中存储于系统中的某个管理列表或模块中,它分布式存在于系统的各个模块或列表中。知识库记录了知识地址,知识之间内在关联,以及知识网络拓扑。The knowledge base summarizes and stores this knowledge according to the type, form and level of knowledge. Knowledge is not centrally stored in a certain management list or module in the system, it is distributed in various modules or lists of the system. The knowledge base records the knowledge address, the internal relationship between knowledge, and the knowledge network topology.
所述结果质量优化模块,包含交互层,推理层以及执行层;首先,在交互层中发布者通过输入评价信息或点击人机界面上的按钮来评估任务结果,对多样化的评价内容进行分析;如果评价显示接受或者满意,执行结束指令,该任务进行被终;若通过分析发现结果存在质量问题则进入下一层;其次,进入推理层,对发布者的反馈内容执行关键信息抽取和深度分析操作,推理出现质量问题的可能原因,然后根据系统建立的推理模型,将原因映射到问题编码库中,找到对应的错误编码;第三,进入执行层,会将问题编码与对应的内部操作进行映射;修正完毕后,任务进入新的进程状态,通过各种途径修正的任务结果,将再次进入交互层,反馈给发布者,等待交互层给出新的评估结果,整个优化过程按步骤依次执行并形成一个闭环,直到发布者对任务结果满意则终止。The result quality optimization module includes an interaction layer, a reasoning layer, and an execution layer; firstly, in the interaction layer, the publisher evaluates the task results by inputting evaluation information or clicking buttons on the man-machine interface, and analyzes the diversified evaluation content ; If the evaluation shows acceptance or satisfaction, the end instruction is executed, and the task is terminated; if the result is found to have quality problems through analysis, then enter the next layer; secondly, enter the reasoning layer, and perform key information extraction and depth on the publisher’s feedback content Analyze the operation, infer the possible causes of the quality problem, and then map the reason to the problem code library according to the inference model established by the system to find the corresponding error code; third, enter the execution layer, and the problem code will be coded with the corresponding internal operation After the correction is completed, the task enters a new process state. The task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results. The entire optimization process is step by step Execute and form a closed loop, and terminate until the publisher is satisfied with the task result.
结果质量优化框架是图2服务端中的TRO framework模块。为了解决任务结果质量未达到发布需求的问题,TRO中包含了基于人机协同交互的深层反馈框架(DFHMI)来模仿人类思考问题的过程,依靠分析和推理来解决问题。任务质量问题分为两类。第一是结果数量稀疏,第二是结果错误率超标。本发明所使用优化方法基于系统中的5Agents,便于用户和系统进行更深层次的交互,具体实现在图6中进行了详细说明。The result quality optimization framework is the TRO framework module in the server side of Figure 2. In order to solve the problem that the quality of task results does not meet the release requirements, TRO includes a deep feedback framework based on human-computer interaction (DFHMI) to imitate the process of human thinking about problems, and rely on analysis and reasoning to solve problems. Task quality issues are divided into two categories. The first is that the number of results is sparse, and the second is that the error rate of results exceeds the standard. The optimization method used in the present invention is based on the 5Agents in the system, which is convenient for the user to interact with the system at a deeper level. The specific implementation is described in detail in FIG. 6.
图6是结果质量优化框架图,包含交互层,推理层以及执行层。首先,在交互层中发布者通过输入评价信息或点击人机界面上的按钮来评估任务结果,系统负责对多样化的评价内容进行分析;如果评价显示接受或者满意,系统会执行结束指令,该任务进行被终;若通过分析发现结果存在质量问题则进入下一层;其次,进入推理层,系统会对发布者的反馈内容执行关键信息抽取和深度分析操作,推理出现质量问题的可能原因,然后根据系统建立的推理模型,将原因映射到问题编码库中,找到对应的错误编码;第三,进入执行层,在该层系统会将问题编码与对应的内部操作进行映射,系统在最初已经定义好了大部分映射机制。这些操作通过修正Agent中的数值来实现,也包括其他类型的操作。修正完毕后,任务会进入新的进程状态。通过各种途径修正的任务结果,将再次进入交互层,反馈给发布者,等待交互层给出新的评估结果。整个优化过程按步骤依次执行并形成一个闭环,直到发布者对任务结果满意则终止。Figure 6 is the result quality optimization framework diagram, including the interaction layer, the reasoning layer and the execution layer. First, in the interactive layer, the publisher evaluates the task result by inputting evaluation information or clicking the button on the human-machine interface. The system is responsible for analyzing the diversified evaluation content; if the evaluation shows acceptance or satisfaction, the system will execute the end instruction. The task is terminated; if the result of the analysis is found to have quality problems, it will enter the next layer; secondly, enter the reasoning layer, the system will perform key information extraction and in-depth analysis operations on the publisher’s feedback content to reason about the possible causes of the quality problems. Then, according to the reasoning model established by the system, the reason is mapped to the problem code library, and the corresponding error code is found; third, enter the execution layer, where the system will map the problem code and the corresponding internal operation. The system has already been Most of the mapping mechanisms are defined. These operations are implemented by modifying the values in the Agent, and other types of operations are also included. After the correction is completed, the task will enter a new process state. The task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results. The entire optimization process is executed step by step and forms a closed loop until the publisher is satisfied with the task result.
推理层和执行层之间是通过推理决策树和系统更新操作库之间的映射表(RSMT)来连接的。RSMT的工作流程如下:The reasoning layer and the execution layer are connected through the mapping table (RSMT) between the reasoning decision tree and the system update operation library. The working process of RSMT is as follows:
首先,找到结果稀疏的原因,对于易被人们所理解的或可从反馈建议中直接获取到称之为浅层原因。而深层原因通常不能被系统直接获取到,它们需要通过结合任务相关信息联合分析而得到。First, find the reason for the sparse results. For those that are easily understood by people or can be directly obtained from feedback suggestions, they are called shallow reasons. The deep reasons are usually not directly obtained by the system, and they need to be obtained through joint analysis combined with task-related information.
其次,为问题原因建立一个树。初期基于决策树的模型来探索,数据稀疏问题作为树的根节点,第一层节点代表浅层原因,新出现的浅层原因可作为根节点的孩子结点加入,所有浅层原因互为兄弟节点。深层原因通常建立在浅层原因的基础上,是节点在纵向上的延伸,称之为深层节点。第三,随着问题规模的增加,为了降低检索时间,系统定期对树进行增枝、剪枝等更新操作,确保该决策树维持在一定规模范围内。Second, build a tree for the cause of the problem. The initial exploration is based on the decision tree model. The data sparseness problem is regarded as the root node of the tree. The first layer node represents the shallow cause, and the new shallow cause can be added as a child node of the root node. All the shallow causes are brothers to each other. node. The deep cause is usually established on the basis of the shallow cause, which is the extension of the node in the vertical direction, which is called the deep node. Third, as the scale of the problem increases, in order to reduce the retrieval time, the system regularly updates the tree such as branching and pruning to ensure that the decision tree is maintained within a certain scale.
实施例:Examples:
1.实验设置1. Experimental settings
本发明开发了WeSense用例,用户可以在上面发布并执行各种传感任务,例如道路拥堵信息收集,空气质量状况监视,产品价格研究等。在实施WeSense期间,将基于两种开发方法对E v1和Ev2进行评估:M1(独立开发)和M2(基于CrowdOS接口的开发)。Ev3是对结果质量优化机制(TRO)的评估,而Ev4是由CrowdOS支持的WeSense的整体性能和压力测试。表1对开发及测试环境进行了说明。在Ev1和Ev2中,比较并分析了在M1和M2条件下完成相关功能模块所需的时间,然后将要测试的功能和模块F{fi}分为五个部分(Fnum=5)。The present invention develops WeSense use cases on which users can publish and perform various sensing tasks, such as road congestion information collection, air quality status monitoring, product price research, etc. During the implementation of WeSense, Ev1 and Ev2 will be evaluated based on two development methods: M1 (independent development) and M2 (development based on the CrowdOS interface). Ev3 is an evaluation of the result quality optimization mechanism (TRO), while Ev4 is the overall performance and stress test of WeSense supported by CrowdOS. Table 1 describes the development and testing environment. In Ev1 and Ev2, the time required to complete the relevant function modules under the conditions of M1 and M2 is compared and analyzed, and then the function and module F{fi} to be tested are divided into five parts (Fnum=5).
f1:任务实时发布功能。f1: Task real-time release function.
f2:任务分配算法模块。f2: Task allocation algorithm module.
f3:隐私保护模块。f3: Privacy protection module.
f4:众包数据收集和上传功能。f4: Crowdsourcing data collection and upload function.
f5:结果质量优化功能。f5: The result quality optimization function.
预先设置实验环境并安装了相关软件。聘请了9位熟悉Java编程语言的志愿者(Λnum=9),并给了他们两个星期的时间来开发应用程序。首先花了25分钟(timec1)来介绍CrowdOS的功能和所有API。先将9名志愿者分配到A组中,然后再将他们切换到B组中,也就是说,每个志愿者在不同时间同时担任GA和GB成员,GA_num=GB_num=9。GA成员使用M1,而GB成员使用M2。每个志愿者都参加了所有测试,测试总数为(GA_num+GB_num)*Fnum=90。The experimental environment is set up in advance and related software is installed. We hired 9 volunteers who are familiar with the Java programming language (Λnum = 9), and gave them two weeks to develop applications. First, it took 25 minutes (timec1) to introduce the functions and all APIs of CrowdOS. First assign 9 volunteers to group A, and then switch them to group B, that is, each volunteer serves as GA and GB members at different times, GA_num=GB_num=9. GA members use M1, and GB members use M2. Each volunteer participated in all the tests, and the total number of tests was (GA_num+GB_num)*Fnum=90.
表1应用运行及开发环境Table 1 Application operation and development environment
Figure PCTCN2021087993-appb-000001
Figure PCTCN2021087993-appb-000001
四个评估指标的比较如下。The comparison of the four evaluation indicators is as follows.
E v1:M 1和M 2开发模式对比,比较完成F{f i}的时间消耗,以及所完成任务的完整性和正确性。 E v1 : Comparison of M 1 and M 2 development models, comparing the time consumed to complete F{f i }, as well as the completeness and correctness of the completed tasks.
E v2:对比G A和G B两组测试者完成F{f i}的效果和时间消耗。 E v2 : Compare the effect and time consumption of F{f i } between the two groups of G A and G B testers.
E v3:对比使用TRO框架与其他优化方法,优化效果及时间消耗的差异。 E v3 : Compare the difference in optimization effect and time consumption between using the TRO framework and other optimization methods.
E v4:系统稳定性及压力测试。 E v4 : System stability and stress test.
2有效性和效率评估2 Effectiveness and efficiency evaluation
经过检测,90项测试全部通过了正确性筛选,对所开发的WeSense用户界面进行简单介绍。如图7所示,图7(a)为主页,群智任务的显示和搜索;图7(b)为任务详细页面,可单击以查看感兴趣的任务;图7(c)为任务提交页面:提交完成的结果数据。After testing, all 90 tests passed the correctness screening, and a brief introduction to the developed WeSense user interface. As shown in Figure 7, Figure 7(a) is the homepage, showing and searching for group intelligence tasks; Figure 7(b) is the task detail page, you can click to view the task of interest; Figure 7(c) is the task submission Page: Submit the completed result data.
Figure PCTCN2021087993-appb-000002
Figure PCTCN2021087993-appb-000002
图8则是对开发周期的分析和比较,即效率评估。其中图8(a)显示了用G A和G B完成f 1-3测试所需的时间。图8(b)显示了基于M 1和M 2的所有测试的时间消耗比较。如图8(b)所示,根据公式(1),F{f i}的平均开发时间从原始DT GA(tc fi)={12.1,13.5,7.3,10.1,14.1}减少到DT GB(tc fi)={3.1,4.7、0.8、4.5、5.4}小时。 Figure 8 is the analysis and comparison of the development cycle, that is, the efficiency evaluation. Figure 8(a) shows the time required to complete the f 1-3 test with G A and G B. Figure 8(b) shows the time consumption comparison of all tests based on M 1 and M 2. As shown in Figure 8(b), according to formula (1), the average development time of F{f i } is reduced from the original DT GA (tc fi )={12.1, 13.5, 7.3, 10.1, 14.1} to DT GB (tc fi ) = {3.1, 4.7, 0.8, 4.5, 5.4} hours.
Figure PCTCN2021087993-appb-000003
Figure PCTCN2021087993-appb-000003
根据公式(2),f 1→f 5的整体开发效率(DE)提高了310%。 According to formula (2), the overall development efficiency (DE) of f 1 → f 5 is increased by 310%.
通过分析和验证测试f 2和f 3来证明其有效性和可用性。如果调用CrowdAPI后即使用M 2模式改善了性能,则意味着CrowdOS确实有效。如图9所示,是核心框架有效性评估,其中图9(a)中的任务是随机分配的,橙色圆圈是任务发布的地理范围;图9(b)中蓝色圆圈是使用基于位置的任务分配算法的效果;图9(c)中则比较两种方法的效果;图9(d)是选择超级隐私保护模式。 Through analysis and verification tests f 2 and f 3 to prove its effectiveness and usability. If the M 2 mode is used to improve performance after calling CrowdAPI, it means that CrowdOS is indeed effective. As shown in Figure 9, it is the evaluation of the effectiveness of the core framework. The tasks in Figure 9(a) are randomly assigned, and the orange circle is the geographic scope of the task release; the blue circle in Figure 9(b) uses location-based The effect of the task allocation algorithm; Figure 9(c) compares the effects of the two methods; Figure 9(d) is the selection of the super privacy protection mode.
调用CrowdAPI之后,开发时间的缩短反映了可用性。从图8(a)图中可以看出,与使用M 1相比,使用M 2的f 2和f 3的平均时间消耗减少了65.4%和88.7%。此外,随着算法库的扩展,CrowdOS的优势得到了突出。使用它不仅可以大大减少每个功能模块的开发时间,而且可以提高整体可视化效果和程序可读性。 After calling CrowdAPI, the shortened development time reflects the availability. It can be seen from Fig. 8(a) that compared with using M 1 , the average time consumption of f 2 and f 3 using M 2 is reduced by 65.4% and 88.7%. In addition, with the expansion of the algorithm library, the advantages of CrowdOS have been highlighted. Using it can not only greatly reduce the development time of each functional module, but also improve the overall visualization effect and program readability.
3结果质量优化机制评估3 Evaluation of result quality optimization mechanism
通过仿真实验,评估了TRO框架里应用程序接口的正确性和优化效果。Through simulation experiments, the correctness and optimization effect of the application program interface in the TRO framework are evaluated.
以数据格式错误问题为例。对于收集的数据D(n*V),其中n表示参与者人数,V代表每个参与者贡献的数据量。通过TRO框架校正数据格式所花费的时间为T iB=T iζB+T iηB,其中T iζB是界面操作的时间(T iζB<6min),界面效果如图10(a)所示。T iηB是每个参与者花费在纠正格式和重新提交数据上的时间,即T iηB∝V。如果没有TRO框架,则发布者纠正所有数据格式所需的时间是T iA。T iA=T iζA+T iηA,其中T iζA是校正数据格式之前的准备时间(T iζA<30min),T iηA是处理数据格式所花费的时间,T iηA∝n*V。 Take the problem of data format errors as an example. For the collected data D(n*V), where n represents the number of participants, and V represents the amount of data contributed by each participant. The time taken to correct the data format through the TRO framework is T iB =T iζB +T iηB , where T iζB is the time of interface operation (T iζB <6min), and the interface effect is shown in Figure 10(a). T iηB is the time each participant spends on correcting the format and resubmitting the data, that is, T iηB ∝V . If there is no TRO framework, the time required for the publisher to correct all data formats is T iA . T iA = T iζA + T iηA , where T iζA is the preparation time before correcting the data format (T iζA <30 min), T iηA is the time spent processing the data format, T iηA ∝n*V.
Figure PCTCN2021087993-appb-000004
Figure PCTCN2021087993-appb-000004
公式(3)给出了通过两种优化方法来处理数据格式问题所花费的时间。图10(c)则展示了A和B曲线之间的距离随着数据量D(n*V)而变化的情况。即优化方法和时间消耗的对比。其中图10(a)是数据格式校正请求界面;而图10(b)是参与者收到的纠正提示消息;图10(c)则比较了随着参与者人数的增加,两种优化方法的时间消耗。The formula (3) gives the time spent to deal with the data format problem through two optimization methods. Figure 10(c) shows how the distance between the A and B curves varies with the amount of data D(n*V). That is, the comparison of optimization methods and time consumption. Figure 10(a) is the data format correction request interface; Figure 10(b) is the correction prompt message received by the participants; Figure 10(c) compares the two optimization methods as the number of participants increases. the time consumption.
任务结果和任务需求之间的相关性可以通过多种方式反映出来,例如当需求是以视频的形式提供,但是参与者却提交了图像;或者上传数据的实际位置与任务要求中的位置不匹配。优化机制则会重新筛选具有信用度高的用户,并根据特定信息(例如地理位置)更新任务特征,然后将任务及反馈信息重新推送给原始参与者,如图10(b)所示。The correlation between task results and task requirements can be reflected in many ways, for example, when the requirements are provided in the form of video, but participants submit images; or the actual location of the uploaded data does not match the location in the task requirements . The optimization mechanism will re-screen users with high creditworthiness, update task features based on specific information (such as geographic location), and then re-push the task and feedback information to the original participants, as shown in Figure 10(b).
可以看出,TRO框架的使用不仅避免了因大量处理而导致的能耗,而且还适用于各种类型的任务。该框架使用分段和集成的思想将不同阶段的工作正确地移交给人或机器。如图10(c)所示,与其他优化方法相比,TRO的时间消耗相对稳定,并且随着参与者数量的增加而不会显着增加。同时,许多优化问题更适合通过TRO解决,与纯机器优化相比,可以大大减少资源消耗。It can be seen that the use of the TRO framework not only avoids the energy consumption caused by a large amount of processing, but is also suitable for various types of tasks. The framework uses the idea of segmentation and integration to correctly transfer the work of different stages to humans or machines. As shown in Figure 10(c), compared with other optimization methods, the time consumption of TRO is relatively stable and does not increase significantly as the number of participants increases. At the same time, many optimization problems are more suitable to be solved by TRO, which can greatly reduce resource consumption compared with pure machine optimization.
4.性能、负载及压力测试4. Performance, load and stress testing
从以下两个方面对WeSense的感知端和服务器端进行了全面测试。The sensory and server side of WeSense have been fully tested from the following two aspects.
性能和负载测试。依次加载了不同规模的任务,并测量了系统响应时间,CPU和内存使用率以及传感端的能耗。对每个测试运行了十次,并在应用程序运行时使用AndroidStudio的配置文件性能分析器实时监控数据。结果如图11所示。尽管任务数量增加了,但是系统响应时间基本上在0.22s之内,CPU和内存使用率也保持在3%-6%和0.87%-1.14%的范围内。能耗基本上保持在最低水平以下(L1:轻),这表明系统运转良好。Performance and load testing. Tasks of different scales were loaded sequentially, and the system response time, CPU and memory usage rate, and energy consumption of the sensor were measured. Run each test ten times, and use AndroidStudio's profile performance analyzer to monitor the data in real time while the application is running. The result is shown in Figure 11. Although the number of tasks has increased, the system response time is basically within 0.22s, and the CPU and memory usage rates also remain within the range of 3%-6% and 0.87%-1.14%. The energy consumption is basically kept below the minimum level (L1: light), which indicates that the system is functioning well.
表2稳定性及压力测试Table 2 Stability and stress test
REC REC numbernumber 10001000 50005000 1000010000 5000050000
CRASH CRASH timestimes 00 00 00 00
ANR ANR timestimes 00 00 00 00
TOTAL TOTAL 00 00 00 00 00
稳定性和压力测试。结合了两种测试方法。首先,服务器连续运行7x24小时。在此期间, 任务的数量和内容通过移动设备进行更新。不断观察并记录传感器端和服务器端的输出日志,没有出现死机或软件错误等异常情况。其次,在设置测试环境后,使用Android SDKMonkey软件对WeSense进行稳定性和压力测试。例如,发送Monkey-pcom.hills.WeSense-v-v1000来请求执行1000个随机命令事件(RCE),例如Map键,Home键。记录CRASH和ANR(应用程序无响应)的发生次数。CRASH是指在发生应用程序错误时程序异常停止或退出的情况。ANR表示当Android系统在5秒内检测到应用程序未响应输入事件或10秒内未执行广播时,它将引发无响应的提示。测试结果如表2所示。结合以上两个测试结果,可以看到系统可以在不同的压力条件下稳定高效地运行。Stability and stress testing. Combines two test methods. First, the server runs continuously for 7x24 hours. During this period, the number and content of tasks are updated via mobile devices. Continuously observe and record the output logs of the sensor side and the server side, and there is no abnormal situation such as crashes or software errors. Secondly, after setting up the test environment, use Android SDKMonkey software to conduct stability and stress tests on WeSense. For example, send Monkey-pcom.hills.WeSense-v-v1000 to request execution of 1000 random command events (RCE), such as Map key, Home key. Record the number of occurrences of CRASH and ANR (application not responding). CRASH refers to the situation where the program stops or exits abnormally when an application error occurs. ANR means that when the Android system detects that the application does not respond to input events within 5 seconds or does not perform broadcast within 10 seconds, it will trigger an unresponsive prompt. The test results are shown in Table 2. Combining the above two test results, we can see that the system can operate stably and efficiently under different pressure conditions.

Claims (6)

  1. 一种面向群智感知的泛在操作系统,其特征在于:A ubiquitous operating system oriented to group intelligence perception, which is characterized by:
    所述面向群智感知的泛在操作系统,任务发布者通过智能终端输入原始任务数据并提交至平台,平台捕获到任务后,对任务进行解析,并给该任务分配唯一的任务ID;当任务进入平台之后,进行任务解析并生成对应的任务特征向量,并和其他已知离散特征做特征拼接,通过任务向量提取出特性,所述特性包含但不限于任务种类、需要的参与者人数,任务执行地点,需用到的传感器和采集数据的类型;面向群智感知的泛在操作系统通过执行任务推理,关联及匹配操作,完成用户调度和任务分配过程;参与者接收到任务之后,选择感兴趣的任务执行并将采集到的感知数据或设计方案上传到面向群智感知的泛在操作系统中;当数据进入到面向群智感知的泛在操作系统后,对任务、用户、进程资源进行抽象和软件定义,根据任务特征信息描述,面向群智感知的泛在操作系统选择需要使用的任务中间件并对采集的数据进行汇总;最后将结果返回给任务发布者,发布者对任务结果进行评价和反馈,当发布者接收结果后,群智任务的生命周期结束。In the ubiquitous operating system oriented to group intelligence perception, the task publisher inputs the original task data through the smart terminal and submits it to the platform. After the task is captured by the platform, the task is analyzed and a unique task ID is assigned to the task; After entering the platform, perform task analysis and generate the corresponding task feature vector, and perform feature splicing with other known discrete features, and extract the characteristics through the task vector. The characteristics include but are not limited to the task type, the number of participants required, and the task The execution location, the type of sensors and data collected; the ubiquitous operating system for group intelligence perception completes the process of user scheduling and task allocation by performing task inference, association and matching operations; after the participants receive the task, they have a sense of choice The task of interest is executed and the collected perception data or design plan is uploaded to the ubiquitous operating system for group intelligence perception; when the data enters the ubiquitous operating system for group intelligence perception, the task, user, and process resources are performed Abstract and software definition, according to the description of task feature information, the ubiquitous operating system for group intelligence selects the task middleware to be used and summarizes the collected data; finally the results are returned to the task publisher, and the publisher performs the task results Evaluation and feedback, when the publisher receives the results, the life cycle of the group intelligence task ends.
  2. 根据权利要求1所述的一种面向群智感知的泛在操作系统,其特征在于:The ubiquitous operating system oriented to group intelligence perception according to claim 1, characterized in that:
    所述面向群智感知的泛在操作系统采用操作系统CrowdOS实现,CrowdOS运行在原生操作系统和上层应用之间,包含感知端和服务端,其中感知端的软件载体包含两类设备,第一类是具备人机交互功能的便携式智能感知设备,第二类是部署在物理世界中的固定传感器;CrowdOS采用云-边-端的部署方式;感知端部署在各类终端感知设备上,收集数据信息,服务端部署在云服务器或者边缘服务器上,对系统资源及数据资源进行综合管理并实时响应系统操作;The ubiquitous operating system oriented to group intelligence perception is implemented by the operating system CrowdOS. CrowdOS runs between the native operating system and upper-layer applications, and includes the sensing end and the server. The software carrier of the sensing end includes two types of equipment, the first type is Portable intelligent sensing devices with human-computer interaction functions. The second category is fixed sensors deployed in the physical world; CrowdOS adopts a cloud-side-to-end deployment method; the sensing end is deployed on various terminal sensing devices to collect data, information and services The end is deployed on a cloud server or edge server to comprehensively manage system resources and data resources and respond to system operations in real time;
    在感知端的功能层部分,首先发布者通过智能终端的交互功能将任务上传到基于CrowdOS的系统中,参与者通过感知端浏览并执行已发布到系统中的任务;当参与者接收任务,感知端则开始进行数据采集;系统支持层获取当前设备状态,然后对获取到的传感器数据统一进行接口封装并统一数据传输格式,再通过网络或蓝牙传输方式将数据存入对应的数据结构体中;对于无需人机交互的传感设备,一旦该设备被系统中的任务激活且设备通过了任务验证,则该设备开始自动按照预定规则收集并上传感知数据;In the functional layer part of the perception terminal, first the publisher uploads the task to the CrowdOS-based system through the interactive function of the smart terminal, and the participant browses through the perception terminal and executes the tasks that have been published in the system; when the participant receives the task, the perception terminal Data collection begins; the system support layer obtains the current device status, then uniformly interface encapsulates the acquired sensor data and unifies the data transmission format, and then stores the data in the corresponding data structure through the network or Bluetooth transmission; A sensing device that does not require human-computer interaction. Once the device is activated by a task in the system and the device passes the task verification, the device will automatically collect and upload perception data according to predetermined rules;
    服务端提供了综合管理服务,部署在服务器集群、云服务器或者边缘服务器之上,共包括任务池模块、资源管理模块、数据管理中心模块、内外部接口模块、知识库模块、系统插件模块、任务结果质量优化和联合存储和检索模块八个模块;首先当任务发布者从感知端将任务数据传输到服务端后,服务端通过任务池模块对任务进行理解和表示,通过任务池模块对接收到的任务进行解析、调度、分配以及微调,并将群智任务分配给平台中的用户;在资源管理模块中,对各种资源进行软件定义并完成设备、用户、环境以及任务进程调度的综合管理;然后进入数据管理中心模块,该模块提供了海量异构数据的分类存储以及快速检索功能,联合存储和检索模块对通过感知端采集到的数据进行存储和处理,萃取有用的信息传输给知识库模块;对于采集完成的数据,通过任务结果质量优化模块进行数据质量优化,优化完成后服务端再将最终结果通网络反馈给感知端的任务发布者,从而完成整个任务消息在感知端和服务端之间的流通;系统插件模块为用户提供隐私保护、信用评估及用户激励;内外部接口模块中,内部接口用于对操作系统进行维护和更新,外部接口提供给第三方应用,用于调用软件接口。The server provides comprehensive management services, deployed on server clusters, cloud servers or edge servers, including task pool modules, resource management modules, data management center modules, internal and external interface modules, knowledge base modules, system plug-in modules, and tasks Result quality optimization and eight modules of joint storage and retrieval modules; first, after the task publisher transmits the task data from the sensing end to the server, the server understands and expresses the task through the task pool module, and receives the data through the task pool module. Analyze, schedule, allocate and fine-tune the tasks of the company, and assign the group intelligence tasks to the users in the platform; in the resource management module, software defines various resources and completes the comprehensive management of equipment, users, environment and task process scheduling ; Then enter the data management center module, which provides classified storage and fast retrieval of massive heterogeneous data. The combined storage and retrieval module stores and processes the data collected through the sensing terminal, extracts useful information and transmits it to the knowledge base Module; for the collected data, the task result quality optimization module is used to optimize the data quality. After the optimization is completed, the server sends the final result back to the task publisher of the sensor through the network, so as to complete the entire task message on the sensor and the server. The system plug-in module provides users with privacy protection, credit evaluation and user incentives; in the internal and external interface modules, the internal interface is used to maintain and update the operating system, and the external interface is provided to third-party applications to call the software interface .
  3. 根据权利要求2所述的一种面向群智感知的泛在操作系统,其特征在于:The ubiquitous operating system oriented to group intelligence perception according to claim 2, characterized in that:
    在CrowdOS中,在任务进入后构建了五个动态Agent生成系统资源图谱,来管理系统中 的任务和资源,Agents包括任务智能体Task-agent,用户智能体User-agent,设备智能体Device-agent,环境智能体Environment-agent和进程智能体Process-agent;其中TA包含每个任务的详细信息,包括但不限于任务种类,执行时间,地点,收集的数据格式,UA是对系统中用户的抽象描述,记录此用户的信息,包括但不限于发布和执行过的任务,信用等级;DA是对终端设备资源的描述,记录该设备的类型,当前状态信息;EA抽象当前系统本身所处的软硬件环境资源,包括但不限于当前CPU、内存、存储使用情况以及系统中用户数目及可用设备总量;PA管理当前系统中存在的所有任务进程,包括但不限于进程状态、优先级、调度策略;Agents在获取到任务信息之后通过内部结构中定义的信号量完成互相之间的实时交互和更新。In CrowdOS, five dynamic Agents are constructed to generate system resource maps after the task is entered to manage the tasks and resources in the system. Agents include task agent Task-agent, user agent User-agent, and device agent Device-agent. , Environment-agent and Process-agent; TA contains detailed information about each task, including but not limited to task type, execution time, location, collected data format, UA is an abstraction of users in the system Description, record the user’s information, including but not limited to the tasks issued and performed, and credit rating; DA is the description of the terminal equipment resources, recording the type of the equipment, current status information; EA abstracts the current software system itself Hardware environment resources, including but not limited to the current CPU, memory, storage usage and the number of users in the system and the total amount of available equipment; PA manages all task processes in the current system, including but not limited to process status, priority, and scheduling strategy ; Agents complete mutual real-time interaction and update through the semaphore defined in the internal structure after obtaining the task information.
  4. 根据权利要求2所述的一种面向群智感知的泛在操作系统,其特征在于:The ubiquitous operating system oriented to group intelligence perception according to claim 2, characterized in that:
    所述任务池中,任务信息萃取部分,当群智任务进入系统,首先对任务进行语义解析和特征提取,将对接收的任务进行自然语言分析,对于通过语言描述的任务,系统进行分词处理,然后系统执行不区分语种的操作,最终提取出任务关键信息,包括但不限于任务的执行方式、地点、时间、参与者数目,对提取出的任务信息和通过规则点击选取而获取到的离散特征进行拼接,将拼接完的特征输入到一个深度神经网络中进行统一编码,输出一个高维的任务中间向量;最后通过解码将向量映射到该任务的Task-agent,完成任务向量到agents的转换过程;In the task pool, in the task information extraction part, when the group intelligence task enters the system, the task is firstly analyzed by semantic analysis and feature extraction, and natural language analysis is performed on the received task. For tasks described by language, the system performs word segmentation processing. Then the system performs operations that do not distinguish between languages, and finally extracts mission-critical information, including but not limited to the task execution method, location, time, number of participants, the extracted task information and the discrete features obtained through rule click selection For splicing, input the spliced features into a deep neural network for unified encoding, and output a high-dimensional task intermediate vector; finally, map the vector to the task-agent of the task through decoding, and complete the conversion process from task vector to agents ;
    所述任务池中,Agent生成部分包含5个Agents:Task_agent、Environment_agent、Process_agent、Device_agent、User_agent,通过任务解析和表示过程得到的;其中,taskID是任务在系统中的唯一标识符;process_state表示了该任务进程的当前状态,是处于生成态、执行态还是反馈态等;该状态协助Process_agent进行任务进程管理,Prio代表了该任务的优先级,取值为0-15,系统根据优先级顺序进行任务进程调度;taskInfo是一个结构体,包含任务详细信息,如任务时间、地点、向量表示等;Classification代表了该任务所属的类别,如数据标注类、传感信息采集类、问卷回答类等;Topic表示了该任务的主题,从关键词信息中提取,如音频采集、照片收集等;deviceNum、deviceInfo、deviceID分别代表了参与执行该任务的设备数目、设备详细信息以及设备ID列表;Sensing_Data是收集到的任务数据集指针,指向存储数据的立方体地址;In the task pool, the Agent generation part contains 5 Agents: Task_agent, Environment_agent, Process_agent, Device_agent, User_agent, which are obtained through the process of task analysis and representation; among them, taskID is the unique identifier of the task in the system; process_state represents the The current state of the task process, whether it is in the generation state, the execution state or the feedback state, etc.; this state assists Process_agent in task process management, Prio represents the priority of the task, the value is 0-15, and the system performs the task according to the priority order Process scheduling; taskInfo is a structure that contains detailed task information, such as task time, location, vector representation, etc.; Classification represents the category to which the task belongs, such as data labeling, sensor information collection, questionnaire answering, etc.; Topic Represents the subject of the task, extracted from keyword information, such as audio collection, photo collection, etc.; deviceNum, deviceInfo, and deviceID represent the number of devices participating in the task, device detailed information, and device ID list; Sensing_Data is collected Pointer to the task data set, pointing to the address of the cube where the data is stored;
    所述任务池中,调度和分配部分,任务调度子框架中包含了策略库、映射模型及策略管理模块;首先对任务资源图谱的内容进行分析和推理,将任务ID映射到策略库中,策略库中存放任务分配函数及其编号,从而完成任务分配策略选取过程;然后根据选取的分配策略对设备和用户执行调度操作,并将任务推送给合适的用户。In the task pool, the scheduling and allocation part, the task scheduling sub-framework includes a strategy library, a mapping model, and a strategy management module; first, the content of the task resource map is analyzed and reasoned, and the task ID is mapped to the strategy library. The task allocation function and its number are stored in the library, so as to complete the task allocation strategy selection process; then according to the selected allocation strategy, perform scheduling operations on the devices and users, and push tasks to appropriate users.
  5. 根据权利要求2所述的一种面向群智感知的泛在操作系统,其特征在于:The ubiquitous operating system oriented to group intelligence perception according to claim 2, characterized in that:
    所述资源管理模块包括但不限于用户、感知终端、系统环境、任务进程、系统软件、任务数据以及知识库,将CrowdOS的管理对象抽象为4类,根据任务进入系统的不同时期顺序执行,具体如下:The resource management module includes, but is not limited to, users, perception terminals, system environments, task processes, system software, task data, and knowledge bases. The management objects of CrowdOS are abstracted into 4 categories, and they are executed sequentially according to the different periods when tasks enter the system. as follows:
    1)设备、用户及环境管理1) Equipment, user and environmental management
    当感知端通过网络接入时触发信号,设备自动向系统发送当前的状态信息,包括但不限于设备类型、剩余电量、位置信息、存储占用率,基于CrowdOS框架开发的系统通过Device-agent智能体捕获和存储状态信息;When the sensing terminal is connected through the network, the signal is triggered, and the device automatically sends the current status information to the system, including but not limited to device type, remaining power, location information, and storage occupancy. The system developed based on the CrowdOS framework uses the Device-agent agent Capture and store status information;
    系统通过User-agent描绘用户画像,用户依托感知端与系统进行交互,User-agent保存用户姓名,年龄,参与过的任务,同时会根据用户参与任务的情况生成用户信用等级,用户偏好, 兴趣领域及其他个性化信息;The system portrays the user portrait through the User-agent, and the user interacts with the system on the perceptual end. The User-agent saves the user's name, age, and tasks participated in, and at the same time generates user credit ratings, user preferences, and areas of interest based on the user's participation in tasks And other personalized information;
    环境资源记录服务器架构和处理能力,资源被存储于Environment-agent中并且定时更新,该agent具备报警功能,根据当前系统状态和任务量增减情况做预判,如果系统CPU利用率或存储占用率达到了额定阈值,系统会给出警报;Environment resource records server architecture and processing capabilities. Resources are stored in the Environment-agent and updated regularly. The agent has an alarm function and makes predictions based on the current system status and the increase or decrease of the task volume. If the system CPU utilization rate or storage occupancy rate is When the rated threshold is reached, the system will give an alarm;
    2)任务进程调度管理2) Task process scheduling management
    Process-agent是任务当前阶段状态信息的集合,给每个任务分配进程识别号作为系统中任务进程存在的唯一标志;ID同时存储于Task-agent及Process-agent中,伴随任务的整个生命周期;PA类里面包含任务进程信息,包含但不限于TPID:进程唯一标识符;process_state:当前任务在系统中所处的状态,有七种可切换状态;process_strategy:进程调度策略,包含先到先服务(FCFS)、循环法(RB)、任务优先级、最高响应比优先(HRRN)、反馈优先;process_prio代表进程优先级,从0-15,0代表最高优先级,依次递减;Process-agent is a collection of the status information of the current stage of the task. A process identification number is assigned to each task as the only sign of the existence of the task process in the system; the ID is stored in both the Task-agent and the Process-agent and accompanies the entire life cycle of the task; The PA class contains task process information, including but not limited to TPID: process unique identifier; process_state: the current state of the task in the system, there are seven switchable states; process_strategy: process scheduling strategy, including first come, first served ( FCFS), round-robin method (RB), task priority, highest response ratio priority (HRRN), feedback priority; process_prio represents the process priority, from 0-15, 0 represents the highest priority, in descending order;
    任务进程状态包含创建态、生成态、分配态、执行态、处理态、反馈态及终止态七种状态;状态之间的流转关系为:首先,当用户在系统中发布任务时首先是任务创建态;通过系统认证和解析后进入任务生成态;通过任务调度和分配后进入分配态;发布完成后参与者可对任务进行执行,进而进入执行态;参与者将采集到的数据上传到系统中进行处理,进入处理态;经过数据可视化呈现把结果汇总给任务发布者,进入反馈态;当任务结果未通过发布者接收验证时,系统会暂时停留在反馈态,进而经过推理和修正,任务进程再次回到生成态、分配态或处理态,然后顺序执行;通过发布者验证任务提交结果后,则任务进入终止态,整个任务生命周期结束;The task process state includes seven states: creation state, generation state, allocation state, execution state, processing state, feedback state and termination state; the flow relationship between states is: First, when a user publishes a task in the system, the first is task creation State; enter the task generation state after passing system authentication and analysis; enter the distribution state after task scheduling and assignment; after the release is completed, participants can execute the task, and then enter the execution state; participants upload the collected data to the system After processing, enter the processing state; after data visualization, the results are summarized to the task publisher and enter the feedback state; when the task result fails the publisher’s acceptance verification, the system will temporarily stay in the feedback state, and then after reasoning and correction, the task progresses Return to the generation state, the allocation state or the processing state again, and then execute sequentially; after the publisher verifies the task submission result, the task enters the termination state, and the entire task life cycle ends;
    系统中的任务进程调度算法从以下方法中任选一个:The task process scheduling algorithm in the system can choose one of the following methods:
    1)先到先服务(FCFS):优先处理先进入系统的任务,为其提供资源和服务;1) First-come, first-served (FCFS): Prioritize tasks that enter the system first, and provide resources and services for them;
    2)循环法:以一个周期性间隔产生任务中断,将当前正在运行的进程置于任务就绪队列,基于FCFS选择下一个就绪进程运行;2) Circular method: Generate task interrupts at a periodic interval, place the currently running process in the task ready queue, and select the next ready process to run based on FCFS;
    3)任务优先级:优先处理高级别的任务,同一优先级的任务按照FCFS原则;3) Task priority: Prioritize high-level tasks, and tasks with the same priority shall follow the FCFS principle;
    4)最高响应比优先(HRRN),R=(w+s)/s,其中R表示响应比,w表示已经等待的时间,s表示期待被服务的时间;4) The highest response ratio priority (HRRN), R=(w+s)/s, where R represents the response ratio, w represents the waiting time, and s represents the time expected to be served;
    5)反馈优先:对于进入反馈态的任务,在原始优先级基础上上调两级,优先使用系统资;5) Feedback priority: For tasks that enter the feedback state, two levels will be raised on the basis of the original priority, and system resources will be used first;
    3)异构多模态数据资源管理对于多模态数据的管理,步骤如下:3) Heterogeneous multi-modal data resource management For the management of multi-modal data, the steps are as follows:
    1)收集和存储数据;1) Collect and store data;
    2)构建基于群智的非结构化数据的检索方法,一旦数据准备完成,开始构建数据堆栈;2) Construct an unstructured data retrieval method based on Qunzhi. Once the data preparation is completed, start to build the data stack;
    3)采用数据立方体技术管理和存储任务数据,构造多特征立方体结构(MC),依据构建的数据立方体来检索非结构化数据;3) Using data cube technology to manage and store task data, construct a multi-character cube structure (MC), and retrieve unstructured data based on the constructed data cube;
    4)随着系统中任务量的增长,已完成的任务数据和中间数据被定期清理,而经过深层分析的数据则会被转移到知识库中进行管理;4) As the amount of tasks in the system grows, the completed task data and intermediate data are regularly cleaned up, and the data that has undergone in-depth analysis will be transferred to the knowledge base for management;
    4)知识库管理4) Knowledge base management
    OS的知识分为两类:现有知识(EK)和新知识(NK);NK是从任务或数据中提取的,这有助于改善系统机制或更新模型;知识管理的过程如下:OS knowledge is divided into two categories: existing knowledge (EK) and new knowledge (NK); NK is extracted from tasks or data, which helps to improve system mechanisms or update models; the process of knowledge management is as follows:
    首先,系统从现有知识中分辨出能够改善系统性能、更新模型或提升任务结果质量的知识,对用户或第三方有用的信息或知识不包括在此处的知识范围内;其次,对于待被发现的知识,系统提供对应的的机制或算法对其进行挖掘,包括但不限于深度学习算法、在线更新 算法、迁移学习算法;第三,知识库根据知识的种类,形式和抽象层级,对知识进行归纳和数据库存储。First, the system distinguishes from existing knowledge that can improve system performance, update models, or improve the quality of task results. Information or knowledge useful to users or third parties is not included in the scope of knowledge here; The system provides corresponding mechanisms or algorithms to mine the discovered knowledge, including but not limited to deep learning algorithms, online update algorithms, and migration learning algorithms. Third, the knowledge base can extract knowledge based on the type, form, and abstraction level of the knowledge. Perform induction and database storage.
  6. 根据权利要求2所述的一种面向群智感知的泛在操作系统,其特征在于:The ubiquitous operating system oriented to group intelligence perception according to claim 2, characterized in that:
    所述结果质量优化模块,包含交互层,推理层以及执行层;首先,在交互层中发布者通过输入评价信息或点击人机界面上的按钮来评估任务结果,对多样化的评价内容进行分析;如果评价显示接受或者满意,执行结束指令,该任务进行被终;若通过分析发现结果存在质量问题则进入下一层;其次,进入推理层,对发布者的反馈内容执行关键信息抽取和深度分析操作,推理出现质量问题的可能原因,然后根据系统建立的推理模型,将原因映射到问题编码库中,找到对应的错误编码;第三,进入执行层,会将问题编码与对应的内部操作进行映射;修正完毕后,任务进入新的进程状态,通过各种途径修正的任务结果,将再次进入交互层,反馈给发布者,等待交互层给出新的评估结果,整个优化过程按步骤依次执行并形成一个闭环,直到发布者对任务结果满意则终止。The result quality optimization module includes an interaction layer, a reasoning layer, and an execution layer; firstly, in the interaction layer, the publisher evaluates the task results by inputting evaluation information or clicking buttons on the man-machine interface, and analyzes the diversified evaluation content ; If the evaluation shows acceptance or satisfaction, the end instruction is executed, and the task is terminated; if the result is found to have quality problems through analysis, then enter the next layer; secondly, enter the reasoning layer, and perform key information extraction and depth on the publisher’s feedback content Analyze the operation, infer the possible causes of the quality problem, and then map the reason to the problem code library according to the inference model established by the system to find the corresponding error code; third, enter the execution layer, and the problem code will be coded with the corresponding internal operation After the correction is completed, the task enters a new process state. The task results corrected through various channels will enter the interactive layer again and be fed back to the publisher, waiting for the interactive layer to give new evaluation results. The entire optimization process is step by step Execute and form a closed loop, and terminate until the publisher is satisfied with the task result.
PCT/CN2021/087993 2020-04-24 2021-04-19 Ubiquitous operating system oriented toward group intelligence perception WO2021213293A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010330505.XA CN111562972A (en) 2020-04-24 2020-04-24 Ubiquitous operating system for crowd sensing
CN202010330505.X 2020-04-24

Publications (1)

Publication Number Publication Date
WO2021213293A1 true WO2021213293A1 (en) 2021-10-28

Family

ID=72074353

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/087993 WO2021213293A1 (en) 2020-04-24 2021-04-19 Ubiquitous operating system oriented toward group intelligence perception

Country Status (2)

Country Link
CN (1) CN111562972A (en)
WO (1) WO2021213293A1 (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113835685A (en) * 2021-11-26 2021-12-24 之江实验室 Network operating system design method based on mimicry database
CN114356092A (en) * 2022-01-05 2022-04-15 花脸数字技术(杭州)有限公司 Multi-mode-based man-machine interaction system for digital human information processing
CN114697141A (en) * 2022-05-30 2022-07-01 深圳市永达电子信息股份有限公司 C4ISR situation perception analysis system and method based on state machine
CN114860396A (en) * 2022-04-10 2022-08-05 西北工业大学 Immediate perception even-execution task allocation method based on reinforcement learning
CN114971416A (en) * 2022-07-13 2022-08-30 宁波创元信息科技有限公司 Data acquisition and command scheduling system and method based on intelligent form
CN114971047A (en) * 2022-06-04 2022-08-30 哈尔滨理工大学 Mobile crowd sensing-oriented user collaborative optimization method
CN115002713A (en) * 2022-08-03 2022-09-02 中南大学 Method, system, medium and device for improving crowd sensing coverage rate
CN115047826A (en) * 2021-12-14 2022-09-13 西安航空学院 Universal implementation method for component model framework of information physical fusion system
CN115080968A (en) * 2022-06-08 2022-09-20 陕西天诚软件有限公司 Artificial intelligence server with intelligent security protection
CN115145789A (en) * 2022-07-01 2022-10-04 北京瑞风协同科技股份有限公司 Complex equipment test integrated system
CN115150069A (en) * 2022-06-13 2022-10-04 湖北工业大学 Safety data acquisition method and system based on Gaussian distribution filtering in crowd-sourcing perception
CN115292743A (en) * 2022-07-06 2022-11-04 南京邮电大学 Big data application system based on intelligent group sensing technology
CN115348325A (en) * 2022-08-24 2022-11-15 中国电子科技集团公司第十五研究所 Multichannel real-time transmission priority management and control method and system
CN115529312A (en) * 2022-09-16 2022-12-27 武汉虹信技术服务有限责任公司 Information release OpenAPI conversion system and method for multi-terminal heterogeneous system
CN115545799A (en) * 2022-11-04 2022-12-30 北京赛西科技发展有限责任公司 Information technology service quality evaluation method, device, equipment and medium
CN115535854A (en) * 2022-08-24 2022-12-30 杭州大杰智能传动科技有限公司 Intelligent tower crane maintenance control system
CN115858316A (en) * 2022-11-22 2023-03-28 中国兵器工业信息中心 Networked software system reliability modeling simulation method based on multiple agents
CN116095721A (en) * 2023-04-07 2023-05-09 湖北工业大学 Mobile crowd-sourced network contract excitation method and system integrating perception communication
CN116311387A (en) * 2023-05-25 2023-06-23 浙江工业大学 Cross-modal pedestrian re-identification method based on feature intersection
CN116582582A (en) * 2023-07-14 2023-08-11 中冶武勘智诚(武汉)工程技术有限公司 Data acquisition method, system, equipment and medium for component type Internet of things
CN116862208A (en) * 2023-09-05 2023-10-10 中煤科工机器人科技有限公司 Cluster management system for coal mine robots
CN117151914A (en) * 2023-11-01 2023-12-01 中国人民解放军国防科技大学 Crowd sensing user selection method and device based on comprehensive influence evaluation
CN117519953A (en) * 2024-01-08 2024-02-06 北京大学 Separated memory management method for server-oriented non-perception calculation

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111562972A (en) * 2020-04-24 2020-08-21 西北工业大学 Ubiquitous operating system for crowd sensing
CN114915630B (en) * 2021-02-10 2023-10-31 中国移动通信有限公司研究院 Task allocation method, network training method and device based on Internet of Things equipment
CN113242294B (en) * 2021-05-08 2022-09-16 西北工业大学 Stream computing processing method for crowd sensing data
CN113411765B (en) * 2021-05-22 2022-07-26 西北工业大学 Mobile intelligent terminal energy consumption optimization method based on multi-sensor cooperative sensing
CN113343149A (en) * 2021-06-22 2021-09-03 深圳市网联安瑞网络科技有限公司 Agent-based mobile terminal social media propagation effect evaluation method, system and application
CN115550367B (en) * 2022-11-30 2023-03-07 成都中星世通电子科技有限公司 Radio monitoring method and system based on distributed task management and resource scheduling
CN116340007B (en) * 2023-05-30 2023-08-08 中国人民解放军军事科学院系统工程研究院 Resource collaborative optimization intelligent unmanned cluster task allocation method and system
CN116820711A (en) * 2023-06-07 2023-09-29 上海幽孚网络科技有限公司 Task driven autonomous agent algorithm
CN116680201B (en) * 2023-07-31 2023-10-17 南京争锋信息科技有限公司 System pressure testing method based on machine learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160133677A (en) * 2015-05-13 2016-11-23 한국전자통신연구원 Dynamic auto verification system for improving reliability of crowdsourcing data based on programming
CN109086976A (en) * 2018-07-11 2018-12-25 陕西师范大学 A kind of method for allocating tasks towards intelligent perception
CN110995858A (en) * 2019-12-17 2020-04-10 大连理工大学 Edge network request scheduling decision method based on deep Q network
CN110992121A (en) * 2019-10-22 2020-04-10 西安电子科技大学 Perception task information distribution system and method based on perception error in crowd sensing
CN111562972A (en) * 2020-04-24 2020-08-21 西北工业大学 Ubiquitous operating system for crowd sensing

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8875131B2 (en) * 2010-11-18 2014-10-28 International Business Machines Corporation Specification of environment required for crowdsourcing tasks
CN109345331B (en) * 2018-08-21 2021-06-11 中国科学技术大学苏州研究院 Crowd sensing system task allocation method with privacy protection
CN109885944A (en) * 2019-02-25 2019-06-14 天津工业大学 A kind of blending space Mechanism of Task Allocation in mobile intelligent perception
CN110009233B (en) * 2019-04-08 2020-12-25 清华大学深圳研究生院 Game theory-based task allocation method in crowd sensing
CN110489235A (en) * 2019-08-20 2019-11-22 西北工业大学 A kind of mobile crowdsourcing method for allocating tasks based on HEFT

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160133677A (en) * 2015-05-13 2016-11-23 한국전자통신연구원 Dynamic auto verification system for improving reliability of crowdsourcing data based on programming
CN109086976A (en) * 2018-07-11 2018-12-25 陕西师范大学 A kind of method for allocating tasks towards intelligent perception
CN110992121A (en) * 2019-10-22 2020-04-10 西安电子科技大学 Perception task information distribution system and method based on perception error in crowd sensing
CN110995858A (en) * 2019-12-17 2020-04-10 大连理工大学 Edge network request scheduling decision method based on deep Q network
CN111562972A (en) * 2020-04-24 2020-08-21 西北工业大学 Ubiquitous operating system for crowd sensing

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
WANG LIANG, YU ZHIWEN GUO BIN XIONG FEI: "Crowd Sensing Socialization Task Allocation Based on Mobile Social Network", JOURNAL OF ZHEJIANG UNIVERSITY (ENGINEERING SCIENCE), GAIKAN BIANJIBU, HANGZHOU, CN, vol. 52, no. 9, 30 September 2018 (2018-09-30), CN , pages 1709 - 1716, XP055859778, ISSN: 1008-973X, DOI: 10.3785/j.issn.1008-973X.2018.09.011 *
XING QIAN, XUEMEI SUN, CHUNMIAO YUAN: "Assignment mechanism for spatial tasks in mobile crowd sensing", APPLICATION RESEARCH OF COMPUTERS, CHENGDU, CN, vol. 37, no. 3, 31 March 2020 (2020-03-31), CN , pages 868 - 871, XP055859788, ISSN: 1001-3695, DOI: 10.19734 /j.issn.1001-3695.2018.09.0644 *
XU ZHE, LI ZHUO CHEN XIN: "Multi-task assignment algorithm for mobile crowdsensing", JOURNAL OF COMPUTER APPLICATIONS, JISUANJI YINGYONG, CN, vol. 37, no. 1, 10 January 2017 (2017-01-10), CN , XP055859771, ISSN: 1001-9081, DOI: 10.11772 /j.issn.1001-9081.2017.01.0018 *

Cited By (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113835685B (en) * 2021-11-26 2022-02-18 之江实验室 Network operating system design method based on mimicry database
CN113835685A (en) * 2021-11-26 2021-12-24 之江实验室 Network operating system design method based on mimicry database
US11934383B2 (en) 2021-11-26 2024-03-19 Zhejiang Lab Mimetic database-based network operating system design method
CN115047826A (en) * 2021-12-14 2022-09-13 西安航空学院 Universal implementation method for component model framework of information physical fusion system
CN115047826B (en) * 2021-12-14 2024-04-09 西安航空学院 Method for realizing general information physical fusion system component model framework
CN114356092A (en) * 2022-01-05 2022-04-15 花脸数字技术(杭州)有限公司 Multi-mode-based man-machine interaction system for digital human information processing
CN114356092B (en) * 2022-01-05 2022-09-09 花脸数字技术(杭州)有限公司 Multi-mode-based man-machine interaction system for digital human information processing
CN114860396A (en) * 2022-04-10 2022-08-05 西北工业大学 Immediate perception even-execution task allocation method based on reinforcement learning
CN114860396B (en) * 2022-04-10 2024-02-09 西北工业大学 Instant perception even execution task allocation method based on reinforcement learning
CN114697141A (en) * 2022-05-30 2022-07-01 深圳市永达电子信息股份有限公司 C4ISR situation perception analysis system and method based on state machine
CN114697141B (en) * 2022-05-30 2022-12-27 深圳市永达电子信息股份有限公司 C4ISR situation awareness analysis system and method based on state machine
CN114971047A (en) * 2022-06-04 2022-08-30 哈尔滨理工大学 Mobile crowd sensing-oriented user collaborative optimization method
CN115080968A (en) * 2022-06-08 2022-09-20 陕西天诚软件有限公司 Artificial intelligence server with intelligent security protection
CN115080968B (en) * 2022-06-08 2023-06-02 陕西天诚软件有限公司 Artificial intelligence server with intelligent safety protection
CN115150069A (en) * 2022-06-13 2022-10-04 湖北工业大学 Safety data acquisition method and system based on Gaussian distribution filtering in crowd-sourcing perception
CN115150069B (en) * 2022-06-13 2024-04-30 湖北工业大学 Security data acquisition method and system based on Gaussian distribution filtering in crowd sensing
CN115145789A (en) * 2022-07-01 2022-10-04 北京瑞风协同科技股份有限公司 Complex equipment test integrated system
CN115292743A (en) * 2022-07-06 2022-11-04 南京邮电大学 Big data application system based on intelligent group sensing technology
CN114971416A (en) * 2022-07-13 2022-08-30 宁波创元信息科技有限公司 Data acquisition and command scheduling system and method based on intelligent form
CN115002713B (en) * 2022-08-03 2022-10-18 中南大学 Method, system, medium and device for improving crowd sensing coverage rate
CN115002713A (en) * 2022-08-03 2022-09-02 中南大学 Method, system, medium and device for improving crowd sensing coverage rate
CN115348325A (en) * 2022-08-24 2022-11-15 中国电子科技集团公司第十五研究所 Multichannel real-time transmission priority management and control method and system
CN115535854A (en) * 2022-08-24 2022-12-30 杭州大杰智能传动科技有限公司 Intelligent tower crane maintenance control system
CN115535854B (en) * 2022-08-24 2024-01-19 杭州大杰智能传动科技有限公司 Intelligent maintenance control system for tower crane
CN115348325B (en) * 2022-08-24 2024-01-23 中国电子科技集团公司第十五研究所 Multichannel real-time transmission priority management and control method and system
CN115529312A (en) * 2022-09-16 2022-12-27 武汉虹信技术服务有限责任公司 Information release OpenAPI conversion system and method for multi-terminal heterogeneous system
CN115545799B (en) * 2022-11-04 2023-03-24 北京赛西科技发展有限责任公司 Information technology service quality evaluation method, device, equipment and medium
CN115545799A (en) * 2022-11-04 2022-12-30 北京赛西科技发展有限责任公司 Information technology service quality evaluation method, device, equipment and medium
CN115858316B (en) * 2022-11-22 2023-08-08 中国兵器工业信息中心 Multi-Agent-based networked software system reliability modeling simulation method
CN115858316A (en) * 2022-11-22 2023-03-28 中国兵器工业信息中心 Networked software system reliability modeling simulation method based on multiple agents
CN116095721A (en) * 2023-04-07 2023-05-09 湖北工业大学 Mobile crowd-sourced network contract excitation method and system integrating perception communication
CN116311387B (en) * 2023-05-25 2023-09-01 浙江工业大学 Cross-modal pedestrian re-identification method based on feature intersection
CN116311387A (en) * 2023-05-25 2023-06-23 浙江工业大学 Cross-modal pedestrian re-identification method based on feature intersection
CN116582582B (en) * 2023-07-14 2023-10-13 中冶武勘智诚(武汉)工程技术有限公司 Data acquisition method, system, equipment and medium for component type Internet of things
CN116582582A (en) * 2023-07-14 2023-08-11 中冶武勘智诚(武汉)工程技术有限公司 Data acquisition method, system, equipment and medium for component type Internet of things
CN116862208A (en) * 2023-09-05 2023-10-10 中煤科工机器人科技有限公司 Cluster management system for coal mine robots
CN117151914A (en) * 2023-11-01 2023-12-01 中国人民解放军国防科技大学 Crowd sensing user selection method and device based on comprehensive influence evaluation
CN117151914B (en) * 2023-11-01 2024-01-30 中国人民解放军国防科技大学 Crowd sensing user selection method and device based on comprehensive influence evaluation
CN117519953B (en) * 2024-01-08 2024-04-05 北京大学 Separated memory management method for server-oriented non-perception calculation
CN117519953A (en) * 2024-01-08 2024-02-06 北京大学 Separated memory management method for server-oriented non-perception calculation

Also Published As

Publication number Publication date
CN111562972A (en) 2020-08-21

Similar Documents

Publication Publication Date Title
WO2021213293A1 (en) Ubiquitous operating system oriented toward group intelligence perception
JP6523354B2 (en) State machine builder with improved interface and handling of state independent events
US11288142B2 (en) Recovery strategy for a stream processing system
US20200366581A1 (en) Simplified entity lifecycle management
US10146592B2 (en) Managing resource allocation in a stream processing framework
US9965330B2 (en) Maintaining throughput of a stream processing framework while increasing processing load
US9842000B2 (en) Managing processing of long tail task sequences in a stream processing framework
US20200082340A1 (en) PROCESSING EVENTS GENERATED BY INTERNET OF THINGS (IoT)
CN108304250A (en) Method and apparatus for the node for determining operation machine learning task
CN108353090A (en) Edge intelligence platform and internet of things sensors streaming system
CN105830049A (en) Automated experimentation platform
CN103092683A (en) Scheduling used for analyzing data and based on elicitation method
CN104298550A (en) Hadoop-oriented dynamic scheduling method
Liu et al. CrowdOS: A ubiquitous operating system for crowdsourcing and mobile crowd sensing
CN114416855A (en) Visualization platform and method based on electric power big data
CN110781180B (en) Data screening method and data screening device
CN110780856B (en) Electricity data release platform based on micro-service
CN110740079A (en) full link benchmark test system for distributed scheduling system
CN113010296B (en) Formalized model based task analysis and resource allocation method and system
Tan et al. News information platform optimization based on the internet of things
CN114896049A (en) Method, system, equipment and medium for scheduling operation tasks of electric power artificial intelligence platform
WO2022048432A1 (en) Method and apparatus for constructing recommendation model, method and apparatus for constructing neural network model, and electronic device and storage medium
Yadav et al. The Event Crowd: A novel approach for crowd-enabled event processing
Shouaib et al. Survey on iot-based big data analytics
CN114707667A (en) Data-driven automatic model training and application system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21793543

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21793543

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 21793543

Country of ref document: EP

Kind code of ref document: A1