CN108491972A - A kind of crowd evacuation emulation method and device based on Sarsa algorithms - Google Patents

A kind of crowd evacuation emulation method and device based on Sarsa algorithms Download PDF

Info

Publication number
CN108491972A
CN108491972A CN201810233963.4A CN201810233963A CN108491972A CN 108491972 A CN108491972 A CN 108491972A CN 201810233963 A CN201810233963 A CN 201810233963A CN 108491972 A CN108491972 A CN 108491972A
Authority
CN
China
Prior art keywords
crowd
evacuation
model
leader
sarsa
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810233963.4A
Other languages
Chinese (zh)
Inventor
刘弘
王晴晴
段培永
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Normal University
Original Assignee
Shandong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Normal University filed Critical Shandong Normal University
Priority to CN201810233963.4A priority Critical patent/CN108491972A/en
Publication of CN108491972A publication Critical patent/CN108491972A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06Q10/047Optimisation of routes or paths, e.g. travelling salesman problem
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • G06Q50/265Personal security, identity or safety

Abstract

The invention discloses a kind of crowd evacuation emulation method and device based on Sarsa algorithms, this method include:Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and person model is imported in evacuation model of place, crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;Crowd after initialization is grouped and filters out the leader of each group, microcosmic crowd movement's guidance is carried out using social force model is improved, path is most preferably evacuated using the Sarsa algorithms selections for presetting modifiable decay value, leader leads normal pedestrian in group to be moved according to best evacuation path.Social force model and the Sarsa algorithms in intensified learning are combined by the present invention, and the guidance of evacuation path can be provided to crowd, improves evacuation efficiency.

Description

A kind of crowd evacuation emulation method and device based on Sarsa algorithms
Technical field
The invention belongs to the technical fields of crowd evacuation emulation, are dredged more particularly, to a kind of crowd based on Sarsa algorithms Dissipate emulation mode and device.
Background technology
With the continuous quickening of urbanization process, city incity building and people El density are also rapidly increasing, therewith And that comes is that public place personnel largely assemble, and in densely populated place region, once accident occurs, easily cause such as crowd Congestion such as tramples at the malignant events, often causes the dead group of group if it effectively cannot evacuate crowd and hinders serious accident.How prominent Hair event effectively carries out the condition of a disaster control and crowd evacuation when occurring, to reduce or remit casualties and property loss is both at home and abroad all The problem of highest attention.China is a populous nation, and social structure complexity is unbalanced, and especially China is in society now Transitional period, urbanization process are accelerated, population increase in city.These realities make public place crowd massing problem more Prominent, casualties caused by after accident generation is also more serious.In these public places, caused by densely populated place Injures and deaths case also happens occasionally.And scene modeling, optimum path search and crowd movement's row are carried out by computer simulation technique , can be while reaching best evacuation rehearsal effect by cost minimization, therefore for modeling, computer simulation, which becomes, to be ground Study carefully the main method of crowd evacuation under accident.
There are mainly two types of crowd evacuation emulation models, macromodel and micromodel.Macromodel is not examined from entirety Consider the local detail information of individual behavior.The interaction of micromodel each individual and environment from the point of view of individual, can be more Mend deficiency of the macromodel to pedestrian's datail description.Social force model is exactly a kind of typical micromodel, and social force model is New pedestrian's flow model that Helbing et al. is proposed in nineteen ninety-five, is described as power in social force model by pedestrian movement Effect as a result, pedestrian movement by reciprocal force drives jointly between reciprocal force, individual and environment between own drive power, individual.Wherein, The expectation that own drive power description individual is moved to target;Reciprocal force reflection individual is to other people psychological repellence and physics between individual Repel, makes to keep certain distance between individual, realize that the collision of pedestrian movement avoids;Reciprocal force ensures individual between individual and environment Safe distance between barrier makes smooth avoiding barrier during pedestrian movement, and improved social force model is in original Outlet attraction and friend's attraction are introduced on the basis of beginning social force model, respectively describe sucking action of the outlet to pedestrian And the phenomenon of gathering in groups band together of pedestrian.
Sarsa algorithms are a kind of important intensified learning methods, it was proposed in 1994 by Rummer and Niranjan Based on model algorithm, be initially referred to as improved Q- learning algorithms., still using Q value iteration, Sarsa is that one kind exists for it Line strategy TD study (on-policy TD).Agent is walked in each study, is acted a according to ε-Greedy strategy determination first, is obtained Heuristics and training example (st, at, st+1, rt+1, at+1);And then state s is determined according to ε-Greedy strategyt+1When action at+1, then carry out value function modification;By determining at+1The next action taken as agent.Obviously, Sarsa algorithms With Q- learning algorithms the difference is that Q- learns to be iterated using the maximum value of value function, and Sarsa then using Actual Q values are iterated, and in addition to this, Sarsa study is acted in each study step agent according to current Q values determination, therefore It is a kind of strategy of on-line TD study to claim Sarsa.
In conclusion how will be in social force model and intensified learning in the crowd evacuation emulation of the prior art The problem of Sarsa algorithms are conjointly employed in crowd evacuation emulation method, and how to improve the effect of crowd evacuation in public place The problem of safety of evacuation crowd under rate and crisis situations, still lack effective solution.
Invention content
For the deficiencies in the prior art, how solve in the prior art will be in social force model and intensified learning The problem of Sarsa algorithms are conjointly employed in crowd evacuation emulation method, and how to improve the effect of crowd evacuation in public place The problem of safety of evacuation crowd under rate and crisis situations, it is thin that the present invention proposes a kind of crowd based on Sarsa algorithms Emulation mode and device are dissipated, social force model and the Sarsa algorithms in intensified learning are combined, can be provided to crowd Path guidance is evacuated, evacuation efficiency is improved.
The first object of the present invention is to provide a kind of crowd evacuation emulation method based on Sarsa algorithms.
To achieve the goals above, the present invention is using a kind of following technical solution:
A kind of crowd evacuation emulation method based on Sarsa algorithms, this method include:
Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and person model is imported and is dredged It dissipates in model of place, crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Crowd after initialization is grouped to and is filtered out the leader of each group, social force model progress is micro- using improving Crowd movement's guidance is seen, path, leader's led cluster are most preferably evacuated using the Sarsa algorithms selections for presetting modifiable decay value Normal pedestrian is according to the movement of best evacuation path in group.
Scheme as a further preference, the real scene database include several real scene video informations, are utilized KLT tracing algorithms extract crowd's parameter information, crowd's parameter information from the video information of the real scene database Including crowd evacuation motion path, initialization coordinate, initial velocity.
Scheme as a further preference screens leading for each group according to crowd and outlet port in the method Person, the leader are the pedestrian near outlet port in each group.
Scheme as a further preference, in the method, the outlet if leader arrives safe and sound is drawn in selection group again Neck person, until optional without pedestrian's individual in each group.
Scheme as a further preference, the improvement social force model have modified the phase on the basis of traditional society's power model It hopes speed, structure is carried out according to self related desired speed function of distance between remaining time, pedestrian and surrounding pedestrian's speed It builds.
Scheme as a further preference, in the method, in each emergency exit of the evacuation model of place of establishment Place's setting counter, crowd's number of individuals for counting each outlet evacuation.
Scheme as a further preference according to crowd's number of individuals of each outlet evacuation, calculates separately in the method The crowding in exit;The crowding is under normal circumstances by the crowd of the predetermined number of outlet and corresponding outlet evacuation The ratio of body number total value, when exporting crowding more than the crowding threshold value, it is believed that congestion has occurred in outlet;
Leader selects most preferably to export as thin according to the crowding of distance and each outlet apart from each outlet port Dissipate target.
Scheme as a further preference, according to the leader current location, direction of action, using Sarsa algorithm meters The income of all and current location unicom next target point is calculated, and calculates average Q value and maximum Q values, is weighted and asks With, the point of Income Maximum is chosen as next evacuation target point, while updating income storage table, target is evacuated until reaching, Complete income storage table is obtained, determines best evacuation path.
Scheme as a further preference is preset and is changed in the utilization Sarsa algorithms selections most preferably evacuate path Decay value, the decay value value range be 0-1, when need Sarsa algorithms single step update when, the decay value, which is arranged, is 0, when wanting the newer dynamics of the everything of Sarsa algorithms consistent, it is 1 that the decay value, which is arranged, and the decay value value is When between 0-1, the size for presetting decay value is directly proportional to the action update intensity closer from target point.
The second object of the present invention is to provide a kind of computer readable storage medium.
To achieve the goals above, the present invention is using a kind of following technical solution:
A kind of computer readable storage medium, wherein being stored with a plurality of instruction, described instruction is suitable for by terminal device equipment Processor load and execute following processing:
Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and person model is imported and is dredged It dissipates in model of place, crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Crowd after initialization is grouped to and is filtered out the leader of each group, social force model progress is micro- using improving Crowd movement's guidance is seen, path, leader's led cluster are most preferably evacuated using the Sarsa algorithms selections for presetting modifiable decay value Normal pedestrian is according to the movement of best evacuation path in group.
The third object of the present invention is to provide a kind of terminal device.
To achieve the goals above, the present invention is using a kind of following technical solution:
A kind of terminal device, including processor and computer readable storage medium, processor is for realizing each instruction;It calculates Machine readable storage medium storing program for executing is suitable for being loaded by processor and executing following processing for storing a plurality of instruction, described instruction:
Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and person model is imported and is dredged It dissipates in model of place, crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Crowd after initialization is grouped to and is filtered out the leader of each group, social force model progress is micro- using improving Crowd movement's guidance is seen, path, leader's led cluster are most preferably evacuated using the Sarsa algorithms selections for presetting modifiable decay value Normal pedestrian is according to the movement of best evacuation path in group.
Beneficial effects of the present invention:
1, a kind of crowd evacuation emulation method and device based on Sarsa algorithms of the present invention, by Sarsa algorithms It is combined with social force model is improved, macroscopical path planning is carried out using Sarsa algorithms, carried out using improvement social force model microcosmic Individual movement instructs, the common crowd evacuation emulation completed under complex scene.
2, a kind of crowd evacuation emulation method and device based on Sarsa algorithms of the present invention, sharp Sarsa (λ) are calculated Method, be arranged a decay value λ, when λ between zero and one, value is bigger, and the action update intensity bigger closer from target point is in this way Just do not have to be limited to that single step is newer can only to update nearest step action every time, all related steps of update that can be more efficiently Action.
3, a kind of crowd evacuation emulation method and device based on Sarsa algorithms of the present invention, are being familiar with scene It is grouped evacuation under the guiding of navigation, the utilization rate in channel and personnel's peace under crisis situations in public place can be effectively improved Quan Xing is conducive to design evacuation prediction scheme, and help is provided for true evacuation rehearsal.
Description of the drawings
The accompanying drawings which form a part of this application are used for providing further understanding of the present application, and the application's shows Meaning property embodiment and its explanation do not constitute the improper restriction to the application for explaining the application.
Fig. 1 is flow chart of the method for the present invention;
Fig. 2 is crowd evacuation schematic diagram in certain classroom of the embodiment of the present invention 2;
Fig. 3 is crowd evacuation schematic diagram in certain school corridor of the embodiment of the present invention 2;
Fig. 4 be the embodiment of the present invention 2 emulation experiment in crowd evacuation initialization schematic diagram;
Fig. 5 be the embodiment of the present invention 2 crowd's grouping after lead individual choice to evacuate target, and to the mobile signal in outlet Figure;
Fig. 6 is that the individual in leading individual to execute Sarsa algorithms, organizing of the embodiment of the present invention 2 is followed and led close to outlet Schematic diagram;
Fig. 7 is the embodiment of the present invention 2 in evacuation finish time schematic diagram.
Specific implementation mode:
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation describes, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
It is noted that following detailed description is all illustrative, it is intended to provide further instruction to the application.Unless another It indicates, all technical and scientific terms that the present embodiment uses have and the application person of an ordinary skill in the technical field Normally understood identical meanings.
It should be noted that term used herein above is merely to describe specific implementation mode, and be not intended to restricted root According to the illustrative embodiments of the application.As used herein, unless the context clearly indicates otherwise, otherwise singulative It is also intended to include plural form, additionally, it should be understood that, when in the present specification using term "comprising" and/or " packet Include " when, indicate existing characteristics, step, operation, device, component and/or combination thereof.
It should be noted that flowcharts and block diagrams in the drawings show according to various embodiments of the present disclosure method and The architecture, function and operation in the cards of system.It should be noted that each box in flowchart or block diagram can represent A part for a part for one module, program segment, or code, the module, program segment, or code may include one or more A executable instruction for realizing the logic function of defined in each embodiment.It should also be noted that some alternately Realization in, the function that is marked in box can also occur according to the sequence different from being marked in attached drawing.For example, two connect The box even indicated can essentially be basically executed in parallel or they can also be executed in a reverse order sometimes, This depends on involved function.It should also be noted that each box in flowchart and or block diagram and flow chart And/or the combination of the box in block diagram, it can be come using the dedicated hardware based system for executing defined functions or operations It realizes, or can make to combine using a combination of dedicated hardware and computer instructions to realize.
In the absence of conflict, the features in the embodiments and the embodiments of the present application can be combined with each other with reference to The invention will be further described with embodiment for attached drawing.
Embodiment 1:
The purpose of the present embodiment 1 is to provide a kind of crowd evacuation emulation method based on Sarsa algorithms.
To achieve the goals above, the present invention is using a kind of following technical solution:
As shown in Figure 1,
A kind of crowd evacuation emulation method based on Sarsa algorithms, this method include:
Step (1):Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Step (2):Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and by personage's mould Type imports in evacuation model of place, and crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Step (3):Crowd after initialization is grouped to and is filtered out the leader of each group, using improvement social force mould Type carries out microcosmic crowd movement's guidance, most preferably evacuates path using the Sarsa algorithms selections for presetting modifiable decay value, leads Person leads normal pedestrian in group to be moved according to best evacuation path.
The present embodiment the step of in (1), obtains and the video record in region is specified to form real scene database, it is described true Real scene database includes several real scene video informations, utilizes KLT tracing algorithms regarding from the real scene database Crowd's parameter information is extracted in frequency information, crowd's parameter information includes crowd evacuation motion path, initializes coordinate, is initial Speed.
The present embodiment the step of in (2), setting evacuation scenario parameters information creates evacuation model of place and personage's mould Type, and person model is imported in evacuation model of place, environment space of the evacuation model of place as crowd evacuation is described Person model is as evacuation crowd.The semantic information of extraction evacuation model of place, by crowd's parameter information under the evacuation scene Crowd's initialization is carried out as default evacuation crowd parameter information, and according to default evacuation crowd parameter information.
The present embodiment the step of in (2), it is arranged at each emergency exit of the evacuation model of place of establishment and counts Device, crowd's number of individuals for counting each outlet evacuation.According to crowd's number of individuals of each outlet evacuation, outlet is calculated separately The crowding at place;The crowding is under normal circumstances by crowd's number of individuals of the predetermined number of outlet and corresponding outlet evacuation The ratio of total value, when exporting crowding more than the crowding threshold value, it is believed that congestion has occurred in outlet;Leader is according to distance The crowding of the distance of each outlet port and each outlet selects most preferably to export as evacuation target.
The present embodiment the step of in (3), evacuation crowd is grouped, one is filtered out in each group and is led Person, the leader of each group is screened according to crowd and outlet port, and the leader is in each group near outlet position The pedestrian set.Microcosmic crowd movement's guidance is carried out using social force model is improved, using Sarsa algorithms in the crowd evacuation Best evacuation path, leader is selected to lead the interior normal pedestrian of group according to the movement of best evacuation path in path;Preserve each group most Good evacuation path carries out crowd evacuation emulation as the recommendation paths of evacuation rehearsal.
The present embodiment the step of in (3), normal pedestrian follows movement according to improved social force model in group.This The improvement social force model used in invention is that outlet attraction and friend's attraction are introduced on the basis of primitive society's power model Power respectively describes outlet to the sucking action of pedestrian and the phenomenon of gathering in groups band together of pedestrian.Considering relative velocity to the social heart On the basis of managing power influence, desired speed is had modified, it is proposed that distance and surrounding pedestrian's speed between remaining time, pedestrian Self related desired speed function is spent, improvement social force model is resulted in, which can be used for more truly mould Large-scale crowd behavior in quasi- accident.
Improving social force model formula is:
The final desired speed of pedestrian is combined by self desired speed and surrounding pedestrian's average speed, pedestrian self Desired speed is exactly that pedestrian oneself wants with great speed, and direction here is not necessarily the direction for being directed toward outlet, is merely representative of Self desired route of pedestrian, improved desired speed formula:
Improved self drive is made of the quality of pedestrian, final desired speed, current velocity Huo existing speed and reaction time, from Driving force illustrates that the pedestrian it is expected to move to export direction with what kind of speed, and final desired speed here is that have with direction The speed of size, improved self drive model formation:
Improved interpersonal active force represents pedestrian j to the active force of pedestrian i, consists of two parts:The heart Manage power and physical force, when i and j not in contact with when, indicate to only exist psychological forces between two pedestrians at this time;When i and j are contacted, this is indicated When two pedestrians between there is only psychological forces, there is also physical force, improved interpersonal active force formula is:
The interaction force of pedestrian and barrier represents active force of the barrier to pedestrian i, also by psychological forces and physics Power forms, when i and barrier not in contact with when, indicate that barrier only exists psychological force effect to pedestrian at this time;When i and barrier connect When touching, public affairs indicate that there is only psychological forces effects to pedestrian for barrier at this time, there is also physics force effect, pedestrian and barrier Interaction force formula is:
fiw=[1+cg (- vi·niw)]Ai exp[(ri-diw)/Bi]niw+kg(rij-diw)niw-kg(ri-diw)(vi· tiw)tiw
In initial social force model, the interpersonal active force description of all pedestrians is the same, and in reality This is false, and friend and stranger can be distinguished by introducing friend's attraction, can be simulated realistically and be tied in groups Team's phenomenon, the formula that outlet attraction is sent out with friend's attraction are:
fis=Ci exp[(ri-dis)/Di]nis
fiq=Eexp [(riq-diq)/Fi]niq
The present embodiment the step of in (3), according to the leader current location, direction of action, using Sarsa algorithms The income of all and current location unicom next target point is calculated, and calculates average Q value and maximum Q values, is weighted and asks With, the point of Income Maximum is chosen as next evacuation target point, while updating income storage table, target is evacuated until reaching, Complete income storage table is obtained, determines best evacuation path.
Sarsa algorithms are a kind of important intensified learning methods, it equally uses trial-and-error method, need not establish environment and appoint The precise information of business describes, by learning agent can from system mode, action, reward useful information in grasp it is a set of Optimisation strategy and knowledge, the estimation of Sarsa algorithms is action value function, that is to say, that estimation is free position s under tactful π The action value function Q of all executable action aπ(s, a) for the state S of each nonterminalt, reach next state St+1 Afterwards, formula update Q (S be may be byt, at), and if StIt is final state, then enables Q (St+1=0, at+1), algorithm is final Obtain stateful-action pair Q functions, and optimal policy π is exported according to Q functions.Sarsa is introduced into crowd evacuation emulation System is advantageous in that:In face of specific environment, after training study, pedestrian can directly obtain from Sarsa algorithms Experience in find a best evacuation path and accelerate crowd evacuation so as to avoid computing repeatedly.
The interaction of row human and environment can regard the process of a sampling four-tuple as, i.e.,:
<st, at, r, st+1,at+1>
Wherein, stExpression state, atExpression acts, and r indicates return, st+1Indicate next state, at+1It indicates next The action taken when state.
The value function of Sarsa algorithms more new formula:
Q(st,at)←Q(st,at)+α[rt+1+λQ(st+1,at+1)-Q(st,at)]
In the present embodiment, using Sarsa algorithms come update evacuation during each step information, state stIt is exactly leader Current position, i.e. the distance between leader and target point;Then atBe exactly leader from current location to next position Transition, i.e., leader action direction;From starting point stState starts, and updates each state-action with Sarsa algorithms Value function Q, strategy use ε-greedy methods, until reaching aiming spot.
Using Sarsa algorithms, the value in Q table is constantly updated, then according to new value come judge will be at some What kind of action state takes, from being currently at stStart, has just calculated current at, and the s of next stept+1With at+1Also it is calculated, such iteration, until reaching target point, finally obtains complete value function Q tables, the path passed by As optimal path.
The present embodiment the step of in (3), most preferably evacuate path using Sarsa algorithms selections described, preset and can be changed Decay value more, the decay value value range are 0-1, and when needing the single step of Sarsa algorithms to update, the decay value is arranged It is 0, when wanting the newer dynamics of the everything of Sarsa algorithms consistent, it is 1 that the decay value, which is arranged, the decay value value When between 0-1, the size for presetting decay value is directly proportional to the action update intensity closer from target point.
Using Sarsa (λ) algorithm, a decay value λ is set, chooses a number between 0 and 1, when λ is equal to 0, The single step update for having reformed into Sarsa algorithms can only update nearest step action;When λ be equal to 1, reformed into bout update, Is just as to the newer dynamics of all steps action and works as λ between zero and one, value is bigger, the action closer from target point update The bigger .Sarsa of dynamics (λ) can update the preceding λ steps for getting reward, this sample embodiment does not just have to be limited to single step update Can only update nearest step action every time, all related steps of the update that the present embodiment can be more efficiently act.
The present embodiment the step of in (3), the outlet if leader arrives safe and sound, leader in selection group again, until It is optional without pedestrian's individual in each group.
Embodiment 2:
The purpose of the present embodiment 2 is to carry out experimental verification to method using the present embodiment.
In the present embodiment, the running environment of system be using Visual Studio 2012+OSG as developing instrument, It is carried out under Windows7 operating system environments, realizes the crowd evacuation emulation under complex scene.Wait for that evacuation individual exists by 350 Crowd evacuation emulation simulation is carried out on the teaching region of 300*150, as shown in Figure 4-Figure 7.
Fig. 2 is crowd evacuation video recording sectional drawing in certain classroom;Fig. 3 is crowd evacuation video recording sectional drawing in certain school corridor; Fig. 4 is the initialization schematic diagram of crowd evacuation in emulation experiment;Fig. 5 be crowd grouping after lead individual choice evacuate target, and The schematic diagram mobile to outlet;Fig. 6 is that individual is being led to execute Sarsa algorithms, and the interior individual of group, which follows, leads showing close to outlet It is intended to;Fig. 7 is in evacuation finish time schematic diagram.
Embodiment 3:
The purpose of the present embodiment 3 is to provide a kind of computer readable storage medium.
To achieve the goals above, the present invention is using a kind of following technical solution:
A kind of computer readable storage medium, wherein being stored with a plurality of instruction, described instruction is suitable for by terminal device equipment Processor load and execute following processing:
Step (1):Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Step (2):Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and by personage's mould Type imports in evacuation model of place, and crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Step (3):Crowd after initialization is grouped to and is filtered out the leader of each group, using improvement social force mould Type carries out microcosmic crowd movement's guidance, most preferably evacuates path using the Sarsa algorithms selections for presetting modifiable decay value, leads Person leads normal pedestrian in group to be moved according to best evacuation path.
Embodiment 4:
The purpose of the present embodiment 4 is to provide a kind of terminal device.
To achieve the goals above, the present invention is using a kind of following technical solution:
A kind of terminal device, including processor and computer readable storage medium, processor is for realizing each instruction;It calculates Machine readable storage medium storing program for executing is suitable for being loaded by processor and executing following processing for storing a plurality of instruction, described instruction:
Step (1):Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Step (2):Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and by personage's mould Type imports in evacuation model of place, and crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Step (3):Crowd after initialization is grouped to and is filtered out the leader of each group, using improvement social force mould Type carries out microcosmic crowd movement's guidance, most preferably evacuates path using the Sarsa algorithms selections for presetting modifiable decay value, leads Person leads normal pedestrian in group to be moved according to best evacuation path.
These computer executable instructions make the equipment execute according to each reality in the disclosure when running in a device Apply method or process described in example.
In the present embodiment, computer program product may include computer readable storage medium, containing for holding The computer-readable program instructions of row various aspects of the disclosure.Computer readable storage medium can be kept and store By the tangible device for the instruction that instruction execution equipment uses.Computer readable storage medium for example can be-- but it is unlimited In-- storage device electric, magnetic storage apparatus, light storage device, electromagnetism storage device, semiconductor memory apparatus or above-mentioned Any appropriate combination.The more specific example (non exhaustive list) of computer readable storage medium includes:Portable computing Machine disk, hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read only memory (EPROM or Flash memory), static RAM (SRAM), Portable compressed disk read-only memory (CD-ROM), digital versatile disc (DVD), memory stick, floppy disk, mechanical coding equipment, the punch card for being for example stored thereon with instruction or groove internal projection structure, with And above-mentioned any appropriate combination.Computer readable storage medium used herein above is not interpreted instantaneous signal itself, The electromagnetic wave of such as radio wave or other Free propagations, the electromagnetic wave propagated by waveguide or other transmission mediums (for example, Pass through the light pulse of fiber optic cables) or pass through electric wire transmit electric signal.
Computer-readable program instructions described herein can be downloaded to from computer readable storage medium it is each calculate/ Processing equipment, or outer computer or outer is downloaded to by network, such as internet, LAN, wide area network and/or wireless network Portion's storage device.Network may include copper transmission cable, optical fiber transmission, wireless transmission, router, fire wall, interchanger, gateway Computer and/or Edge Server.Adapter or network interface in each calculating/processing equipment are received from network to be counted Calculation machine readable program instructions, and the computer-readable program instructions are forwarded, for the meter being stored in each calculating/processing equipment In calculation machine readable storage medium storing program for executing.
Computer program instructions for executing present disclosure operation can be assembly instruction, instruction set architecture (ISA) Instruction, machine instruction, machine-dependent instructions, microcode, firmware instructions, condition setup data or with one or more programmings Language arbitrarily combines the source code or object code write, the programming language include the programming language-of object-oriented such as C++ etc., and conventional procedural programming languages-such as " C " language or similar programming language.Computer-readable program refers to Order can be executed fully, partly be executed on the user computer, as an independent software package on the user computer Execute, part on the user computer part on the remote computer execute or completely on a remote computer or server It executes.In situations involving remote computers, remote computer can include LAN by the network-of any kind (LAN) or wide area network (WAN)-is connected to subscriber computer, or, it may be connected to outer computer (such as utilize internet Service provider is connected by internet).In some embodiments, believe by using the state of computer-readable program instructions Breath comes personalized customization electronic circuit, such as programmable logic circuit, field programmable gate array (FPGA) or programmable logic Array (PLA), the electronic circuit can execute computer-readable program instructions, to realize the various aspects of present disclosure.
It should be noted that although being referred to several modules or submodule of equipment in the detailed description above, it is this Division is merely exemplary rather than enforceable.In fact, in accordance with an embodiment of the present disclosure, two or more above-described moulds The feature and function of block can embody in a module.Conversely, the feature and function of an above-described module can be with It is further divided into and is embodied by multiple modules.
Beneficial effects of the present invention:
1, a kind of crowd evacuation emulation method and device based on Sarsa algorithms of the present invention, by Sarsa algorithms It is combined with social force model is improved, macroscopical path planning is carried out using Sarsa algorithms, carried out using improvement social force model microcosmic Individual movement instructs, the common crowd evacuation emulation completed under complex scene.
2, a kind of crowd evacuation emulation method and device based on Sarsa algorithms of the present invention, sharp Sarsa (λ) are calculated Method, be arranged a decay value λ, when λ between zero and one, value is bigger, and the action update intensity bigger closer from target point is in this way Just do not have to be limited to that single step is newer can only to update nearest step action every time, all related steps of update that can be more efficiently Action.
3, a kind of crowd evacuation emulation method and device based on Sarsa algorithms of the present invention, are being familiar with scene It is grouped evacuation under the guiding of navigation, the utilization rate in channel and personnel's peace under crisis situations in public place can be effectively improved Quan Xing is conducive to design evacuation prediction scheme, and help is provided for true evacuation rehearsal.
The foregoing is merely the preferred embodiments of the application, are not intended to limit this application, for the skill of this field For art personnel, the application can have various modifications and variations.Within the spirit and principles of this application, any made by repair Change, equivalent replacement, improvement etc., should be included within the protection domain of the application.Therefore, the present invention is not intended to be limited to this These embodiments shown in text, and it is to fit to widest range consistent with the principles and novel features disclosed in this article.

Claims (10)

1. a kind of crowd evacuation emulation method based on Sarsa algorithms, which is characterized in that this method includes:
Real scene database is received, the video crowd in database that tracks obtains crowd's parameter information;
Model of place and person model are evacuated according to default evacuation scenario parameters information creating, and person model is imported into evacuation field In scape model, crowd's initialization is carried out using crowd's parameter information as default evacuation crowd's parameter information;
Crowd after initialization is grouped to and is filtered out the leader of each group, microcosmic people is carried out using social force model is improved Group's exercise guidance most preferably evacuates path using the Sarsa algorithms selections for presetting modifiable decay value, and leader leads in group Normal pedestrian is according to the movement of best evacuation path.
2. the method as described in claim 1, which is characterized in that the real scene database includes several real scene videos Information extracts crowd's parameter information, the crowd using KLT tracing algorithms from the video information of the real scene database Parameter information includes crowd evacuation motion path, initialization coordinate, initial velocity.
3. the method as described in claim 1, which is characterized in that in the method, screened according to crowd and outlet port each The leader of group, the leader are the pedestrian near outlet port in each group.
If the outlet or, leader arrives safe and sound, leader in selection group again, until optional without pedestrian's individual in each group.
4. the method as described in claim 1, which is characterized in that the improvement social force model is on traditional society's power model basis On have modified desired speed, according to distance between remaining time, pedestrian and self related desired speed of surrounding pedestrian's speed Function is built.
5. the method as described in claim 1, which is characterized in that in the method, each of model of place is evacuated in establishment Counter is set at a emergency exit, crowd's number of individuals for counting each outlet evacuation.
6. method as claimed in claim 5, which is characterized in that in the method, according to crowd's individual of each outlet evacuation Number, calculates separately the crowding in exit;The crowding is under normal circumstances by the predetermined number of outlet and corresponding outlet The ratio of crowd's number of individuals total value of evacuation, when exporting crowding more than the crowding threshold value, it is believed that gathered around outlet It is stifled;
Leader selects most preferably to export as evacuation mesh according to the crowding of distance and each outlet apart from each outlet port Mark.
7. the method as described in claim 1, which is characterized in that according to the leader current location, direction of action, use Sarsa algorithms calculate the income of all and current location unicom next target point, and calculate average Q value and maximum Q values, into Row weighted sum chooses the point of Income Maximum as next evacuation target point, while updating income storage table, is dredged until reaching Target is dissipated, complete income storage table is obtained, determines best evacuation path.
8. the method as described in claim 1, which is characterized in that in the utilization Sarsa algorithms selections most preferably evacuate path, Modifiable decay value is preset, the decay value value range is 0-1, and when needing the single step of Sarsa algorithms to update, institute is arranged It is 0 to state decay value, and when wanting the newer dynamics of the everything of Sarsa algorithms consistent, it is 1 that the decay value, which is arranged, described to decline When variate value is between 0-1, the size for presetting decay value is directly proportional to the action update intensity closer from target point.
9. a kind of computer readable storage medium, wherein being stored with a plurality of instruction, which is characterized in that described instruction is suitable for by terminal The processor of equipment equipment loads and executes the method according to any one of claim 1-8.
10. a kind of terminal device, including processor and computer readable storage medium, processor is for realizing each instruction;It calculates Machine readable storage medium storing program for executing is for storing a plurality of instruction, which is characterized in that described instruction is appointed for executing according in claim 1-8 Method described in one.
CN201810233963.4A 2018-03-21 2018-03-21 A kind of crowd evacuation emulation method and device based on Sarsa algorithms Pending CN108491972A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810233963.4A CN108491972A (en) 2018-03-21 2018-03-21 A kind of crowd evacuation emulation method and device based on Sarsa algorithms

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810233963.4A CN108491972A (en) 2018-03-21 2018-03-21 A kind of crowd evacuation emulation method and device based on Sarsa algorithms

Publications (1)

Publication Number Publication Date
CN108491972A true CN108491972A (en) 2018-09-04

Family

ID=63318893

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810233963.4A Pending CN108491972A (en) 2018-03-21 2018-03-21 A kind of crowd evacuation emulation method and device based on Sarsa algorithms

Country Status (1)

Country Link
CN (1) CN108491972A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543285A (en) * 2018-11-20 2019-03-29 山东师范大学 A kind of crowd evacuation emulation method and system of fused data driving and intensified learning
CN109670270A (en) * 2019-01-11 2019-04-23 山东师范大学 Crowd evacuation emulation method and system based on the study of multiple agent deeply
CN113536613A (en) * 2021-09-17 2021-10-22 深圳市城市交通规划设计研究中心股份有限公司 Crowd evacuation simulation method and device, terminal equipment and storage medium
CN113642978A (en) * 2021-06-30 2021-11-12 山东师范大学 Crowd evacuation method and system based on crowd sensing trust management mechanism

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104658297A (en) * 2015-02-04 2015-05-27 沈阳理工大学 Central type dynamic path inducing method based on Sarsa learning
CN105740514A (en) * 2016-01-22 2016-07-06 山东师范大学 Computer simulation system for large-size crowd evacuation and method therefor
CN107292064A (en) * 2017-08-09 2017-10-24 山东师范大学 A kind of crowd evacuation emulation method and system based on many ant colony algorithms
CN107464021A (en) * 2017-08-07 2017-12-12 山东师范大学 A kind of crowd evacuation emulation method based on intensified learning, device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104658297A (en) * 2015-02-04 2015-05-27 沈阳理工大学 Central type dynamic path inducing method based on Sarsa learning
CN105740514A (en) * 2016-01-22 2016-07-06 山东师范大学 Computer simulation system for large-size crowd evacuation and method therefor
CN107464021A (en) * 2017-08-07 2017-12-12 山东师范大学 A kind of crowd evacuation emulation method based on intensified learning, device
CN107292064A (en) * 2017-08-09 2017-10-24 山东师范大学 A kind of crowd evacuation emulation method and system based on many ant colony algorithms

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JACK GUEST,ETC.: "Visual Analysis of Situationally Aware Building Evacuations", 《VISUALIZATION AND DATA ANALYSIS 2013》 *
TRAN XUAN SANG,ETC.: "Path Finding Algorithms for Autonomous Robots Based on Reinforcement Learning", 《INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN COMPUTER ENGINEERING & TECHNOLOGY》 *
汪蕾: "社会力模型的改进研究", 《南京理工大学学报(自然科学版)》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543285A (en) * 2018-11-20 2019-03-29 山东师范大学 A kind of crowd evacuation emulation method and system of fused data driving and intensified learning
CN109670270A (en) * 2019-01-11 2019-04-23 山东师范大学 Crowd evacuation emulation method and system based on the study of multiple agent deeply
CN113642978A (en) * 2021-06-30 2021-11-12 山东师范大学 Crowd evacuation method and system based on crowd sensing trust management mechanism
CN113536613A (en) * 2021-09-17 2021-10-22 深圳市城市交通规划设计研究中心股份有限公司 Crowd evacuation simulation method and device, terminal equipment and storage medium

Similar Documents

Publication Publication Date Title
CN108491972A (en) A kind of crowd evacuation emulation method and device based on Sarsa algorithms
WO2021073046A1 (en) Parallel smart emergency collaboration method and system, and electronic device
CN109670270A (en) Crowd evacuation emulation method and system based on the study of multiple agent deeply
CN108491598B (en) Crowd evacuation simulation method and system based on path planning
CN110399983A (en) Shape similarity analysis
CN109101694B (en) A kind of the crowd behaviour emulation mode and system of the guidance of safe escape mark
CN107480320B (en) Crowd evacuation simulation method and system based on topological map and visual influence
Wong et al. Optimized evacuation route based on crowd simulation
CN107403049B (en) A kind of Q-Learning pedestrian&#39;s evacuation emulation method and system based on artificial neural network
CN110334245A (en) A kind of short video recommendation method and device of the figure neural network based on Temporal Order
CN110415521A (en) Prediction technique, device and the computer readable storage medium of traffic data
CN108446469B (en) Video-driven group behavior evacuation simulation method and device
CN101216951A (en) Intelligent group motion simulation method in virtual scenes
Liu et al. A perception‐based emotion contagion model in crowd emergent evacuation simulation
CN107480821A (en) The multi-Agent cooperation crowd evacuation emulation method and device of instance-based learning
Sun et al. Crowd evacuation simulation method combining the density field and social force model
KR102284862B1 (en) Method for providing video content for programming education
WO2021138761A1 (en) Task execution method and apparatus for virtual avatar, and terminal device
Barnett et al. Coordinated crowd simulation with topological scene analysis
CN109584667A (en) A kind of subway large passenger flow rehearsal simulation training system and method
Karbovskii et al. Multimodel agent-based simulation environment for mass-gatherings and pedestrian dynamics
WO2021102615A1 (en) Virtual reality scene and interaction method therefor, and terminal device
KR20140137068A (en) Evacuation simulation system and providing method thereof
US20110161060A1 (en) Optimization-Based exact formulation and solution of crowd simulation in virtual worlds
Zhang et al. Knowledge-based crowd motion for the unfamiliar environment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180904

RJ01 Rejection of invention patent application after publication