CN107038482A - Distributed architecture for the engineering and systematization of AI algorithms - Google Patents
Distributed architecture for the engineering and systematization of AI algorithms
- Publication number
- CN107038482A CN107038482A CN201710264446.9A CN201710264446A CN107038482A CN 107038482 A CN107038482 A CN 107038482A CN 201710264446 A CN201710264446 A CN 201710264446A CN 107038482 A CN107038482 A CN 107038482A
- Authority
- CN
- China
- Prior art keywords
- algorithm
- analysis
- video
- service
- distributed architecture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Medical Informatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Multimedia (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a distributed architecture for the engineering and systematization of AI algorithms. The whole distributed architecture is divided into four major parts: a task queue and dispatch service, a video cutting service, an algorithm analysis service, and a data center. The architecture provided by the present invention reduces the configuration requirements of any single machine, and the dispatch system makes effective use of the computing capacity of every machine: not only is the analysis of a single video accelerated, but large batches of video analysis tasks can also be handled at any time through horizontal scaling.
Description
Technical field
The invention provides a distributed architecture that accelerates deep-learning video analysis and effectively improves the CPU and GPU utilization of each computer; specifically, a distributed architecture for the engineering and systematization of AI algorithms.
Background technology
Analyzing and recognizing video content with deep learning is, in general, an offline task that consumes a great deal of time and hardware resources. The hardware capacity of a single machine cannot be expanded indefinitely to meet large-batch video analysis, nor can a single machine achieve high availability. For such situations it is necessary to adopt a distributed framework to expand machine capacity while increasing fault tolerance and achieving high availability. With a distributed architecture, the configuration requirements of a single machine can be reduced, and the dispatch system can make effective use of the computing capacity of each machine: not only is the analysis of a single video accelerated, but large batches of video analysis tasks can also be handled at any time through horizontal scaling.
Summary of the invention
The technical scheme adopted by the present invention to solve the above technical problem is to provide a distributed architecture for the engineering and systematization of AI algorithms, wherein the concrete technical scheme is as follows:
The whole distributed architecture is divided into four major parts: a task queue and dispatch service, a video cutting service, an algorithm analysis service, and a data center;
1) task queue and dispatch system: after a video file is submitted, it first enters the task queue to wait, and is then processed by the scheduler;
2) video cutting service: when an algorithm server becomes idle, a video file is taken from the queue, but it is not immediately placed on the algorithm server for analysis; the file must first be processed by the video cutting service. This service mainly speeds up the analysis of a single video;
3) algorithm analysis service: this is the server on which the deep-learning algorithms are finally executed. Scheduling of the GPUs and CPUs on the machine is also implemented here. A single physical machine is usually configured with multiple GPU cores and CPU cores; the specific allocation of these hardware resources is completed, before an algorithm is invoked, by a dispatch algorithm deployed on the machine. The algorithm execution process is also monitored, and the execution status is fed back to the front end for display. The final result of algorithm execution is fed back to the data center through a callback;
4) data center: the analysis results of a video are collected in the data center and merged into the result for that video; the data is processed into a structured form and then stored in a database, facilitating later retrieval.
In the above distributed architecture for the engineering and systematization of AI algorithms, in order to implement the distributed scheme conveniently, algorithm servers must be easy to deploy. Adding an algorithm server requires a great deal of configuration and the compilation of many extra third-party libraries; repeating this work every time a server or an algorithm is added would undoubtedly waste time. The whole system is therefore deployed with the now-popular docker: each algorithm and each service program is packaged as a docker image, and when needed, deploying the image is enough to start the service. This greatly simplifies the work, reduces deployment time, and truly realizes dynamic capacity expansion.
In the above distributed architecture for the engineering and systematization of AI algorithms, in step 1) the queue system may be implemented with RabbitMQ or any other tool that provides queue functionality. A queue is used mainly to guarantee that, when a large volume of video input arrives, none of the back-end services is overwhelmed. At the same time, the current task volume and state can be clearly observed from the back end of the dispatch system, helping operations staff add machines to handle the analysis load. In addition, the queue system decouples the services from one another well, ensuring that each can be developed and run without affecting the others.
In the above distributed architecture for the engineering and systematization of AI algorithms, in step 2) a large video file usually requires a long processing time, but if it is divided into many small videos that are distributed to multiple algorithm servers for simultaneous analysis, the analysis time of a single video is greatly reduced; the specific speed-up factor depends on the number of algorithm servers. To guarantee the continuity of the final analysis result, an appropriate strategy is adopted when cutting (for example, overlapping adjacent time slices by a few seconds). Video cutting can be done by wrapping FFmpeg with C++. Finally, the analysis results of the small files are returned to the storage server by callback for further processing. This idea has already been well practised in Hadoop's MapReduce and is equally applicable to the analysis scenario here.
In the above distributed architecture for the engineering and systematization of AI algorithms, in step 3) the various deep-learning algorithms are themselves implemented in C++. To guarantee the independence of these algorithm programs, a dispatch program for the algorithms is implemented on the algorithm server in Python (another language may be substituted). Its main functions are to act as the consumer of the task queue, to invoke different algorithm analysis services according to the usage state of the hardware, and to feed the results back to the data center.
In the above distributed architecture for the engineering and systematization of AI algorithms, in step 4) the data center uses nginx and Node.js to implement a group of APIs for external callers, mainly handling the storage, retrieval, and analysis of structured video data. If desired, the data can also be loaded into a search system such as Elasticsearch to build a video retrieval system. Finally, the storage service feeds the analysis-complete status back to the front-end system to notify the customer that the analysis is finished. At this point the whole analysis task is complete.
Compared with the prior art, the present invention has the following beneficial effects: the configuration requirements of a single machine are reduced, and the dispatch system makes effective use of the computing capacity of each machine; not only is the analysis of a single video accelerated, but large batches of video analysis tasks can also be handled at any time through horizontal scaling.
Brief description of the drawings
Fig. 1 is the system framework diagram of the distributed architecture for the engineering and systematization of AI algorithms.
Fig. 2 is the module call flow chart of the distributed architecture for the engineering and systematization of AI algorithms.
Embodiment
The whole distributed architecture is divided into four major parts — the task queue and dispatch service, the video cutting service, the algorithm analysis service, and the data center — plus the docker packaging that makes distributed deployment of the algorithms convenient. The role of each service is described below; finally, the flow of a whole task can be clearly seen from the system framework diagram and the flow chart.
1. Task queue and dispatch system.
After a video file is submitted, it first enters the task queue to wait, and is then processed by the scheduler. The queue system may be implemented with RabbitMQ or any other tool that provides queue functionality. A queue is used mainly to guarantee that, when a large volume of video input arrives, none of the back-end services is overwhelmed. At the same time, the current task volume and state can be clearly observed from the back end of the dispatch system, helping operations staff add machines to handle the analysis load. In addition, the queue system decouples the services from one another well, ensuring that each can be developed and run without affecting the others.
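The queuing and dispatching behaviour described above can be sketched in miniature as follows. This is a minimal in-process stand-in built on Python's standard-library `queue`, not the RabbitMQ deployment the patent proposes; worker threads play the role of idle algorithm servers consuming tasks, and all names are illustrative.

```python
import queue
import threading

task_queue = queue.Queue()  # stand-in for a RabbitMQ task queue

def submit_video(path):
    """Producer side: a submitted video file enters the queue and waits."""
    task_queue.put(path)

def worker(worker_id, handled):
    """Consumer side: an idle algorithm server pulls the next task."""
    while True:
        path = task_queue.get()
        try:
            if path is None:                   # sentinel: stop this worker
                return
            handled.append((worker_id, path))  # real system: hand off to video cutting
        finally:
            task_queue.task_done()

handled = []
workers = [threading.Thread(target=worker, args=(i, handled)) for i in range(3)]
for t in workers:
    t.start()
for n in range(6):
    submit_video(f"video_{n}.mp4")
task_queue.join()          # blocks until every submitted task has been consumed
for _ in workers:
    task_queue.put(None)   # one sentinel per worker shuts the pool down
for t in workers:
    t.join()
print(len(handled))        # 6
```

Because the producers never talk to the workers directly, a burst of submissions only lengthens the queue rather than hitting the back-end services — the decoupling property the text describes.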
2. Video cutting service.
When an algorithm server becomes idle, a video file is taken from the queue, but it is not immediately placed on the algorithm server for analysis; the video cutting service must first process it into finer-grained video files. This service mainly speeds up the analysis of a single video. A large video file usually requires a long processing time, but if it is divided into many small videos that are distributed to multiple algorithm servers for simultaneous analysis, the analysis time of a single video is greatly reduced (the specific speed-up factor depends on the number of algorithm servers). To guarantee the continuity of the final analysis result, an appropriate strategy can be adopted when cutting (for example, partially overlapping adjacent time slices). Finally, the analysis results of the small files are returned to the storage server by callback for further processing. This idea has already been well practised in Hadoop's MapReduce and is equally applicable to the analysis scenario here.
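The overlapping-cut strategy above can be illustrated with a small boundary calculation. This sketch only computes the (start, end) times of the slices; the actual cutting would be done by FFmpeg (which the patent wraps with C++), and the segment length and overlap values are illustrative assumptions.

```python
def cut_plan(duration, segment_len, overlap):
    """Compute (start, end) times, in seconds, for cutting one long video into
    small segments whose adjacent time slices overlap by `overlap` seconds,
    preserving the continuity of the merged analysis result."""
    if overlap >= segment_len:
        raise ValueError("overlap must be shorter than a segment")
    segments, start = [], 0.0
    while start < duration:
        end = min(start + segment_len, duration)
        segments.append((start, end))
        if end >= duration:
            break
        start = end - overlap   # the next slice starts inside the previous one
    return segments

plan = cut_plan(duration=125.0, segment_len=60.0, overlap=5.0)
# Each planned segment could then be produced with a command along the lines of
#   ffmpeg -ss <start> -to <end> -i input.mp4 -c copy part_k.mp4
print(plan)   # [(0.0, 60.0), (55.0, 115.0), (110.0, 125.0)]
```

Each segment can then be queued for a different algorithm server, and the few overlapping seconds let the data center stitch the per-segment results back together without gaps.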
3. Algorithm analysis service.
This is the server on which the deep-learning algorithms are finally executed. Scheduling of the GPUs and CPUs on the machine is also implemented here. A single physical machine is usually configured with multiple GPU cores and CPU cores; the specific allocation of these hardware resources is completed, before an algorithm is invoked, by a dispatch algorithm deployed on these machines. The algorithm execution process can also be monitored, and the execution status is fed back to the front end for display. The final result of algorithm execution is fed back to the data center through a callback.
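A minimal sketch of such per-machine dispatching follows, under the simplifying assumption that load is approximated by a running-task count; a real dispatcher would read actual utilisation (e.g. from nvidia-smi) and manage the CPU cores as well.

```python
class DeviceDispatcher:
    """Sketch of the on-machine dispatch program: before a deep-learning
    algorithm is invoked, the task is assigned to the GPU core with the
    fewest running tasks. Loads here are plain task counts."""

    def __init__(self, n_gpus):
        self.load = {f"gpu{i}": 0 for i in range(n_gpus)}

    def acquire(self):
        # pick the least-loaded GPU for the next analysis task
        device = min(self.load, key=self.load.get)
        self.load[device] += 1
        return device

    def release(self, device):
        # called from the result callback once the algorithm has finished
        self.load[device] -= 1

d = DeviceDispatcher(n_gpus=2)
first, second, third = d.acquire(), d.acquire(), d.acquire()
print(first, second, third)   # the three tasks spread across gpu0 and gpu1
```

The same `acquire`/`release` pair is where execution-status feedback to the front end and the result callback to the data center would be hooked in.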
4. Data center.
The analysis results of a video are collected in the data center and merged into the result for that video; the data is processed into a structured form and then stored in a database, facilitating later retrieval. If desired, the data can also be loaded into a search system such as Elasticsearch to build a video retrieval system. Finally, the storage service feeds the analysis-complete status back to the front-end system to notify the customer that the analysis is finished.
At this point the whole analysis task is complete.
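The merging step in the data center can be sketched as follows. The event format and the duplicate rule (same timestamp and label means the same detection) are assumptions made for illustration, not details given by the patent.

```python
def merge_segment_results(segment_results):
    """Data-center step: collect the per-segment analysis results of one video
    and merge them into a single structured record, dropping events that were
    detected twice inside the overlapping seconds of adjacent segments."""
    merged, seen = [], set()
    for seg in sorted(segment_results, key=lambda s: s["start"]):
        for event in seg["events"]:
            key = (event["t"], event["label"])   # same time + label = duplicate
            if key not in seen:
                seen.add(key)
                merged.append(event)
    return {"video": segment_results[0]["video"],
            "events": sorted(merged, key=lambda e: e["t"])}

# Hypothetical results for two segments cut with a 5-second overlap:
parts = [
    {"video": "v1.mp4", "start": 0.0,
     "events": [{"t": 12.0, "label": "car"}, {"t": 58.0, "label": "person"}]},
    {"video": "v1.mp4", "start": 55.0,
     "events": [{"t": 58.0, "label": "person"}, {"t": 90.0, "label": "car"}]},
]
record = merge_segment_results(parts)
print(len(record["events"]))   # 3: the duplicate detection at t=58.0 is merged away
```

The structured record would then be written to the database, and optionally indexed in Elasticsearch for the video retrieval system.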
5. In order to implement the distributed scheme above conveniently, algorithm servers must be easy to deploy. Adding an algorithm server requires a great deal of configuration and the compilation of many extra third-party libraries; repeating this work every time a server or an algorithm is added would undoubtedly waste time. The whole system is therefore deployed with the now-popular docker: each algorithm and each service program is packaged as a docker image, and when needed, deploying the image is enough to start the service. This greatly simplifies the work, reduces deployment time, and truly realizes dynamic capacity expansion.
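As one possible illustration of this docker packaging, an image for a single algorithm analysis service might be described roughly as below. The base image, file paths, and the pika dependency are assumptions made for the sketch, not details given by the patent.

```dockerfile
# Hypothetical image for one algorithm analysis service (all names illustrative).
FROM nvidia/cuda:11.8.0-runtime-ubuntu22.04

# The compiled C++ algorithm and its Python dispatch program (task-queue consumer).
COPY ./algorithm /opt/algorithm
COPY ./dispatcher.py /opt/dispatcher.py

# Third-party dependencies are baked into the image once, instead of being
# recompiled on every new algorithm server.
RUN apt-get update && apt-get install -y python3-pip && pip3 install pika

# Deploying the image is enough to start the service.
CMD ["python3", "/opt/dispatcher.py"]
```

Starting a new algorithm server then reduces to pulling and running this image, which is what makes the dynamic capacity expansion practical.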
Although the present invention is disclosed above with preferred embodiments, they do not limit the present invention. Any person skilled in the art may make minor modifications and refinements without departing from the spirit and scope of the present invention; the scope of protection of the present invention shall therefore be defined by the appended claims.
Claims (5)
1. A distributed architecture for the engineering and systematization of AI algorithms, characterised in that the whole distributed architecture is divided into four major parts: a task queue and dispatch service, a video cutting service, an algorithm analysis service, and a data center;
1) task queue and dispatch system: after a video file is submitted, it first enters the task queue to wait, and is then processed by the scheduler;
2) video cutting service: when an algorithm server becomes idle, a video file is taken from the queue, but it is not immediately placed on the algorithm server for analysis; the file must first be processed by the video cutting service, which speeds up the analysis of a single video;
3) algorithm analysis service: the server on which the deep-learning algorithms are finally executed; scheduling of the GPUs and CPUs on the machine is also implemented here; a single physical machine is usually configured with multiple GPU cores and CPU cores, and the specific allocation of hardware resources is completed, before an algorithm is invoked, by a dispatch algorithm deployed on the machine; the algorithm execution process is monitored, the execution status is fed back to the front end for display, and the final result of algorithm execution is fed back to the data center through a callback;
4) data center: the analysis results of a video are collected in the data center and merged into the result for that video; the data is processed into a structured form and then stored in a database, facilitating later retrieval.
2. The distributed architecture for the engineering and systematization of AI algorithms as claimed in claim 1, characterised in that: in order to implement the distributed scheme of 1)-4) conveniently, algorithm servers must be deployed; the whole system is deployed by means of docker, each algorithm program being packaged as a docker image that is deployed to start the service when needed.
3. The distributed architecture for the engineering and systematization of AI algorithms as claimed in claim 2, characterised in that: in step 1), the queue system is implemented with the RabbitMQ tool; the queue guarantees that the back-end services are not overwhelmed when a large volume of video input arrives; at the same time, the current task volume and state are observed from the back end of the dispatch system, helping operations staff add machines to handle the analysis load; in addition, the queue system decouples the services from one another, ensuring that each can be developed and run without being affected.
4. The distributed architecture for the engineering and systematization of AI algorithms as claimed in claim 3, characterised in that: in step 2), a video file is divided into multiple videos that are distributed to multiple algorithm servers for simultaneous analysis, reducing the analysis time of a single video, the specific speed-up factor depending on the number of algorithm servers; to guarantee the continuity of the final analysis result, a strategy of partially overlapping time slices is adopted when cutting; finally, the analysis results of the small files are returned to the storage server by callback for processing.
5. The distributed architecture for the engineering and systematization of AI algorithms as claimed in claim 4, characterised in that: in step 4), the database contents are simultaneously loaded into a search system such as Elasticsearch to build a video retrieval system; the storage service feeds the final analysis-complete status back to the front-end system to notify the customer that the analysis is finished, at which point the whole analysis task is complete.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710264446.9A CN107038482A (en) | 2017-04-21 | 2017-04-21 | Distributed architecture for the engineering and systematization of AI algorithms |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107038482A true CN107038482A (en) | 2017-08-11 |
Family
ID=59535222
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710264446.9A Pending CN107038482A (en) | 2017-04-21 | 2017-04-21 | Applied to AI algorithm engineerings, the Distributed Architecture of systematization |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107038482A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103605709A (en) * | 2013-11-12 | 2014-02-26 | 天脉聚源(北京)传媒科技有限公司 | Distributed audio and video processing device and distributed audio and video processing method |
CN103699656A (en) * | 2013-12-27 | 2014-04-02 | 同济大学 | GPU-based mass-multimedia-data-oriented MapReduce platform |
US20150088931A1 (en) * | 2012-08-13 | 2015-03-26 | Hulu, LLC | Job Dispatcher of Transcoding Jobs for Media Programs |
CN104850576A (en) * | 2015-03-02 | 2015-08-19 | 武汉烽火众智数字技术有限责任公司 | Fast characteristic extraction system based on mass videos |
CN105243160A (en) * | 2015-10-28 | 2016-01-13 | 西安美林数据技术股份有限公司 | Mass data-based distributed video processing system |
Non-Patent Citations (1)
Title |
---|
Miao Yushun: "Load distribution of high-performance tasks with RabbitMQ work queues", CSDN Blog *
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109961151A (en) * | 2017-12-21 | 2019-07-02 | 同方威视科技江苏有限公司 | For the system for calculating service of machine learning and for the method for machine learning |
CN109961151B (en) * | 2017-12-21 | 2021-05-14 | 同方威视科技江苏有限公司 | System of computing services for machine learning and method for machine learning |
CN108108248A (en) * | 2017-12-28 | 2018-06-01 | 郑州云海信息技术有限公司 | A kind of CPU+GPU cluster management methods, device and equipment for realizing target detection |
CN110750342A (en) * | 2019-05-23 | 2020-02-04 | 北京嘀嘀无限科技发展有限公司 | Scheduling method, scheduling device, electronic equipment and readable storage medium |
CN110750342B (en) * | 2019-05-23 | 2020-10-09 | 北京嘀嘀无限科技发展有限公司 | Scheduling method, scheduling device, electronic equipment and readable storage medium |
CN113992493A (en) * | 2020-07-08 | 2022-01-28 | 阿里巴巴集团控股有限公司 | Video processing method, system, device and storage medium |
CN114339266A (en) * | 2021-12-14 | 2022-04-12 | 浪潮软件集团有限公司 | Video stream queue processing method based on domestic CPU and operating system |
CN114339266B (en) * | 2021-12-14 | 2023-09-01 | 浪潮软件集团有限公司 | Video stream queue processing method based on domestic CPU and operating system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
Application publication date: 20170811 |