CN110489749A - Intelligent Office-Automation System Work Flow Optimizing - Google Patents

Intelligent Office-Automation System Work Flow Optimizing Download PDF

Info

Publication number
CN110489749A
CN110489749A CN201910723500.0A CN201910723500A CN110489749A CN 110489749 A CN110489749 A CN 110489749A CN 201910723500 A CN201910723500 A CN 201910723500A CN 110489749 A CN110489749 A CN 110489749A
Authority
CN
China
Prior art keywords
term vector
official document
document
layer
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910723500.0A
Other languages
Chinese (zh)
Other versions
CN110489749B (en
Inventor
于劲松
刘犇
武耀
代京
唐荻音
刘浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Beijing University of Aeronautics and Astronautics
Original Assignee
Beijing University of Aeronautics and Astronautics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Aeronautics and Astronautics filed Critical Beijing University of Aeronautics and Astronautics
Priority to CN201910723500.0A priority Critical patent/CN110489749B/en
Publication of CN110489749A publication Critical patent/CN110489749A/en
Application granted granted Critical
Publication of CN110489749B publication Critical patent/CN110489749B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention discloses a kind of intelligent Office-Automation System Work Flow Optimizing method, and the intelligent Office-Automation System operation flow mainly includes the building of the office system environment based on BPM workflow engine, the building in official document Word2vec term vector library, the regularization processing in term vector library, the extraction of official document feature set and the document information identification based on convolutional neural networks.Office system environment based on BPM workflow engine realizes the lasting upgrading optimization of information transmitting, data synchronization, business monitoring and business events flow path.The mapping of the system of the Environmental Support realistic model management business and workflow, comprehensively precisely defines the operation content of each task and operational staff, converts unified, automatic, standard computer system operation for Business Process Management.The Work Flow Optimizing based on convolutional neural networks is furthermore achieved in the building that the present invention also passes through official document Word2vec term vector library, cooperates with the various system resources of enterprises, reduces time cost, improve the response efficiency of business process system.

Description

Intelligent Office-Automation System Work Flow Optimizing
Technical field
The present invention relates to the document information intelligent identification technology in the office automation system, realize that operation flow pushes away automatically It recommends, relates generally to the building in official document Word2vec term vector library and the document information identification based on convolutional neural networks.
Background technique
Recently as the heating of big data and artificial intelligence concept, more and more enterprises start to pay attention to containing in data In commercial value, and increasingly mature with data mining technology, machine learning is in government, finance, communication, medical treatment, zero Sell etc. in multiple industries using more and more extensive and deeply, based on specific transactions target problem and historical data building data point Analyse model, the model managements behaviors such as implementation model storage, model verifying, model application deployment and model performance monitoring, it has also become Important link in business data mining process.To cope with continually changing market environment and continually changing customer demand, How to establish it is efficient, repeatable, can iteratively faster, and the model management process that flexibility ratio is high is that enterprise promotes the analysis of its data Ability and by its ability and resource conversion be value one of key subject, there is very strong research and practice significance.This patent The application model of Data Analysis Model management and workflow composing is explored based on this direction.
Conventional message-driven or event driven Workflow system can preferably support the operation of office system, but It is that can not keep up with the rapid development of artificial intelligence.It realizes and the combination of document information and historical data analysis and mentions For intelligent Service, it is necessary to be included in data driven technique in former Workflow system.In addition, Chinese government's office is mainly with official document For foundation, so the importance of the intelligent Service based on document is especially prominent.How document cooperateed with into processing technique and semanteme Processing technique is used for office system, realizes document information intelligent recognition and provides intelligent Service for user, needs solve Major issue.
Although existing system changes original document treatment mode, do not change the process flow of official document, I.e., it has no advanced management method increase is come in.Product more commonly used at present is made a general survey of, what they were capable of providing Function is substantially the processing around official document, and no matter additional function for depth or range far can not all enable us It is satisfied.Its basic reason is exactly that these systems only merely handle official document, and oneself processing official document is not used, that is, Say, do not accomplish accumulation of knowledge, can not oneself has according to these information be that enterprise needs the service of making a policy.Official document system exists Official document does not have automatic official document automating sorting function when distributing, and the requirement for user is relatively high, this is need urgently to be resolved It asks.In addition, official document caused by enterprise increases sharply with the rapid expansion of information technology, data volume is huge in process log how More efficiently effectively handled official business demand just urgently to be resolved at Liao Ge company using these data.Therefore, under privately owned cloud environment The characteristics of magnanimity, dispersion, dynamic change is presented in information data, needs to carry out the Business Stream under privately owned cloud environment based on data-driven Cheng Youhua key technology research.
Summary of the invention
Intelligent Office-Automation System operation flow recommends the method for being based primarily upon data-driven intelligently to know document information Not, the function of realization mainly has: being drawn based on BPM (Business process management, Business Process Management) workflow The building of the office system environment held up, the building in official document Word2vec term vector library, the regularization processing in term vector library, official document are special The extraction of collection and based on convolutional neural networks document information identification, as shown in Figure 1.
Intelligent Office-Automation System use the office system environment based on BPM workflow engine, realize information transmitting, The lasting upgrading optimization of data synchronization, business monitoring and business events flow path.The Environmental Support realistic model management business and work Make the system flowed mapping, comprehensively the operation content of each task and operational staff is precisely defined, by operation flow pipe Reason is converted into unified, automatic, standard computer system operation.This patent incorporates deep learning in this context simultaneously Frame as auxiliary support, be furthermore achieved by the building in official document Word2vec term vector library based on convolutional neural networks Work Flow Optimizing, cooperate with the various system resources of enterprises, reduce time cost, improve the sound of business process system Answer efficiency.
The present invention has the advantages that
1. the operation flow recommender system is using the OA based on BPM workflow engine, (Office Automation, is done Public affairs automation) system, conventional operation stream engine is compared, the system information conformability is strong, and the circulation data between document pass in real time It passs, and all kinds of IT systems and resource can be integrated, unified flow services are provided for other application, be fusion much information service Operation flow examination & approval management platform.In addition, the configurability of process height can greatly reduce exploitation amount, quick, low cost pipe Simple or complicated all kinds of operation flow lists are managed, meanwhile, the unified management of artificial process and automatic flow can adapt to process Fast-changing needs.
2. the technology in official document Word2vec term vector library is by neural network to context semanteme and context and target Between relationship modeled.This method proposes the form for converting word to vector for complicated document information, effectively solves It has determined the problem, dimension can be reduced and capture position related information of some word in official document.
3. Work Flow Optimizing system is on the basis of Word2vec term vector library technology, introducing TF-IDF, (word frequency-is inverse Text frequency) algorithm calculates the weight of each term vector in the text, and it is public for assessing the portion in a words or corpus The significance level of text.This for BPM workflow engine provide as degree of correlation between file and user query measurement or Grading.
4. Work Flow Optimizing system realizes the intelligent recognition of document information based on convolutional neural networks, difference has been used Convolution kernel convolution is carried out to official document matrix, the width of convolution kernel is equal to the length of term vector, then uses max-pooling The vector that (maximum value sampling) extracts each convolution kernel operates, the last corresponding number of each convolution kernel, this A little convolution kernels are stitched together, and have just obtained the sentence vector for characterizing the official document, have finally obtained business by each section Weight The recommendation results of process improve the accuracy of the efficiency and operation flow of office worker.
Detailed description of the invention
Fig. 1 is the Work Flow Optimizing system function frame based on BPM workflow engine
Fig. 2 is BPM workflow engine frame
Fig. 3 is operation flow message flow exemplary diagram
Fig. 4 is the mapping relations of official document and term vector based on Word2vec
Fig. 5 is feature extraction and the weight distribution flow chart of official document
Fig. 6 is the official document classification model of convolutional neural networks
Fig. 7 is operation flow intelligent recommendation system interface
Specific embodiment
The Work Flow Optimizing of intelligent Office-Automation System provided by the invention is carried out specifically with reference to the accompanying drawing It is bright:
1. the OA office system based on BPM workflow engine
BPM is the management based on operation flow itself, the design of comprehensive support process, execute, management, association, optimization it is each A aspect full-automation Collaboration.In order to which service procedure and artificial process are all integrated into BPMN (BPM Network), A kind of architecture design of complexity is needed, as shown in Fig. 2, the BPM engine for incorporating BPEL4People is the real core of whole system BPMN2 design test deployment process can be used to wherein in the heart, domain expert, and management console can be carried out each of process flow operation Kind management, BPM engine are not only responsible for sending task to human task system, be also responsible for each REST (Representational State Transfer) endpoint service interactive correspondence.Wherein BPM message flow acts on BPMN cooperation Figure, collaboration diagram show how two or more no central controlled processes interact in a synchronous manner.Message flow is table The mode how process individually controlled up to two is in communication with each other and cooperate, activity or event in a pond can be to another Message is initiated in pond, and message flow is depicted as a dotted line, and hollow circles indicate the source of message, and empty arrow indicates message expiration, disappears Breath stream example is as shown in Figure 3.Each process is included in the pond Pool of oneself, and pond is often marked with participant's title.Face Various resources and service can be driven to the process of service management, realizes a variety of inquiries and statistical analysis.The office system will be believed Breath exchange, knowledge, notice bulletin etc. are unified on process portal, improve the tissue in process implementation procedure and team collaboration, mistake Journey and result can all analyze retrospect.
2. the building in official document Word2vec term vector library and the extraction of official document feature set
Word2vec is the correlation model that a group is used to generate term vector.These models are double-deck neural network, are passed through Train construction term vector library again.After training is completed, word2vec model can be used to map each word to a vector, this to Amount can be used to indicate the relationship between word and word, and specific mapping process is as shown in Figure 4.
The training process of Word2vec is as follows:
1) word after word segmentation processing is indicated by the way of only heat type coding.
2) term vector obtained in the previous step is multiplied into projection matrix and obtains the input of hidden layer, then obtained by activation primitive hidden Hide the output of layer.But it calculates to generally use for simplicity and the output of input layer directly summation is obtained into the input of hidden layer.
3) output layer is a Hofman tree, and wherein leaf node corresponds to the word in vocabulary, and non-leaf nodes is equivalent to Parameter of the hidden layer to output layer.Pass through the weight between the available input layer of training pattern and hidden layer.
4) word that initial only heat type encodes is indicated and the multiplied by weight between input layer and hidden layer obtains word Term vector indicate.
3. the feature extraction and weight distribution of official document
This submodule is first to official document diCarry out word segmentation processing Wi=[w1,w2,…,wn], n be word number, then further according to Text after participle is replaced with low-dimensional numerical value vector by Word2vec term vector library For wi Term vector,K is the dimension of term vector, makes official document expressing from the intractable high latitude of neural network High sparse traditional data, the continuous dense matrix data for becoming similar image indicate.This official document expressing method avoids tradition The cumbersome work that manual features select in machine learning text classification algorithm, allows document information to obtain maximum reservation, And this patent uses modified TF-IDF algorithm and carries out term vector weight calculation, and specific process flow is as shown in Figure 5.
4. the document information based on convolutional neural networks identifies
Convolutional neural networks are generally made of input layer, multiple convolutional layers, pond layer, full articulamentum and softmax layers, are used It is as shown in Figure 6 in the convolutional neural networks model of document information identification.By multiple and different convolution kernel ω, ω in one convolutional layer ∈Rhk, h is the height of convolution kernel, and k is the Spatial Dimension of term vector, and convolution kernel is every to pass through a text with 1 slide downward of step-length Convolution algorithm is carried out when the window of vector h*k, generates a new characteristic value.Wi:i+hThe sequence of terms for being h+1 for a length (Wi,Wi+1,…,Wi+h), ω is convolution kernel matrix weight parameter, and b is bias term, and b ∈ R, operator () is convolutional calculation, f For activation primitive.One convolution kernel obtains a characteristic pattern c=(c after handling text vector1,c2,…,cn-h+1), n is official document The number of middle word.Pond layer is extracted using feature of the 1-max-pooling to characteristic pattern, cm=max { c }, passes through pond After changing layer processing, the text of different length all becomes the feature of equal length.The input of full articulamentum is the feature of pond layer Output, inputs and isP is the type of convolution kernel, and q is every kind of convolution kernel Number, output layer carries out kind judging using softmax function, to reach operation flow intelligent recommendation, implements boundary Face is as shown in Figure 7.

Claims (4)

1. intelligent Office-Automation System Work Flow Optimizing, it is characterised in that: the office system ring based on BPM workflow engine The building of border, the building in official document Word2vec term vector library, the regularization processing in term vector library, the extraction of official document feature set and base It is identified in the document information of convolutional neural networks.Office system environment based on BPM workflow engine, realize information transmitting, The lasting upgrading optimization of data synchronization, business monitoring and business events flow path;The building in official document Word2vec term vector library will be gone through History DOC DATA is mapped to term vector, efficiently solves the problems, such as the expression of official document complex information;The regularization in term vector library is handled It ensure that the consistency and validity of document information format, with the extraction of official document feature set convenient for carrying out official document using neural network Information processing;Document information identification based on convolutional neural networks carries out convolution weighting point to the matrix that official document term vector is constituted The recommendation results of operation flow can be obtained in class, improve the accuracy of the efficiency and operation flow of office worker.
2. intelligent Office-Automation System Work Flow Optimizing according to claim 1, it is characterised in that: the official document The relationship of the neural network to context semanteme and between context and target that be constructed by Word2vec term vector library carries out Modeling.Word in the present invention by official document after word segmentation processing is indicated by the way of one-hot coding;It will be obtained in the previous step Term vector multiplies projection matrix and obtains the input of hidden layer, then the output of input layer directly summation is obtained the input of hidden layer;It is defeated Layer is a Hofman tree out, and wherein leaf node is exactly the word in corresponding vocabulary, and non-leaf nodes is equivalent to hidden layer and arrives The parameter of output layer passes through the weight between the available input layer of training pattern and hidden layer;Initial only heat type is encoded Word indicate and multiplied by weight between input layer and hidden layer is to obtain the term vector of word to indicate.This method is for complexity Document information proposes the form for converting word to vector, efficiently solves the problems, such as this, can reduce dimension and capture Position related information of some word in official document.
3. intelligent Office-Automation System Work Flow Optimizing according to claim 1, it is characterised in that: term vector library The extraction of regularization processing and official document feature set is utilized TF-IDF algorithm and has carried out at distribution to term vector weight in the present invention Reason, first to official document diCarry out word segmentation processing Wi=[w1,w2,…,wn], n is word number, then further according to Word2vec term vector Text after participle is replaced with low-dimensional numerical value vector by library For wiTerm vector,K is the dimension of term vector, makes official document expressing from the high sparse biography of the intractable high latitude of neural network System data, the continuous dense matrix data for becoming similar image indicate.This official document expressing method avoids conventional machines study The cumbersome work that manual features select in text classification algorithm, allows document information to obtain maximum reservation.
4. intelligent Office-Automation System Work Flow Optimizing according to claim 1, it is characterised in that: based on convolution mind Document information identification through network, at input layer, multiple convolutional layers, pond layer, full articulamentum and softmax layers of information Reason.By multiple and different convolution kernel ω, ω ∈ R in one convolutional layerhk, h is the height of convolution kernel, and k is the space dimension of term vector Degree, for convolution kernel with 1 slide downward of step-length, when every window by a text vector h*k, carries out convolution algorithm, generate one it is new Characteristic value.Wi:i+hSequence of terms (the W for being h+1 for a lengthi,Wi+1,…,Wi+h), ω is convolution kernel matrix weight parameter, B is bias term, and b ∈ R, operator () is convolutional calculation, and f is activation primitive.One convolution kernel obtains after handling text vector To a characteristic pattern c=(c1,c2,…,cn-h+1), n is the number of word in official document.Pond layer is using 1-max-pooling to feature The feature of figure extracts, cm=max { c }, after being handled by pond layer, the text of different length all becomes the spy of equal length Sign.The input of full articulamentum is the feature output of pond layer, inputs and is P is the type of convolution kernel, and q is the number of every kind of convolution kernel, and output layer carries out kind judging using softmax function, to reach To operation flow intelligent recommendation.
CN201910723500.0A 2019-08-07 2019-08-07 Business process optimization method of intelligent office automation system Active CN110489749B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910723500.0A CN110489749B (en) 2019-08-07 2019-08-07 Business process optimization method of intelligent office automation system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910723500.0A CN110489749B (en) 2019-08-07 2019-08-07 Business process optimization method of intelligent office automation system

Publications (2)

Publication Number Publication Date
CN110489749A true CN110489749A (en) 2019-11-22
CN110489749B CN110489749B (en) 2021-05-18

Family

ID=68550041

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910723500.0A Active CN110489749B (en) 2019-08-07 2019-08-07 Business process optimization method of intelligent office automation system

Country Status (1)

Country Link
CN (1) CN110489749B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111782811A (en) * 2020-07-03 2020-10-16 湖南大学 E-government affair sensitive text detection method based on convolutional neural network and support vector machine
CN112712177A (en) * 2020-12-29 2021-04-27 上海永骁智能技术有限公司 Knowledge engineering method and device based on cooperative processing
CN114331226A (en) * 2022-03-08 2022-04-12 天津联创科技发展有限公司 Intelligent enterprise demand diagnosis method and system and storage medium
CN115907674A (en) * 2022-12-13 2023-04-04 广州明动软件股份有限公司 Intelligent efficiency analysis method and system based on AI algorithm and neural engine

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102904885B (en) * 2012-09-26 2015-04-29 北京工业大学 Compound authentication method of multi-identity authentication information feature
CN108629550A (en) * 2017-03-17 2018-10-09 武汉唐科技有限公司 It is a kind of can backstage automatic running the office automation system implementation method
CN107133785A (en) * 2017-06-12 2017-09-05 山东浪潮云服务信息科技有限公司 A kind of pending business prompting method of office system based on domestic CPU
CN109635116B (en) * 2018-12-17 2023-03-24 腾讯科技(深圳)有限公司 Training method of text word vector model, electronic equipment and computer storage medium

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111782811A (en) * 2020-07-03 2020-10-16 湖南大学 E-government affair sensitive text detection method based on convolutional neural network and support vector machine
CN112712177A (en) * 2020-12-29 2021-04-27 上海永骁智能技术有限公司 Knowledge engineering method and device based on cooperative processing
CN114331226A (en) * 2022-03-08 2022-04-12 天津联创科技发展有限公司 Intelligent enterprise demand diagnosis method and system and storage medium
CN115907674A (en) * 2022-12-13 2023-04-04 广州明动软件股份有限公司 Intelligent efficiency analysis method and system based on AI algorithm and neural engine
CN115907674B (en) * 2022-12-13 2023-12-01 广州明动软件股份有限公司 Intelligent efficiency analysis method and system based on AI algorithm and neural engine

Also Published As

Publication number Publication date
CN110489749B (en) 2021-05-18

Similar Documents

Publication Publication Date Title
Yang et al. Social media data analytics for business decision making system to competitive analysis
CN110489749A (en) Intelligent Office-Automation System Work Flow Optimizing
US20130151305A1 (en) Method and Apparatus for Business Drivers and Outcomes to Enable Scenario Planning and Simulation
CN109491642A (en) A kind of Requirements Modeling system and method based on scene, information data processing terminal
US20060075391A1 (en) Distributed scenario generation
JP2008544407A (en) Technical methods and tools for capability-based multiple family of systems planning
CN114385150B (en) Low code development method based on intelligent engine
CN110490547A (en) Office system intellectualized technology
US8051154B2 (en) Enterprise service delivery technical framework
CN113902394A (en) Service platform for realizing artificial intelligence industry ecology
CN115686280A (en) Deep learning model management system, method, computer device and storage medium
US20120078967A1 (en) Integration of a Framework Application and a Task Database
CN113421056A (en) Internet human resource management system
CN110489667A (en) Intelligent circulation of official document technology based on user's portrait
CN116739408A (en) Power grid dispatching safety monitoring method and system based on data tag and electronic equipment
Nielsen et al. Level of visualization support for project communication in the Turkish construction industry: A quality function deployment approach
CN114924879A (en) Online design service system for enterprise marketing planning scheme
KR20060012572A (en) System and methods for managing distributed design chains
Li et al. Enterprise precision marketing effectiveness model based on data mining technology
CN112906683A (en) Text labeling method, device and equipment
Shen et al. Enhance the evaluation quality of project performance based on fuzzy aggregation weight effect
Adikara et al. Combining Organization Goal-Oriented Elicitation and Wireframe for Fishery Product Sales Mobile Application Design
Polat Digital Transformation Process in Selection and Implementation of an Electronic Document Management System
Wang Design and Simulation of Human Resource Allocation Model Based on Artificial Intelligence and In-Depth Data Analysis
Amissah et al. Data-centric development of architecture models with the DM2 and MS-SDF

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant