CN108269205A - An electronic data identification system using a cloud platform - Google Patents
An electronic data identification system using a cloud platform
- Publication number
- CN108269205A
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/18—Legal services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Abstract
The present invention provides an electronic data identification system using a cloud platform, comprising: a cloud storage unit for storing document information related to a case; a voice collecting unit for collecting voice information at the trial scene of the case; and an intelligent voice-text conversion unit for converting the voice information into text. The platform greatly improves speech recognition accuracy at the trial scene and helps improve trial efficiency.
Description
Technical field
The invention belongs to the technical field of legal big data processing, and in particular relates to an electronic data identification system using a cloud platform.
Background art
With the continuous development of information technology, the judicial system places increasingly high demands on automated processing. The case information tried by the courts is already sufficient to constitute data to be processed in the big-data sense. A search shows that, in the prior art, application No. CN201710426297.1 provides an intelligent auxiliary case-handling method for criminal cases, suitable for providing case-handling assistance to a public security subsystem, a procuratorate subsystem, and a court subsystem, each of which communicates with a server. The method comprises the following steps:
Step 21: the public security subsystem collects, enters, and verifies one or more pieces of evidence material for the case;
Step 22: the public security subsystem obtains an input pending-request instruction and judges, according to the verification results of the evidence material, whether the requirement for submitting a pending request is met; if so, it sends the pending request to the procuratorate subsystem through the server;
Step 23: the procuratorate subsystem obtains the evidence material for the case and its verification results from the public security subsystem;
Step 24: the procuratorate subsystem reviews the chain of evidence formed by the evidence material of the case;
Step 25: the procuratorate subsystem obtains an input instruction to issue the review result and judges, according to the chain-of-evidence review result, whether the requirement for issuing the review result is met; if so, it sends the review result to the public security subsystem or the court subsystem through the server.
However, in the prior art, as national laws become increasingly sound and complete and people's legal awareness rises, the number of judicial cases keeps growing. When handling a case, people habitually look up related cases for reference, so as to better know and understand the key points of the case and the relevant laws. For existing case inquiry or retrieval, people generally query broadly through a general-purpose search engine. The accuracy of this approach is relatively low, and a useful reference case is usually found only after extensive screening. People can also query or retrieve cases through the dedicated systems of the judicial departments. Compared with a general-purpose search engine, this dedicated approach is more accurate, but it is cumbersome in both procedure and operation and cannot be flexibly applied to retrieval by the general public. Moreover, conventional judicial case retrieval is usually implemented with a keyword-based full-text retrieval system, which can only check directly whether the relevant keywords occur, so its accuracy is also relatively low. The same is true of electronic data identification systems.
Summary of the invention
In order to improve the level of automation in the trial of electronic data identification cases in the judicial system, the present invention provides an electronic data identification system using a cloud platform, comprising:
a cloud storage unit, for storing document information related to an electronic data identification case;
a voice collecting unit, for collecting voice information at the trial scene of the electronic data identification case;
an electronic data comparing unit, for searching the cloud storage unit for a corresponding document based on the voice information.
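As an illustration of how the three units listed above might fit together, the following is a minimal Python sketch and not the patented implementation. The class names mirror the unit names; the in-memory dictionary, the `transcribe` callable, and the case-insensitive title matching are assumptions introduced only for this example.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class CloudStorageUnit:
    """Hypothetical in-memory stand-in for the cloud storage unit."""
    documents: dict[str, str] = field(default_factory=dict)  # title -> content

    def store(self, title: str, content: str) -> None:
        self.documents[title] = content


@dataclass
class VoiceCollectingUnit:
    """Hypothetical stand-in for audio capture: returns pre-recorded clips."""
    recordings: list[bytes] = field(default_factory=list)

    def collect(self) -> list[bytes]:
        return list(self.recordings)


@dataclass
class ElectronicDataComparingUnit:
    """Looks up stored documents based on what was said."""
    storage: CloudStorageUnit
    transcribe: Callable[[bytes], str]  # assumed speech-to-text function

    def find_documents(self, audio: bytes) -> list[str]:
        # Convert the audio to text, then return the titles of stored
        # documents that are mentioned in that text (case-insensitive).
        text = self.transcribe(audio).lower()
        return sorted(t for t in self.storage.documents if t.lower() in text)
```

In a test setup, `transcribe` can be replaced by a trivial function such as `lambda audio: audio.decode("utf-8")`, so the lookup path can be exercised without a real speech recognizer.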
Further, the electronic data comparing unit comprises:
an intelligent voice-text conversion unit, for converting the voice information into text;
a document selection unit, for selecting a document according to the voice information;
a document marking unit, for marking the document according to the voice information;
a hearing process confirmation unit, for generating a hearing process confirmation document according to the voice information.
Further, the voice collecting unit is a multi-channel voice collecting unit.
Further, the multi-channel voice collecting unit comprises a speech signal analysis unit and a plurality of microphones arranged in a distributed manner.
Further, the document selection unit comprises a document title determination unit, for generating text according to the voice information and selecting a document from the cloud storage unit according to the document title corresponding to the text.
Further, the document marking unit is used for adding the text to the document.
Further, the cloud storage unit is a relational cloud storage unit.
Further, the hearing process confirmation unit comprises:
a document establishing unit, for establishing the hearing process confirmation document;
a content generation unit, for adding the document and the text to the hearing process confirmation document.
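The document marking and hearing process confirmation steps described above can be pictured with a small Python sketch. It is only one possible reading of the text: the `CaseDocument` and `HearingConfirmation` types, their fields, and the rendered layout are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class CaseDocument:
    """Hypothetical representation of a stored case document."""
    title: str
    body: str
    annotations: list[str] = field(default_factory=list)


def mark_document(document: CaseDocument, text: str) -> None:
    """Document marking: add the recognized text to the document."""
    document.annotations.append(text)


@dataclass
class HearingConfirmation:
    """Hypothetical hearing process confirmation document."""
    case_id: str
    entries: list[tuple[str, str]] = field(default_factory=list)

    def add(self, document: CaseDocument, text: str) -> None:
        # Content generation: record which document was discussed
        # together with the text recognized from the speech.
        self.entries.append((document.title, text))

    def render(self) -> str:
        lines = [f"Hearing process confirmation: {self.case_id}"]
        for index, (title, text) in enumerate(self.entries, start=1):
            lines.append(f"{index}. [{title}] {text}")
        return "\n".join(lines)
```

Keeping the confirmation record as an ordered list of (document title, text) pairs preserves the sequence of the hearing, which seems to be the point of a confirmation document, although the patent does not specify a layout.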
Further, the document title determination unit includes:
S(ω) is the energy spectral density function of the speech signal and c_n are the cepstral coefficients; from c_n the cepstral distance l is obtained as:
N is the number of microphones, and d is the standard deviation of the distances of the microphone positions relative to the position of the trial judge, where N(n) is the noise signal in the voice information and S(n) is the speech signal of the voice information after the noise is removed; p_i(k) is the speech frequency component based on the cepstral distance,
The short-time average energy of the speech signal x(n) of microphone n at time n is:
in which the window function is the Hamming window;
The p_i(k) of the microphones are assembled into a matrix, and this matrix is multiplied by the eigenvalues of E_n to obtain the covariance matrix G;
The eigenvalue A of the matrix G is computed, from which the cepstrum adjustment signal for each microphone is obtained
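For readers unfamiliar with cepstral coefficients and cepstral distance, the following Python sketch shows the common textbook definitions. It is an assumption that the patent's formula resembles them: the real cepstrum is taken as the inverse DFT of the log magnitude spectrum, and the distance is the usual truncated form sqrt((c_0 - c'_0)^2 + 2 * sum_{k=1..p} (c_k - c'_k)^2). The function names and the default order of 12 are illustrative.

```python
import cmath
import math


def real_cepstrum(frame: list[float]) -> list[float]:
    """Real cepstrum: inverse DFT of the log magnitude spectrum."""
    n = len(frame)
    # Forward DFT (naive O(n^2) form, adequate for a short frame).
    spectrum = [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]
    # A small floor avoids log(0) for empty frequency bins.
    log_mag = [math.log(max(abs(s), 1e-12)) for s in spectrum]
    # Inverse DFT; the log magnitude of a real signal is even, so the
    # result is real up to rounding and only the real part is kept.
    return [
        sum(v * cmath.exp(2j * math.pi * k * t / n) for k, v in enumerate(log_mag)).real / n
        for t in range(n)
    ]


def cepstral_distance(c_a: list[float], c_b: list[float], order: int = 12) -> float:
    """Truncated cepstral distance over coefficients 0..order."""
    d0 = (c_a[0] - c_b[0]) ** 2
    rest = sum((c_a[i] - c_b[i]) ** 2 for i in range(1, order + 1))
    return math.sqrt(d0 + 2.0 * rest)
```

Under these definitions, changing only the gain of a frame changes only the zeroth coefficient, so the distance between a frame and a louder copy of itself equals the absolute log of the gain ratio. This is one reason cepstral measures are often used to compare spectral shape across microphones at different distances.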
The technical solution of the present invention has the following advantages:
An electronic data identification system using a cloud platform is realized that can reduce noise at the trial scene to the greatest extent. Through the platform, the noise arising when speech is converted into text can be effectively reduced. According to tests, its recognition effectiveness is more than 70% higher than that of prior-art speech-to-text recognition technologies based on models such as HMM. It is therefore well suited to providing stable and reliable text conversion when electronic data identification cases are tried at any location outside the courtroom, greatly improving trial efficiency.
Description of the drawings
Fig. 1 shows a block diagram of the platform composition according to a preferred embodiment of the invention.
Detailed description
As shown in Fig. 1, an electronic data identification system using a cloud platform comprises:
a cloud storage unit, for storing document information related to an electronic data identification case;
a voice collecting unit, for collecting voice information at the trial scene of the electronic data identification case;
an electronic data comparing unit, for searching the cloud storage unit for a corresponding document based on the voice information.
The electronic data comparing unit comprises:
an intelligent voice-text conversion unit, for converting the voice information into text;
a document selection unit, for selecting a document according to the voice information;
a document marking unit, for marking the document according to the voice information;
a hearing process confirmation unit, for generating a hearing process confirmation document according to the voice information.
The voice collecting unit is a multi-channel voice collecting unit.
The multi-channel voice collecting unit comprises a speech signal analysis unit and a plurality of microphones arranged in a distributed manner.
The document selection unit comprises a document title determination unit, for generating text according to the voice information and selecting a document from the cloud storage unit according to the document title corresponding to the text.
The document marking unit is used for adding the text to the document.
The cloud storage unit is a relational cloud storage unit.
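To make the combination of relational storage and title-based document selection concrete, here is a small Python sketch that uses the standard-library `sqlite3` module as a local stand-in for a relational cloud store. The table layout, column names, and helper functions are assumptions made for illustration; the patent does not specify a schema.

```python
import sqlite3


def build_store() -> sqlite3.Connection:
    """Create an in-memory relational store with a hypothetical schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE documents ("
        " id INTEGER PRIMARY KEY,"
        " case_id TEXT NOT NULL,"
        " title TEXT NOT NULL UNIQUE,"
        " body TEXT NOT NULL)"
    )
    return conn


def add_document(conn: sqlite3.Connection, case_id: str, title: str, body: str) -> None:
    conn.execute(
        "INSERT INTO documents (case_id, title, body) VALUES (?, ?, ?)",
        (case_id, title, body),
    )


def select_by_spoken_text(conn: sqlite3.Connection, case_id: str, text: str) -> list[str]:
    """Return titles of the case's documents whose title occurs in the text."""
    rows = conn.execute(
        # instr(X, Y) > 0 holds when the string Y occurs inside X.
        "SELECT title FROM documents"
        " WHERE case_id = ? AND instr(?, title) > 0"
        " ORDER BY title",
        (case_id, text),
    )
    return [row[0] for row in rows]
```

Filtering by `case_id` before matching titles keeps documents from unrelated cases out of the result, which a relational layout makes straightforward. Note that `instr` compares case-sensitively, so a real system would likely normalize both the titles and the recognized text first.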
The hearing process confirmation unit comprises:
a document establishing unit, for establishing the hearing process confirmation document;
a content generation unit, for adding the document and the text to the hearing process confirmation document.
The document title determination unit includes:
S(ω) is the energy spectral density function of the speech signal and c_n are the cepstral coefficients; from c_n the cepstral distance l is obtained as:
N is the number of microphones, and d is the standard deviation of the distances of the microphone positions relative to the position of the trial judge, where N(n) is the noise signal in the voice information and S(n) is the speech signal of the voice information after the noise is removed; p_i(k) is the speech frequency component based on the cepstral distance,
The short-time average energy of the speech signal x(n) of microphone n at time n is:
in which the window function is the Hamming window;
The p_i(k) of the microphones are assembled into a matrix, and this matrix is multiplied by the eigenvalues of E_n to obtain the covariance matrix G;
The eigenvalue A of the matrix G is computed, from which the cepstrum adjustment signal for each microphone is obtained
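The short-time energy with a Hamming window mentioned above can be illustrated with the following Python sketch. It uses the common textbook form E_n = sum_m [x(m) w(n - m)]^2 with w(k) = 0.54 - 0.46 cos(2πk / (L - 1)); whether the patent normalizes by the window length is not recoverable from the text, so the unnormalized form and the function names are assumptions.

```python
import math


def hamming(length: int) -> list[float]:
    """Hamming window w(k) = 0.54 - 0.46 * cos(2*pi*k / (length - 1))."""
    if length == 1:
        return [1.0]
    return [
        0.54 - 0.46 * math.cos(2.0 * math.pi * k / (length - 1))
        for k in range(length)
    ]


def short_time_energy(x: list[float], n: int, length: int) -> float:
    """Energy of the Hamming-windowed frame that ends at sample index n."""
    window = hamming(length)
    total = 0.0
    for k in range(length):
        m = n - k  # sample index covered by window tap k
        if 0 <= m < len(x):  # samples outside the signal count as zero
            total += (x[m] * window[k]) ** 2
    return total
```

In a multi-microphone setup, a function like this could be evaluated per channel at the same time index to compare how strongly each microphone picks up the current speaker, which is presumably the role the energy term plays in the per-microphone adjustment described above.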
The foregoing is merely a preferred embodiment of the present invention and is not intended to limit the invention. Any modification, equivalent replacement, improvement, and the like made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.
Claims (9)
1. An electronic data identification system using a cloud platform, characterized by comprising:
a cloud storage unit, for storing document information related to an electronic data identification case;
a voice collecting unit, for collecting voice information at the trial scene of the electronic data identification case;
an electronic data comparing unit, for searching the cloud storage unit for a corresponding document based on the voice information.
2. The system according to claim 1, characterized in that the electronic data comparing unit comprises:
an intelligent voice-text conversion unit, for converting the voice information into text;
a document selection unit, for selecting a document according to the voice information;
a document marking unit, for marking the document according to the voice information;
a hearing process confirmation unit, for generating a hearing process confirmation document according to the voice information.
3. The system according to claim 2, characterized in that the voice collecting unit is a multi-channel voice collecting unit.
4. The system according to claim 3, characterized in that the multi-channel voice collecting unit comprises a voice information processing unit and a plurality of microphones arranged in a distributed manner.
5. The system according to claim 2, characterized in that the document selection unit comprises a document title determination unit, for generating text according to the voice information and selecting a document from the cloud storage unit according to the document title corresponding to the text.
6. The system according to claim 2, characterized in that the document marking unit is used for adding the text to the document.
7. The system according to claim 2, characterized in that the cloud storage unit is a relational cloud storage unit.
8. The system according to claim 2, characterized in that the hearing process confirmation unit comprises:
a document establishing unit, for establishing the hearing process confirmation document;
a content generation unit, for adding the document and the text to the hearing process confirmation document.
9. The system according to claim 2, characterized in that the document title determination unit includes:
S(ω) is the energy spectral density function of the speech signal and c_n are the cepstral coefficients; from c_n the cepstral distance l is obtained as:
N is the number of microphones, and d is the standard deviation of the distances of the microphone positions relative to the position of the trial judge, where N(n) is the noise signal in the voice information and S(n) is the speech signal of the voice information after the noise is removed; p_i(k) is the speech frequency component based on the cepstral distance,
The short-time average energy of the speech signal x(n) of microphone n at time n is:
in which the window function is the Hamming window;
The p_i(k) of the microphones are assembled into a matrix, and this matrix is multiplied by the eigenvalues of E_n to obtain the covariance matrix G;
The eigenvalue A of the matrix G is computed, from which the cepstrum adjustment signal for each microphone is obtained
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810067591.2A CN108269205A (en) | 2018-01-24 | 2018-01-24 | An electronic data identification system using a cloud platform |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810067591.2A CN108269205A (en) | 2018-01-24 | 2018-01-24 | An electronic data identification system using a cloud platform |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108269205A (en) | 2018-07-10 |
Family
ID=62776484
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810067591.2A Pending CN108269205A (en) | 2018-01-24 | 2018-01-24 | An electronic data identification system using a cloud platform |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108269205A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002527800A (en) * | 1998-10-02 | 2002-08-27 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Conversation browser and conversation system |
CN101030129A (en) * | 2006-03-03 | 2007-09-05 | 北京速迅科技有限公司 | Character and language synchronizing method and synchronizer |
CN106326640A (en) * | 2016-08-12 | 2017-01-11 | 上海交通大学医学院附属瑞金医院卢湾分院 | Medical speech control system and control method thereof |
Non-Patent Citations (1)
Title |
---|
王金甲: "噪声环境下鲁棒性文本自由说话人辨认系统的研究", 《中国优秀硕士学位论文 全文数据库 信息科技辑 第02期》 * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
Application publication date: 20180710 |