CN108269205A - An electronic data identification system using a cloud platform - Google Patents

An electronic data identification system using a cloud platform

Info

Publication number
CN108269205A
Authority
CN
China
Prior art keywords
document
unit
voice
voice information
electronic data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810067591.2A
Other languages
Chinese (zh)
Inventor
蒋志群
李弘珊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Shun Siyuan Information Technology Co Ltd
Original Assignee
Chengdu Shun Siyuan Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Shun Siyuan Information Technology Co Ltd
Priority to CN201810067591.2A
Publication of CN108269205A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/18 Legal services
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/26 Government or public services
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Technology Law (AREA)
  • Quality & Reliability (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Signal Processing (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides an electronic data identification system using a cloud platform, comprising: a cloud storage unit for storing document information related to a case; a voice collecting unit for collecting voice information at the case hearing; and an intelligent voice-to-text conversion unit for converting the voice information into text. The platform greatly improves speech recognition accuracy at the hearing and helps improve hearing efficiency.

Description

An electronic data identification system using a cloud platform
Technical field
The invention belongs to the field of legal big data processing technology, and in particular relates to an electronic data identification system using a cloud platform.
Background art
With the continuous development of information technology, the judicial system has placed ever higher demands on automated processing. The information on the various cases heard by courts is already enough to constitute pending data in the big-data sense. A search of the prior art found application No. CN201710426297.1, which provides an intelligent auxiliary case-handling method for criminal cases. The method is suitable for providing case-handling assistance to a public security subsystem, a procuratorate subsystem and a court subsystem, each of which communicates with a server, and it includes the following steps:
Step 21: the public security subsystem collects, enters and verifies one or more evidence documents of the case;
Step 22: the public security subsystem obtains an input instruction to submit a request and judges, from the verification results of the evidence documents, whether the requirements for submitting the request are met; if so, it sends the request to the procuratorate subsystem through the server;
Step 23: the procuratorate subsystem obtains the evidence documents of the case and the public security subsystem's verification results for them;
Step 24: the procuratorate subsystem examines the chain of evidence formed by the evidence documents of the case;
Step 25: the procuratorate subsystem obtains an input instruction to issue the examination result and judges, from the chain-of-evidence examination result, whether the requirements for issuing it are met; if so, it sends the examination result through the server to the public security subsystem or the court subsystem.
However, as national law becomes increasingly sound and complete and people's legal awareness grows, the number of judicial cases keeps rising. When handling a case, people also habitually look up related cases for reference, so as to better understand the issues of the case and the relevant law. At present, such case queries are usually made through general-purpose search engines. This approach has low accuracy, and a useful reference case is typically found only after extensive manual screening. Cases can also be queried or retrieved through the dedicated systems of judicial departments. Compared with a general-purpose search engine, this dedicated approach is more accurate, but its procedures and operation are cumbersome and it cannot be flexibly applied to retrieval by the general public. Moreover, conventional judicial case retrieval is usually implemented as keyword-based full-text search, which can only check whether the keywords occur and therefore also has low accuracy. The same holds for electronic data identification systems.
Summary of the invention
To raise the level of automation in the hearing of electronic data identification cases in the judicial system, the present invention provides an electronic data identification system using a cloud platform, comprising:
a cloud storage unit for storing document information related to an electronic data identification case;
a voice collecting unit for collecting voice information at the hearing of the electronic data identification case;
an electronic data comparison unit for looking up the corresponding document in the cloud storage unit based on the voice information.
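The three units listed above can be pictured as a small pipeline. The following is a minimal Python sketch under stated assumptions, not the patent's implementation: the class names, the in-memory dictionary standing in for cloud storage, and the `transcribe` callable standing in for speech-to-text are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class CloudStorageUnit:
    """Hypothetical in-memory stand-in for cloud storage: title -> body."""
    documents: Dict[str, str] = field(default_factory=dict)

    def store(self, title: str, body: str) -> None:
        self.documents[title] = body


@dataclass
class VoiceCollectingUnit:
    """Buffers raw audio samples captured at the hearing."""
    samples: List[float] = field(default_factory=list)

    def collect(self, chunk: List[float]) -> None:
        self.samples.extend(chunk)


@dataclass
class ElectronicDataComparisonUnit:
    """Looks up stored documents whose title is spoken in the audio."""
    storage: CloudStorageUnit
    transcribe: Callable[[List[float]], str]  # assumed speech-to-text hook

    def lookup(self, audio: List[float]) -> List[str]:
        # Case-insensitive substring match of each stored title in the text.
        text = self.transcribe(audio).lower()
        return [t for t in self.storage.documents if t.lower() in text]
```

In this sketch the comparison unit depends only on a callable, so any speech recognizer could be plugged in without changing the lookup logic.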
Further, the electronic data comparison unit includes:
an intelligent voice-to-text conversion unit for converting the voice information into text;
a document selection unit for choosing a document according to the voice information;
a document markup unit for marking the document according to the voice information;
a hearing process confirmation unit for generating a hearing process confirmation document according to the voice information.
Further, the voice collecting unit is a multichannel voice collecting unit.
Further, the multichannel voice collecting unit includes a speech signal analysis unit and multiple microphones in a distributed arrangement.
Further, the document selection unit includes a document title determination unit for generating text from the voice information and choosing a document from the cloud storage unit according to the document title corresponding to the text.
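Choosing a document by the title that corresponds to the recognized text could, for example, be done with fuzzy string matching so that small recognition errors are tolerated. The sketch below is an illustration only: it uses Python's standard `difflib.get_close_matches`, and the function name `choose_documents`, the `limit` and `cutoff` parameters and the sample titles are assumptions, not part of the patent.

```python
import difflib
from typing import List


def choose_documents(transcript: str, titles: List[str],
                     limit: int = 3, cutoff: float = 0.6) -> List[str]:
    """Return stored titles that resemble the transcribed phrase, best first."""
    # Compare in lower case, then map the hits back to the original titles.
    by_lower = {title.lower(): title for title in titles}
    hits = difflib.get_close_matches(transcript.lower(), list(by_lower),
                                     n=limit, cutoff=cutoff)
    return [by_lower[h] for h in hits]


titles = ["Hash Verification Report",
          "Mobile Phone Extraction Report",
          "Server Log Analysis"]
# A transcript with a recognition typo still ranks the intended title first.
best = choose_documents("hash verification reprot", titles)
```

A higher `cutoff` makes the selection stricter, which may be preferable when many stored titles are similar.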
Further, the document markup unit is used to add the text to the document.
Further, the cloud storage unit is a relational cloud storage unit.
Further, the hearing process confirmation unit includes:
a document establishing unit for creating a hearing process confirmation document;
a content generation unit for adding the document and the text to the hearing process confirmation document.
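The two sub-units of the hearing process confirmation unit map naturally onto a "create" step and an "append" step. The following Python sketch shows one possible shape; the names `HearingConfirmationDocument`, `establish_document` and `append_content`, and the rendered layout, are hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class HearingConfirmationDocument:
    """Record of which documents were discussed and what was said."""
    case_id: str
    entries: List[Tuple[str, str]] = field(default_factory=list)

    def render(self) -> str:
        lines = [f"Hearing process confirmation: {self.case_id}"]
        for i, (title, text) in enumerate(self.entries, 1):
            lines.append(f"{i}. [{title}] {text}")
        return "\n".join(lines)


def establish_document(case_id: str) -> HearingConfirmationDocument:
    """Role of the document establishing unit: create an empty record."""
    return HearingConfirmationDocument(case_id)


def append_content(doc: HearingConfirmationDocument,
                   title: str, text: str) -> HearingConfirmationDocument:
    """Role of the content generation unit: add a document and its text."""
    doc.entries.append((title, text))
    return doc
```

Keeping the entries as (title, text) pairs preserves the link between each chosen document and the transcribed speech that referred to it.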
Further, the document title determination unit includes:
S(w) is the energy spectral density function of the speech signal and c_n is the cepstrum coefficient; from c_n the cepstrum distance l is obtained as:
N is the number of microphones, d is the standard deviation of the distances of the microphone positions from the presiding judge's position, N(n) is the noise signal in the voice information, and S(n) is the speech signal of the voice information after noise removal; p_i(k) is the speech frequency component based on the cepstrum distance,
The short-time average energy of the speech signal x(n) of microphone n at time n is:
where the window is the Hamming window function;
The p_i(k) of the microphones are assembled into a matrix, and this matrix is multiplied by the eigenvalue of E_n to obtain the covariance matrix G;
The eigenvalue A of the matrix G is computed, which in turn gives the cepstrum adjustment signal for each microphone
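The formulas for these quantities are not reproduced in this text, so the sketch below falls back on common textbook definitions of the Hamming window, short-time average energy, real cepstrum and cepstral distance. It is an assumption-laden illustration in pure Python: all function names are hypothetical, and the microphone weighting, the covariance matrix G and the eigenvalue step are deliberately left out because their exact form cannot be recovered from the text.

```python
import cmath
import math
from typing import List, Sequence


def hamming(length: int) -> List[float]:
    """Textbook Hamming window of the given length."""
    if length < 2:
        return [1.0] * length
    return [0.54 - 0.46 * math.cos(2 * math.pi * m / (length - 1))
            for m in range(length)]


def short_time_energy(frame: Sequence[float]) -> float:
    """Mean squared value of the Hamming-windowed (non-empty) frame."""
    window = hamming(len(frame))
    return sum((s * w) ** 2 for s, w in zip(frame, window)) / len(frame)


def _dft(values: Sequence[complex], inverse: bool = False) -> List[complex]:
    """Naive O(n^2) discrete Fourier transform, enough for short frames."""
    n = len(values)
    sign = 1 if inverse else -1
    out = [sum(v * cmath.exp(sign * 2j * math.pi * k * m / n)
               for m, v in enumerate(values)) for k in range(n)]
    return [v / n for v in out] if inverse else out


def real_cepstrum(frame: Sequence[float], floor: float = 1e-12) -> List[float]:
    """Inverse DFT of the log magnitude spectrum of the windowed frame."""
    window = hamming(len(frame))
    spectrum = _dft([s * w for s, w in zip(frame, window)])
    log_mag = [math.log(max(abs(v), floor)) for v in spectrum]
    return [c.real for c in _dft(log_mag, inverse=True)]


def cepstral_distance(a: Sequence[float], b: Sequence[float],
                      order: int = 12) -> float:
    """Euclidean distance between cepstral coefficients 1..order."""
    return math.sqrt(sum((x - y) ** 2
                         for x, y in zip(a[1:order + 1], b[1:order + 1])))
```

Skipping coefficient 0 makes the distance insensitive to overall loudness, since a gain change only shifts the zeroth cepstral coefficient. That property is convenient when microphones sit at different distances from the speaker.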
The technical solution of the present invention has the following advantages:
It provides a cloud-platform electronic data identification system that minimizes noise at the hearing. Through the platform, noise arising during voice-to-text conversion is effectively reduced. According to tests, its recognition effectiveness is more than 70% higher than that of prior-art voice-to-text recognition based on models such as HMM. The system is therefore well suited to providing stable and reliable text conversion when electronic data identification cases are heard at any location outside a courtroom, which greatly improves hearing efficiency.
Description of the drawings
Fig. 1 shows a block diagram of the platform according to a preferred embodiment of the present invention.
Detailed description
As shown in Fig. 1, an electronic data identification system using a cloud platform comprises:
a cloud storage unit for storing document information related to an electronic data identification case;
a voice collecting unit for collecting voice information at the hearing of the electronic data identification case;
an electronic data comparison unit for looking up the corresponding document in the cloud storage unit based on the voice information.
The electronic data comparison unit includes:
an intelligent voice-to-text conversion unit for converting the voice information into text;
a document selection unit for choosing a document according to the voice information;
a document markup unit for marking the document according to the voice information;
a hearing process confirmation unit for generating a hearing process confirmation document according to the voice information.
The voice collecting unit is a multichannel voice collecting unit.
The multichannel voice collecting unit includes a speech signal analysis unit and multiple microphones in a distributed arrangement.
The document selection unit includes a document title determination unit for generating text from the voice information and choosing a document from the cloud storage unit according to the document title corresponding to the text.
The document markup unit is used to add the text to the document.
The cloud storage unit is a relational cloud storage unit.
The hearing process confirmation unit includes:
a document establishing unit for creating a hearing process confirmation document;
a content generation unit for adding the document and the text to the hearing process confirmation document.
The document title determination unit includes:
S(w) is the energy spectral density function of the speech signal and c_n is the cepstrum coefficient; from c_n the cepstrum distance l is obtained as:
N is the number of microphones, d is the standard deviation of the distances of the microphone positions from the presiding judge's position, N(n) is the noise signal in the voice information, and S(n) is the speech signal of the voice information after noise removal; p_i(k) is the speech frequency component based on the cepstrum distance,
The short-time average energy of the speech signal x(n) of microphone n at time n is:
where the window is the Hamming window function;
The p_i(k) of the microphones are assembled into a matrix, and this matrix is multiplied by the eigenvalue of E_n to obtain the covariance matrix G;
The eigenvalue A of the matrix G is computed, which in turn gives the cepstrum adjustment signal for each microphone
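For orientation, the standard textbook forms of the quantities named above are given here. They are assumptions offered as a reading aid, not the expressions used in the application: the window length M and the primed spectrum S' are symbols introduced only for this illustration, S(w) is written as S(\omega), and the microphone-dependent terms (d, p_i(k), G, A) are not covered.

```latex
% Hamming window of length M
w(m) = 0.54 - 0.46\cos\!\left(\frac{2\pi m}{M-1}\right), \qquad 0 \le m \le M-1

% Short-time energy of x at time n (divide by M for an average)
E_n = \sum_{m=n-M+1}^{n} \bigl[x(m)\,w(n-m)\bigr]^2

% Cepstrum coefficients of a signal with spectral density S(\omega)
c_n = \frac{1}{2\pi}\int_{-\pi}^{\pi} \ln S(\omega)\, e^{j\omega n}\, d\omega

% Cepstral distance between two spectra S and S' (Parseval's relation)
d_{\mathrm{cep}}^2
  = \frac{1}{2\pi}\int_{-\pi}^{\pi}\bigl|\ln S(\omega)-\ln S'(\omega)\bigr|^2 d\omega
  = \sum_{n=-\infty}^{\infty}\bigl(c_n - c'_n\bigr)^2
```

In practice the infinite sum is truncated to a small number of low-order coefficients.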
The foregoing is merely a preferred embodiment of the present invention and is not intended to limit it. Any modification, equivalent replacement or improvement made within the spirit and principle of the present invention shall fall within its scope of protection.

Claims (9)

1. An electronic data identification system using a cloud platform, characterized by comprising:
a cloud storage unit for storing document information related to an electronic data identification case;
a voice collecting unit for collecting voice information at the hearing of the electronic data identification case;
an electronic data comparison unit for looking up the corresponding document in the cloud storage unit based on the voice information.
2. The system according to claim 1, characterized in that the electronic data comparison unit includes:
an intelligent voice-to-text conversion unit for converting the voice information into text;
a document selection unit for choosing a document according to the voice information;
a document markup unit for marking the document according to the voice information;
a hearing process confirmation unit for generating a hearing process confirmation document according to the voice information.
3. The system according to claim 2, characterized in that the voice collecting unit is a multichannel voice collecting unit.
4. The system according to claim 3, characterized in that the multichannel voice collecting unit includes a voice information processing unit and multiple microphones in a distributed arrangement.
5. The system according to claim 2, characterized in that the document selection unit includes a document title determination unit for generating text from the voice information and choosing a document from the cloud storage unit according to the document title corresponding to the text.
6. The system according to claim 2, characterized in that the document markup unit is used to add the text to the document.
7. The system according to claim 2, characterized in that the cloud storage unit is a relational cloud storage unit.
8. The system according to claim 2, characterized in that the hearing process confirmation unit includes:
a document establishing unit for creating a hearing process confirmation document;
a content generation unit for adding the document and the text to the hearing process confirmation document.
9. The system according to claim 2, characterized in that the document title determination unit includes:
S(w) is the energy spectral density function of the speech signal and c_n is the cepstrum coefficient; from c_n the cepstrum distance l is obtained as:
N is the number of microphones, d is the standard deviation of the distances of the microphone positions from the presiding judge's position, N(n) is the noise signal in the voice information, and S(n) is the speech signal of the voice information after noise removal; p_i(k) is the speech frequency component based on the cepstrum distance,
The short-time average energy of the speech signal x(n) of microphone n at time n is:
where the window is the Hamming window function;
The p_i(k) of the microphones are assembled into a matrix, and this matrix is multiplied by the eigenvalue of E_n to obtain the covariance matrix G;
The eigenvalue A of the matrix G is computed, which in turn gives the cepstrum adjustment signal for each microphone

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810067591.2A CN108269205A (en) 2018-01-24 2018-01-24 An electronic data identification system using a cloud platform

Publications (1)

Publication Number Publication Date
CN108269205A (en) 2018-07-10

Family

ID=62776484

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810067591.2A Pending CN108269205A (en) 2018-01-24 2018-01-24 An electronic data identification system using a cloud platform

Country Status (1)

Country Link
CN (1) CN108269205A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002527800A (en) * 1998-10-02 2002-08-27 インターナショナル・ビジネス・マシーンズ・コーポレーション Conversation browser and conversation system
CN101030129A (en) * 2006-03-03 2007-09-05 北京速迅科技有限公司 Character and language synchronizing method and synchronizer
CN106326640A (en) * 2016-08-12 2017-01-11 上海交通大学医学院附属瑞金医院卢湾分院 Medical speech control system and control method thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
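王金甲 (Wang Jinjia): "Research on a robust text-independent speaker identification system in noisy environments", China Master's Theses Full-text Database, Information Science and Technology, No. 02 *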

Similar Documents

Publication Publication Date Title
US6687671B2 (en) Method and apparatus for automatic collection and summarization of meeting information
US7415409B2 (en) Method to train the language model of a speech recognition system to convert and index voicemails on a search engine
CN101447188B (en) Digital voice print identification system and validation and identification method
CN1270361A (en) Method and device for audio information searching by content and loudspeaker information
CN107293309B (en) Method for improving public opinion monitoring efficiency based on client emotion analysis
US20040163035A1 (en) Method for automatic and semi-automatic classification and clustering of non-deterministic texts
CN112735383A (en) Voice signal processing method, device, equipment and storage medium
CN105931642B (en) Voice recognition method, device and system
CN109920435B (en) Voiceprint recognition method and voiceprint recognition device
WO2016119604A1 (en) Voice information search method and apparatus, and server
WO2009003328A1 (en) Data query system and method
CN114399379A (en) Artificial intelligence-based collection behavior recognition method, device, equipment and medium
CN105227557A (en) A kind of account number processing method and device
CN116150651A (en) AI-based depth synthesis detection method and system
CN116863960A (en) Emergency broadcast terminal audio processing method and device, emergency broadcast terminal and medium
CN106095799A (en) The storage of a kind of voice, search method and device
CN114722199A (en) Risk identification method and device based on call recording, computer equipment and medium
CN114610840A (en) Sensitive word-based accounting monitoring method, device, equipment and storage medium
CN116484052B (en) Educational resource sharing system based on big data
KR20080046490A (en) Method for identifying face using montage and apparatus thereof
US7340398B2 (en) Selective sampling for sound signal classification
CN108269205A (en) A kind of electronic data identification systems using cloud platform
CN108182570A (en) A kind of case wisdom auditing system
CN108280188A (en) Intelligence inspection business platform based on big data
CN113901839A (en) User video information auditing method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180710