CN107342086A

CN107342086A - Method of speech processing and device

Info

Publication number: CN107342086A
Application number: CN201710458436.9A
Authority: CN
Inventors: 全刚
Original assignee: Beijing Yunzhisheng Information Technology Co Ltd
Current assignee: Beijing Yunzhisheng Information Technology Co Ltd
Priority date: 2017-06-16
Filing date: 2017-06-16
Publication date: 2017-11-10

Abstract

The present invention be on a kind of method of speech processing and device, wherein, method includes：The text message of default case is obtained from presetting database；Text identification is carried out to the text message, and carries out proper noun mark, to obtain lists of keywords；The speech recognition modeling according to corresponding to the text message and the lists of keywords generate the default case；The voice messaging on the default case is obtained, the voice messaging is identified according to the speech recognition modeling, to obtain identifying text message corresponding to the voice messaging.By the technical scheme, it can improve recognition result accuracy and discrimination when identifying the voice messagings such as court's trial recording, lift Consumer's Experience.

Description

Method of speech processing and device

Technical field

The present invention relates to technical field of voice recognition, more particularly to a kind of method of speech processing and device.

Background technology

In correlation technique, when court's trial recording to case etc. carries out speech recognition, due to a large amount of professional words wherein be present Converge, such as name, place name, identification get up to have certain difficulty, therefore, cause recognition result inaccurate.

The content of the invention

The embodiment of the present invention provides a kind of method of speech processing and device, to realize in voice letters such as identification court's trial recording During breath, the accuracy and discrimination of recognition result are improved, lifts Consumer's Experience.

First aspect according to embodiments of the present invention, there is provided a kind of method of speech processing, including：

The text message of default case is obtained from presetting database；

Text identification is carried out to the text message, and carries out proper noun mark, to obtain lists of keywords；

The speech recognition modeling according to corresponding to the text message and the lists of keywords generate the default case；

The voice messaging on the default case is obtained, the voice messaging is carried out according to the speech recognition modeling Identification, to obtain identifying text message corresponding to the voice messaging.

In one embodiment, the voice messaging on presetting case includes the court's trial recording of the default case.

In this embodiment, text identification is carried out to the text message for presetting case, marks out at least one proper noun, Form lists of keywords, and then the speech recognition mould according to corresponding to the text message of lists of keywords and case generates the case Type, so, when identifying the court's trial recording of the case, it can be identified according to speech recognition modeling, so as to improve identification knot The accuracy and discrimination of fruit, lift the usage experience of user.

For example, when done in law court a court's trial speech-to-text handle when, can in session before will include this case close Key person names, place, the text of time are uploaded in database, so, sound identification module can be generated in advance, then do front yard The speech recognition of recording is examined, then can effectively be lifted during this to key person's title, the discrimination in place.

In one embodiment, the text message of default case is obtained from presetting database, including：

The text message of the default case is obtained according to the case of default case mark.

In this embodiment, can be that each case is set for the ease of being made a distinction to case and being easy to user to search Case is identified, and then the text message for obtaining the case is conveniently identified according to case.

In one embodiment, the case mark includes any one of following：User Identity, customer equipment identification and Docket.

In this embodiment, when by case text input database, User Identity can be carried, it is final to mark Court's trial text entry.The program realizes suitable for single equipment, the equipment of clerk during such as court's trial.

It is, of course, also possible to when by case text input database, docket is taken.So can be in advance by multiple cases Text shift to an earlier date input database, form speech recognition modeling corresponding to each case.When opening a court session, docket is inputted, i.e., Corresponding speech recognition modeling can be loaded court's trial record is identified.It is, of course, also possible to it is used as case mark by the use of device identification Know.

In one embodiment, after the text message of default case is obtained, methods described also includes：

The text message is filtered, by the non-textual Content Transformation in the text message into content of text.

In this embodiment it is possible to filtered to the text message of case, so as to by non-textual Content Transformation into text Content.Such as case text is probably text, may also contain figure, voice, video.A kind of mode of text filtering is only to retain text This, abandons figure, voice, video etc.；Another way is that figure, voice, video etc. are all converted into word.

In one embodiment, methods described also includes：

Receive speech recognition modeling delete command corresponding to the default case of input；

According to the speech recognition modeling delete command, the speech recognition modeling is deleted.

In this embodiment, in order to avoid taking excessive memory space, after to court's trial case end of identification, can incite somebody to action Speech recognition modeling is deleted.

Second aspect according to embodiments of the present invention, there is provided a kind of voice processing apparatus, including：

Acquisition module, for obtaining the text message of default case from presetting database；

Labeling module, for carrying out text identification to the text message, and proper noun mark is carried out, to obtain key Word list；

Generation module, for the language according to corresponding to the text message and the lists of keywords generation default case Sound identification model；

Identification module, for obtaining the voice messaging on the default case, according to the speech recognition modeling to institute State voice messaging to be identified, to obtain identifying text message corresponding to the voice messaging.

In one embodiment, the acquisition module is used for：

In one embodiment, described device also includes：

Modular converter, will for after the text message of default case is obtained, being filtered to the text message Non-textual Content Transformation in the text message is into content of text.

In one embodiment, described device also includes：

Receiving module, for receiving speech recognition modeling delete command corresponding to the default case of input；

Removing module, for according to the speech recognition modeling delete command, deleting the speech recognition modeling.

It should be appreciated that the general description and following detailed description of the above are only exemplary and explanatory, not Can the limitation present invention.

Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages can be by the explanations write Specifically noted structure is realized and obtained in book, claims and accompanying drawing.

Below by drawings and examples, technical scheme is described in further detail.

Brief description of the drawings

Accompanying drawing herein is merged in specification and forms the part of this specification, shows the implementation for meeting the present invention Example, and for explaining principle of the invention together with specification.

Fig. 1 is a kind of flow chart of method of speech processing according to an exemplary embodiment.

Fig. 2 is the flow chart of another method of speech processing according to an exemplary embodiment.

Fig. 3 is the flow chart of another method of speech processing according to an exemplary embodiment.

Fig. 4 is a kind of block diagram of voice processing apparatus according to an exemplary embodiment.

Fig. 5 is the block diagram of another voice processing apparatus according to an exemplary embodiment.

Fig. 6 is the block diagram of another voice processing apparatus according to an exemplary embodiment.

Embodiment

Here exemplary embodiment will be illustrated in detail, its example is illustrated in the accompanying drawings.Following description is related to During accompanying drawing, unless otherwise indicated, the same numbers in different accompanying drawings represent same or analogous key element.Following exemplary embodiment Described in embodiment do not represent and the consistent all embodiments of the present invention.On the contrary, they be only with it is such as appended The example of the consistent apparatus and method of some aspects being described in detail in claims, of the invention.

Fig. 1 is a kind of flow chart of method of speech processing according to an exemplary embodiment.The method of speech processing Applied in server.As shown in figure 1, the method comprising the steps of S101-S104：

In step S101, the text message of default case is obtained from presetting database；User will can preset in advance The text information storage of case is into presetting database.

In step s 102, text identification is carried out to the text message, and carries out proper noun mark, to obtain key Word list；Wherein, proper noun includes name, place name etc..

In step s 103, the language according to corresponding to the text message and the lists of keywords generate the default case Sound identification model；It is trained using text message and lists of keywords, generates sound identification module.

In step S104, the voice messaging on the default case is obtained, according to the speech recognition modeling to institute State voice messaging to be identified, to obtain identifying text message corresponding to the voice messaging.

It is, of course, also possible to when by case text input database, docket is taken.So can be in advance by multiple cases Text shift to an earlier date input database, form speech recognition modeling corresponding to each case.When opening a court session, docket is inputted, i.e., Corresponding speech recognition modeling can be loaded court's trial record is identified.

As shown in Fig. 2 in one embodiment, after the text message of default case is obtained, the above method also includes Step S201：

In step s 201, the text message is filtered, the non-textual content in the text message is turned Change content of text into.

As shown in figure 3, in one embodiment, the above method also includes step S301-S302：

In step S301, speech recognition modeling delete command corresponding to the default case of input is received；

In step s 302, according to the speech recognition modeling delete command, the speech recognition modeling is deleted.

Following is apparatus of the present invention embodiment, can be used for performing the inventive method embodiment.

Fig. 4 is a kind of block diagram of voice processing apparatus according to an exemplary embodiment, and the device can be by soft Part, hardware or both are implemented in combination with as some or all of of server.As shown in figure 4, the voice processing apparatus bag Include：

Acquisition module 41, for obtaining the text message of default case from presetting database；

Labeling module 42, for carrying out text identification to the text message, and proper noun mark is carried out, to be closed Keyword list；

Generation module 43, corresponding to generating the default case according to the text message and the lists of keywords Speech recognition modeling；

Identification module 44, for obtaining the voice messaging on the default case, according to the speech recognition modeling pair The voice messaging is identified, to obtain identifying text message corresponding to the voice messaging.

In one embodiment, the acquisition module 41 is used for：

As shown in figure 5, in one embodiment, said apparatus also includes：

Modular converter 51, for after the text message of default case is obtained, being filtered to the text message, with By the non-textual Content Transformation in the text message into content of text.

As shown in fig. 6, in one embodiment, said apparatus also includes：

Receiving module 61, for receiving speech recognition modeling delete command corresponding to the default case of input；

Removing module 62, for according to the speech recognition modeling delete command, deleting the speech recognition modeling.

It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or computer program Product.Therefore, the present invention can use the reality in terms of complete hardware embodiment, complete software embodiment or combination software and hardware Apply the form of example.Moreover, the present invention can use the computer for wherein including computer usable program code in one or more The shape for the computer program product that usable storage medium is implemented on (including but is not limited to magnetic disk storage and optical memory etc.) Formula.

The present invention is the flow with reference to method according to embodiments of the present invention, equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that can be by every first-class in computer program instructions implementation process figure and/or block diagram Journey and/or the flow in square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided The processors of all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that produced by the instruction of computer or the computing device of other programmable data processing devices for real The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.

These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which produces, to be included referring to Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or The function of being specified in multiple square frames.

These computer program instructions can be also loaded into computer or other programmable data processing devices so that counted Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, so as in computer or The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in individual square frame or multiple square frames.

Obviously, those skilled in the art can carry out the essence of various changes and modification without departing from the present invention to the present invention God and scope.So, if these modifications and variations of the present invention belong to the scope of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to comprising including these changes and modification.

Claims

A kind of 1. method of speech processing, for server, it is characterised in that including：

The text message of default case is obtained from presetting database；

Text identification is carried out to the text message, and carries out proper noun mark, to obtain lists of keywords；

The speech recognition modeling according to corresponding to the text message and the lists of keywords generate the default case；

The voice messaging on the default case is obtained, the voice messaging is known according to the speech recognition modeling Not, to obtain identifying text message corresponding to the voice messaging.
2. according to the method for claim 1, it is characterised in that the text envelope of default case is obtained from presetting database Breath, including：

The text message of the default case is obtained according to the case of default case mark.
3. according to the method for claim 2, it is characterised in that the case mark includes any one of following：User identity Mark, customer equipment identification and docket.
4. according to the method for claim 1, it is characterised in that after the text message of default case is obtained, the side Method also includes：

The text message is filtered, by the non-textual Content Transformation in the text message into content of text.
5. according to the method for claim 1, it is characterised in that methods described also includes：

Receive speech recognition modeling delete command corresponding to the default case of input；

According to the speech recognition modeling delete command, the speech recognition modeling is deleted.
6. method according to any one of claim 1 to 5, it is characterised in that the voice letter on presetting case Breath includes the court's trial recording of the default case.
A kind of 7. voice processing apparatus, for server, it is characterised in that including：

Acquisition module, for obtaining the text message of default case from presetting database；

Labeling module, for carrying out text identification to the text message, and proper noun mark is carried out, to obtain keyword row Table；

Generation module, know for the voice according to corresponding to the text message and the lists of keywords generation default case Other model；

Identification module, for obtaining the voice messaging on the default case, according to the speech recognition modeling to institute's predicate Message breath is identified, to obtain identifying text message corresponding to the voice messaging.
8. device according to claim 7, it is characterised in that the acquisition module is used for：

The text message of the default case is obtained according to the case of default case mark.
9. device according to claim 8, it is characterised in that the case mark includes any one of following：User identity Mark, customer equipment identification and docket.
10. device according to claim 7, it is characterised in that described device also includes：

Modular converter, for after the text message of default case is obtained, being filtered to the text message, by described in Non-textual Content Transformation in text message is into content of text.
11. device according to claim 7, it is characterised in that described device also includes：

Receiving module, for receiving speech recognition modeling delete command corresponding to the default case of input；

Removing module, for according to the speech recognition modeling delete command, deleting the speech recognition modeling.
12. the device according to any one of claim 7 to 12, it is characterised in that the voice on presetting case The court's trial that information includes the default case is recorded.