CN107342086A - Method of speech processing and device - Google Patents
Method of speech processing and device Download PDFInfo
- Publication number
- CN107342086A CN107342086A CN201710458436.9A CN201710458436A CN107342086A CN 107342086 A CN107342086 A CN 107342086A CN 201710458436 A CN201710458436 A CN 201710458436A CN 107342086 A CN107342086 A CN 107342086A
- Authority
- CN
- China
- Prior art keywords
- case
- text message
- text
- speech recognition
- default case
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 33
- 230000009466 transformation Effects 0.000 claims description 8
- 238000002372 labelling Methods 0.000 claims description 3
- 238000010586 diagram Methods 0.000 description 12
- 238000004590 computer program Methods 0.000 description 7
- 230000009471 action Effects 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
Abstract
The present invention be on a kind of method of speech processing and device, wherein, method includes:The text message of default case is obtained from presetting database;Text identification is carried out to the text message, and carries out proper noun mark, to obtain lists of keywords;The speech recognition modeling according to corresponding to the text message and the lists of keywords generate the default case;The voice messaging on the default case is obtained, the voice messaging is identified according to the speech recognition modeling, to obtain identifying text message corresponding to the voice messaging.By the technical scheme, it can improve recognition result accuracy and discrimination when identifying the voice messagings such as court's trial recording, lift Consumer's Experience.
Description
Technical field
The present invention relates to technical field of voice recognition, more particularly to a kind of method of speech processing and device.
Background technology
In correlation technique, when court's trial recording to case etc. carries out speech recognition, due to a large amount of professional words wherein be present
Converge, such as name, place name, identification get up to have certain difficulty, therefore, cause recognition result inaccurate.
The content of the invention
The embodiment of the present invention provides a kind of method of speech processing and device, to realize in voice letters such as identification court's trial recording
During breath, the accuracy and discrimination of recognition result are improved, lifts Consumer's Experience.
First aspect according to embodiments of the present invention, there is provided a kind of method of speech processing, including:
The text message of default case is obtained from presetting database;
Text identification is carried out to the text message, and carries out proper noun mark, to obtain lists of keywords;
The speech recognition modeling according to corresponding to the text message and the lists of keywords generate the default case;
The voice messaging on the default case is obtained, the voice messaging is carried out according to the speech recognition modeling
Identification, to obtain identifying text message corresponding to the voice messaging.
In one embodiment, the voice messaging on presetting case includes the court's trial recording of the default case.
In this embodiment, text identification is carried out to the text message for presetting case, marks out at least one proper noun,
Form lists of keywords, and then the speech recognition mould according to corresponding to the text message of lists of keywords and case generates the case
Type, so, when identifying the court's trial recording of the case, it can be identified according to speech recognition modeling, so as to improve identification knot
The accuracy and discrimination of fruit, lift the usage experience of user.
For example, when done in law court a court's trial speech-to-text handle when, can in session before will include this case close
Key person names, place, the text of time are uploaded in database, so, sound identification module can be generated in advance, then do front yard
The speech recognition of recording is examined, then can effectively be lifted during this to key person's title, the discrimination in place.
In one embodiment, the text message of default case is obtained from presetting database, including:
The text message of the default case is obtained according to the case of default case mark.
In this embodiment, can be that each case is set for the ease of being made a distinction to case and being easy to user to search
Case is identified, and then the text message for obtaining the case is conveniently identified according to case.
In one embodiment, the case mark includes any one of following:User Identity, customer equipment identification and
Docket.
In this embodiment, when by case text input database, User Identity can be carried, it is final to mark
Court's trial text entry.The program realizes suitable for single equipment, the equipment of clerk during such as court's trial.
It is, of course, also possible to when by case text input database, docket is taken.So can be in advance by multiple cases
Text shift to an earlier date input database, form speech recognition modeling corresponding to each case.When opening a court session, docket is inputted, i.e.,
Corresponding speech recognition modeling can be loaded court's trial record is identified.It is, of course, also possible to it is used as case mark by the use of device identification
Know.
In one embodiment, after the text message of default case is obtained, methods described also includes:
The text message is filtered, by the non-textual Content Transformation in the text message into content of text.
In this embodiment it is possible to filtered to the text message of case, so as to by non-textual Content Transformation into text
Content.Such as case text is probably text, may also contain figure, voice, video.A kind of mode of text filtering is only to retain text
This, abandons figure, voice, video etc.;Another way is that figure, voice, video etc. are all converted into word.
In one embodiment, methods described also includes:
Receive speech recognition modeling delete command corresponding to the default case of input;
According to the speech recognition modeling delete command, the speech recognition modeling is deleted.
In this embodiment, in order to avoid taking excessive memory space, after to court's trial case end of identification, can incite somebody to action
Speech recognition modeling is deleted.
Second aspect according to embodiments of the present invention, there is provided a kind of voice processing apparatus, including:
Acquisition module, for obtaining the text message of default case from presetting database;
Labeling module, for carrying out text identification to the text message, and proper noun mark is carried out, to obtain key
Word list;
Generation module, for the language according to corresponding to the text message and the lists of keywords generation default case
Sound identification model;
Identification module, for obtaining the voice messaging on the default case, according to the speech recognition modeling to institute
State voice messaging to be identified, to obtain identifying text message corresponding to the voice messaging.
In one embodiment, the acquisition module is used for:
The text message of the default case is obtained according to the case of default case mark.
In one embodiment, the case mark includes any one of following:User Identity, customer equipment identification and
Docket.
In one embodiment, described device also includes:
Modular converter, will for after the text message of default case is obtained, being filtered to the text message
Non-textual Content Transformation in the text message is into content of text.
In one embodiment, described device also includes:
Receiving module, for receiving speech recognition modeling delete command corresponding to the default case of input;
Removing module, for according to the speech recognition modeling delete command, deleting the speech recognition modeling.
In one embodiment, the voice messaging on presetting case includes the court's trial recording of the default case.
It should be appreciated that the general description and following detailed description of the above are only exemplary and explanatory, not
Can the limitation present invention.
Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification
Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages can be by the explanations write
Specifically noted structure is realized and obtained in book, claims and accompanying drawing.
Below by drawings and examples, technical scheme is described in further detail.
Brief description of the drawings
Accompanying drawing herein is merged in specification and forms the part of this specification, shows the implementation for meeting the present invention
Example, and for explaining principle of the invention together with specification.
Fig. 1 is a kind of flow chart of method of speech processing according to an exemplary embodiment.
Fig. 2 is the flow chart of another method of speech processing according to an exemplary embodiment.
Fig. 3 is the flow chart of another method of speech processing according to an exemplary embodiment.
Fig. 4 is a kind of block diagram of voice processing apparatus according to an exemplary embodiment.
Fig. 5 is the block diagram of another voice processing apparatus according to an exemplary embodiment.
Fig. 6 is the block diagram of another voice processing apparatus according to an exemplary embodiment.
Embodiment
Here exemplary embodiment will be illustrated in detail, its example is illustrated in the accompanying drawings.Following description is related to
During accompanying drawing, unless otherwise indicated, the same numbers in different accompanying drawings represent same or analogous key element.Following exemplary embodiment
Described in embodiment do not represent and the consistent all embodiments of the present invention.On the contrary, they be only with it is such as appended
The example of the consistent apparatus and method of some aspects being described in detail in claims, of the invention.
Fig. 1 is a kind of flow chart of method of speech processing according to an exemplary embodiment.The method of speech processing
Applied in server.As shown in figure 1, the method comprising the steps of S101-S104:
In step S101, the text message of default case is obtained from presetting database;User will can preset in advance
The text information storage of case is into presetting database.
In step s 102, text identification is carried out to the text message, and carries out proper noun mark, to obtain key
Word list;Wherein, proper noun includes name, place name etc..
In step s 103, the language according to corresponding to the text message and the lists of keywords generate the default case
Sound identification model;It is trained using text message and lists of keywords, generates sound identification module.
In step S104, the voice messaging on the default case is obtained, according to the speech recognition modeling to institute
State voice messaging to be identified, to obtain identifying text message corresponding to the voice messaging.
In one embodiment, the voice messaging on presetting case includes the court's trial recording of the default case.
In this embodiment, text identification is carried out to the text message for presetting case, marks out at least one proper noun,
Form lists of keywords, and then the speech recognition mould according to corresponding to the text message of lists of keywords and case generates the case
Type, so, when identifying the court's trial recording of the case, it can be identified according to speech recognition modeling, so as to improve identification knot
The accuracy and discrimination of fruit, lift the usage experience of user.
For example, when done in law court a court's trial speech-to-text handle when, can in session before will include this case close
Key person names, place, the text of time are uploaded in database, so, sound identification module can be generated in advance, then do front yard
The speech recognition of recording is examined, then can effectively be lifted during this to key person's title, the discrimination in place.
In one embodiment, the text message of default case is obtained from presetting database, including:
The text message of the default case is obtained according to the case of default case mark.
In this embodiment, can be that each case is set for the ease of being made a distinction to case and being easy to user to search
Case is identified, and then the text message for obtaining the case is conveniently identified according to case.
In one embodiment, the case mark includes any one of following:User Identity, customer equipment identification and
Docket.
In this embodiment, when by case text input database, User Identity can be carried, it is final to mark
Court's trial text entry.The program realizes suitable for single equipment, the equipment of clerk during such as court's trial.
It is, of course, also possible to when by case text input database, docket is taken.So can be in advance by multiple cases
Text shift to an earlier date input database, form speech recognition modeling corresponding to each case.When opening a court session, docket is inputted, i.e.,
Corresponding speech recognition modeling can be loaded court's trial record is identified.
Fig. 2 is the flow chart of another method of speech processing according to an exemplary embodiment.
As shown in Fig. 2 in one embodiment, after the text message of default case is obtained, the above method also includes
Step S201:
In step s 201, the text message is filtered, the non-textual content in the text message is turned
Change content of text into.
In this embodiment it is possible to filtered to the text message of case, so as to by non-textual Content Transformation into text
Content.Such as case text is probably text, may also contain figure, voice, video.A kind of mode of text filtering is only to retain text
This, abandons figure, voice, video etc.;Another way is that figure, voice, video etc. are all converted into word.
Fig. 3 is the flow chart of another method of speech processing according to an exemplary embodiment.
As shown in figure 3, in one embodiment, the above method also includes step S301-S302:
In step S301, speech recognition modeling delete command corresponding to the default case of input is received;
In step s 302, according to the speech recognition modeling delete command, the speech recognition modeling is deleted.
In this embodiment, in order to avoid taking excessive memory space, after to court's trial case end of identification, can incite somebody to action
Speech recognition modeling is deleted.
Following is apparatus of the present invention embodiment, can be used for performing the inventive method embodiment.
Fig. 4 is a kind of block diagram of voice processing apparatus according to an exemplary embodiment, and the device can be by soft
Part, hardware or both are implemented in combination with as some or all of of server.As shown in figure 4, the voice processing apparatus bag
Include:
Acquisition module 41, for obtaining the text message of default case from presetting database;
Labeling module 42, for carrying out text identification to the text message, and proper noun mark is carried out, to be closed
Keyword list;
Generation module 43, corresponding to generating the default case according to the text message and the lists of keywords
Speech recognition modeling;
Identification module 44, for obtaining the voice messaging on the default case, according to the speech recognition modeling pair
The voice messaging is identified, to obtain identifying text message corresponding to the voice messaging.
In one embodiment, the voice messaging on presetting case includes the court's trial recording of the default case.
In this embodiment, text identification is carried out to the text message for presetting case, marks out at least one proper noun,
Form lists of keywords, and then the speech recognition mould according to corresponding to the text message of lists of keywords and case generates the case
Type, so, when identifying the court's trial recording of the case, it can be identified according to speech recognition modeling, so as to improve identification knot
The accuracy and discrimination of fruit, lift the usage experience of user.
For example, when done in law court a court's trial speech-to-text handle when, can in session before will include this case close
Key person names, place, the text of time are uploaded in database, so, sound identification module can be generated in advance, then do front yard
The speech recognition of recording is examined, then can effectively be lifted during this to key person's title, the discrimination in place.
In one embodiment, the acquisition module 41 is used for:
The text message of the default case is obtained according to the case of default case mark.
In this embodiment, can be that each case is set for the ease of being made a distinction to case and being easy to user to search
Case is identified, and then the text message for obtaining the case is conveniently identified according to case.
In one embodiment, the case mark includes any one of following:User Identity, customer equipment identification and
Docket.
In this embodiment, when by case text input database, User Identity can be carried, it is final to mark
Court's trial text entry.The program realizes suitable for single equipment, the equipment of clerk during such as court's trial.
It is, of course, also possible to when by case text input database, docket is taken.So can be in advance by multiple cases
Text shift to an earlier date input database, form speech recognition modeling corresponding to each case.When opening a court session, docket is inputted, i.e.,
Corresponding speech recognition modeling can be loaded court's trial record is identified.
Fig. 5 is the block diagram of another voice processing apparatus according to an exemplary embodiment.
As shown in figure 5, in one embodiment, said apparatus also includes:
Modular converter 51, for after the text message of default case is obtained, being filtered to the text message, with
By the non-textual Content Transformation in the text message into content of text.
In this embodiment it is possible to filtered to the text message of case, so as to by non-textual Content Transformation into text
Content.Such as case text is probably text, may also contain figure, voice, video.A kind of mode of text filtering is only to retain text
This, abandons figure, voice, video etc.;Another way is that figure, voice, video etc. are all converted into word.
Fig. 6 is the block diagram of another voice processing apparatus according to an exemplary embodiment.
As shown in fig. 6, in one embodiment, said apparatus also includes:
Receiving module 61, for receiving speech recognition modeling delete command corresponding to the default case of input;
Removing module 62, for according to the speech recognition modeling delete command, deleting the speech recognition modeling.
In this embodiment, in order to avoid taking excessive memory space, after to court's trial case end of identification, can incite somebody to action
Speech recognition modeling is deleted.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or computer program
Product.Therefore, the present invention can use the reality in terms of complete hardware embodiment, complete software embodiment or combination software and hardware
Apply the form of example.Moreover, the present invention can use the computer for wherein including computer usable program code in one or more
The shape for the computer program product that usable storage medium is implemented on (including but is not limited to magnetic disk storage and optical memory etc.)
Formula.
The present invention is the flow with reference to method according to embodiments of the present invention, equipment (system) and computer program product
Figure and/or block diagram describe.It should be understood that can be by every first-class in computer program instructions implementation process figure and/or block diagram
Journey and/or the flow in square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided
The processors of all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce
A raw machine so that produced by the instruction of computer or the computing device of other programmable data processing devices for real
The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which produces, to be included referring to
Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or
The function of being specified in multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that counted
Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, so as in computer or
The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one
The step of function of being specified in individual square frame or multiple square frames.
Obviously, those skilled in the art can carry out the essence of various changes and modification without departing from the present invention to the present invention
God and scope.So, if these modifications and variations of the present invention belong to the scope of the claims in the present invention and its equivalent technologies
Within, then the present invention is also intended to comprising including these changes and modification.
Claims (12)
- A kind of 1. method of speech processing, for server, it is characterised in that including:The text message of default case is obtained from presetting database;Text identification is carried out to the text message, and carries out proper noun mark, to obtain lists of keywords;The speech recognition modeling according to corresponding to the text message and the lists of keywords generate the default case;The voice messaging on the default case is obtained, the voice messaging is known according to the speech recognition modeling Not, to obtain identifying text message corresponding to the voice messaging.
- 2. according to the method for claim 1, it is characterised in that the text envelope of default case is obtained from presetting database Breath, including:The text message of the default case is obtained according to the case of default case mark.
- 3. according to the method for claim 2, it is characterised in that the case mark includes any one of following:User identity Mark, customer equipment identification and docket.
- 4. according to the method for claim 1, it is characterised in that after the text message of default case is obtained, the side Method also includes:The text message is filtered, by the non-textual Content Transformation in the text message into content of text.
- 5. according to the method for claim 1, it is characterised in that methods described also includes:Receive speech recognition modeling delete command corresponding to the default case of input;According to the speech recognition modeling delete command, the speech recognition modeling is deleted.
- 6. method according to any one of claim 1 to 5, it is characterised in that the voice letter on presetting case Breath includes the court's trial recording of the default case.
- A kind of 7. voice processing apparatus, for server, it is characterised in that including:Acquisition module, for obtaining the text message of default case from presetting database;Labeling module, for carrying out text identification to the text message, and proper noun mark is carried out, to obtain keyword row Table;Generation module, know for the voice according to corresponding to the text message and the lists of keywords generation default case Other model;Identification module, for obtaining the voice messaging on the default case, according to the speech recognition modeling to institute's predicate Message breath is identified, to obtain identifying text message corresponding to the voice messaging.
- 8. device according to claim 7, it is characterised in that the acquisition module is used for:The text message of the default case is obtained according to the case of default case mark.
- 9. device according to claim 8, it is characterised in that the case mark includes any one of following:User identity Mark, customer equipment identification and docket.
- 10. device according to claim 7, it is characterised in that described device also includes:Modular converter, for after the text message of default case is obtained, being filtered to the text message, by described in Non-textual Content Transformation in text message is into content of text.
- 11. device according to claim 7, it is characterised in that described device also includes:Receiving module, for receiving speech recognition modeling delete command corresponding to the default case of input;Removing module, for according to the speech recognition modeling delete command, deleting the speech recognition modeling.
- 12. the device according to any one of claim 7 to 12, it is characterised in that the voice on presetting case The court's trial that information includes the default case is recorded.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710458436.9A CN107342086A (en) | 2017-06-16 | 2017-06-16 | Method of speech processing and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710458436.9A CN107342086A (en) | 2017-06-16 | 2017-06-16 | Method of speech processing and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107342086A true CN107342086A (en) | 2017-11-10 |
Family
ID=60219987
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710458436.9A Pending CN107342086A (en) | 2017-06-16 | 2017-06-16 | Method of speech processing and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107342086A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113808582A (en) * | 2020-06-17 | 2021-12-17 | 北京字节跳动网络技术有限公司 | Voice recognition method, device, equipment and storage medium |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101388078A (en) * | 2008-09-27 | 2009-03-18 | 腾讯科技(深圳)有限公司 | Text identification method and device based on verification |
CN101465960A (en) * | 2007-12-19 | 2009-06-24 | 深圳富泰宏精密工业有限公司 | Photographic device with voice control function and use method thereof |
CN101763508A (en) * | 2008-12-24 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Voice information acquiring, converting and identifying method and device |
CN102915731A (en) * | 2012-10-10 | 2013-02-06 | 百度在线网络技术(北京)有限公司 | Method and device for recognizing personalized speeches |
US20130151252A1 (en) * | 2009-11-13 | 2013-06-13 | At&T Intellectual Property I, L.P. | System and method for standardized speech recognition |
CN103165129A (en) * | 2011-12-13 | 2013-06-19 | 北京百度网讯科技有限公司 | Method and system for optimizing voice recognition acoustic model |
CN103365988A (en) * | 2013-07-05 | 2013-10-23 | 百度在线网络技术(北京)有限公司 | Method and device for loud reading pictures and characters of mobile terminal and mobile terminal |
CN104464423A (en) * | 2014-12-19 | 2015-03-25 | 科大讯飞股份有限公司 | Calibration optimization method and system for speaking test evaluation |
CN105609104A (en) * | 2016-01-22 | 2016-05-25 | 北京云知声信息技术有限公司 | Information processing method and apparatus, and intelligent voice router controller |
CN105913838A (en) * | 2016-05-19 | 2016-08-31 | 努比亚技术有限公司 | Device and method of audio management |
CN106448675A (en) * | 2016-10-21 | 2017-02-22 | 科大讯飞股份有限公司 | Recognition text correction method and system |
CN106601236A (en) * | 2016-12-22 | 2017-04-26 | 北京云知声信息技术有限公司 | Speech recognition method and apparatus |
-
2017
- 2017-06-16 CN CN201710458436.9A patent/CN107342086A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101465960A (en) * | 2007-12-19 | 2009-06-24 | 深圳富泰宏精密工业有限公司 | Photographic device with voice control function and use method thereof |
CN101388078A (en) * | 2008-09-27 | 2009-03-18 | 腾讯科技(深圳)有限公司 | Text identification method and device based on verification |
CN101763508A (en) * | 2008-12-24 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Voice information acquiring, converting and identifying method and device |
US20130151252A1 (en) * | 2009-11-13 | 2013-06-13 | At&T Intellectual Property I, L.P. | System and method for standardized speech recognition |
CN103165129A (en) * | 2011-12-13 | 2013-06-19 | 北京百度网讯科技有限公司 | Method and system for optimizing voice recognition acoustic model |
CN102915731A (en) * | 2012-10-10 | 2013-02-06 | 百度在线网络技术(北京)有限公司 | Method and device for recognizing personalized speeches |
CN103365988A (en) * | 2013-07-05 | 2013-10-23 | 百度在线网络技术(北京)有限公司 | Method and device for loud reading pictures and characters of mobile terminal and mobile terminal |
CN104464423A (en) * | 2014-12-19 | 2015-03-25 | 科大讯飞股份有限公司 | Calibration optimization method and system for speaking test evaluation |
CN105609104A (en) * | 2016-01-22 | 2016-05-25 | 北京云知声信息技术有限公司 | Information processing method and apparatus, and intelligent voice router controller |
CN105913838A (en) * | 2016-05-19 | 2016-08-31 | 努比亚技术有限公司 | Device and method of audio management |
CN106448675A (en) * | 2016-10-21 | 2017-02-22 | 科大讯飞股份有限公司 | Recognition text correction method and system |
CN106601236A (en) * | 2016-12-22 | 2017-04-26 | 北京云知声信息技术有限公司 | Speech recognition method and apparatus |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113808582A (en) * | 2020-06-17 | 2021-12-17 | 北京字节跳动网络技术有限公司 | Voice recognition method, device, equipment and storage medium |
CN113808582B (en) * | 2020-06-17 | 2024-04-09 | 抖音视界有限公司 | Speech recognition method, device, equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107452405B (en) | Method and device for evaluating data according to voice content | |
CN106570496B (en) | Emotion identification method and apparatus and intelligent interactive method and equipment | |
CN107968959B (en) | Knowledge point segmentation method for teaching video | |
CN111666746B (en) | Conference summary generation method and device, electronic equipment and storage medium | |
CN109361825A (en) | Meeting summary recording method, terminal and computer storage medium | |
CN104951433A (en) | Method and system for intention recognition based on context | |
CN110148400A (en) | The pronunciation recognition methods of type, the training method of model, device and equipment | |
TW201113870A (en) | Method for analyzing sentence emotion, sentence emotion analyzing system, computer readable and writable recording medium and multimedia device | |
CN108470188B (en) | Interaction method based on image analysis and electronic equipment | |
CN111144097B (en) | Modeling method and device for emotion tendency classification model of dialogue text | |
CN107291775A (en) | The reparation language material generation method and device of error sample | |
CN110309295B (en) | Method and device for generating examined and found sections of referee document | |
CN110059178A (en) | Problem distributing method and device | |
CN110992988A (en) | Speech emotion recognition method and device based on domain confrontation | |
Rao et al. | Sentiment analysis on user-generated video, audio and text | |
CN113342942B (en) | Corpus automatic acquisition method and device, computer equipment and storage medium | |
CN108810625A (en) | A kind of control method for playing back of multi-medium data, device and terminal | |
TWI771632B (en) | Learning support device, learning support method, and recording medium | |
CN107342086A (en) | Method of speech processing and device | |
CN104504104A (en) | Picture material processing method and device for search engine, and search engine | |
JP2006236037A (en) | Voice interaction content creation method, device, program and recording medium | |
US20230402030A1 (en) | Embedded Dictation Detection | |
CN112560811B (en) | End-to-end automatic detection research method for audio-video depression | |
CN113868271A (en) | Method and device for updating knowledge base of intelligent customer service, electronic equipment and storage medium | |
Tang | Manual transcription |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20171110 |