CN108364653A - Voice data processing method and processing unit - Google Patents
Voice data processing method and processing unit Download PDFInfo
- Publication number
- CN108364653A CN108364653A CN201810145265.9A CN201810145265A CN108364653A CN 108364653 A CN108364653 A CN 108364653A CN 201810145265 A CN201810145265 A CN 201810145265A CN 108364653 A CN108364653 A CN 108364653A
- Authority
- CN
- China
- Prior art keywords
- text information
- information
- word
- data processing
- voice data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012545 processing Methods 0.000 title claims abstract description 45
- 238000003672 processing method Methods 0.000 title claims abstract description 26
- 230000004044 response Effects 0.000 claims abstract description 28
- 238000000034 method Methods 0.000 claims abstract description 22
- 230000001360 synchronised effect Effects 0.000 claims abstract description 8
- 238000004891 communication Methods 0.000 claims description 3
- 230000011218 segmentation Effects 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 238000012937 correction Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000005336 cracking Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 241001124569 Lycaenidae Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/50—Constructional details
- H04N23/54—Mounting of pick-up tubes, electronic image sensors, deviation or focusing coils
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The present invention provides a kind of voice data processing method and processing unit, the method includes:Obtain voice messaging;Acquired voice messaging is synchronously converted into the first text information, and by first word-information display in the first area of display device;In response to the first operation, word processing carried out to the first text information to be edited, and by treated word-information display the display device second area;In response to the second operation, first text information that do not changed is synchronized to the second area, and generate the second text information;It is operated in response to third, third text information is generated based on second text information, and by the third word-information display in the third region of the display device.Voice data processing method according to the ... of the embodiment of the present invention can by voice messaging be converted into text information precisely in real time.
Description
Technical field
The present invention relates to audio, video data process fields, specifically, be related to a kind of procuratorial work hearing record processing method and
Processing unit.
Background technology
As an important link indispensable in cracking of cases, the prior art is being done for the hearing work of procuratorate
When case is inquested, personnel in charge of the case often can not entire session log in comprehensive and accurate record Interrogation Procedure, and can only handle a case
It is recorded one by one in real time by typing personnel in the process, heavy workload efficiency is low and is also possible to have deviation so that the notes of case
Information is sufficiently complete accurate, meanwhile, the understanding that data is integrally put down to same case is not easy to slap, and greatly reduces hearing effect
Rate affects the progress of cracking of cases.
Play the role of the trend that most important record of trial work is faced with Promotion Transformation in current hearing work, with
In-depth input that science and technology is examined by force and modernization science and technology are constantly progressive, convenience that procuratorial organ work to hearing record, accurately
Property and advanced also have higher requirement.
Invention content
In view of this, it is an object of the invention to propose a kind of voice data processing method and processing unit, so as to reality
When, the text information that is accurately converted to voice messaging.
To solve the above-mentioned problems, according to an aspect of the present invention, a kind of voice data processing method, the side are provided
Method includes:Obtain voice messaging;Acquired voice messaging is synchronously converted into the first text information, and literary by described first
Word presentation of information is in the first area of display device;In response to the first operation, word is carried out to the first text information to be edited
Processing, and by treated word-information display the display device second area;In response to the second operation, will not repaiied
First text information changed is synchronized to the second area, and generates the second text information;It operates, is based in response to third
Second text information generates third text information, and by the third word-information display the display device third
Region.
To solve the above-mentioned problems, according to another aspect of the present invention, a kind of voice data processing method is provided, it is described
Method includes:Obtain voice messaging;Acquired voice messaging is synchronously converted into the first text information, and by described first
Word-information display is in the first area of display device;In response to the 4th operation, first text information is carried out at word
Reason to generate the second text information, and by second word-information display the display device second area;In response to
Third operates, and generates third text information based on second text information, and by the third word-information display described
The third region of display device.
To solve the above-mentioned problems, according to a further aspect of the invention, a kind of voice data processing apparatus is provided, it is described
Device includes:At least one processor;And the memory being connect at least one processor communication;Wherein, described to deposit
Reservoir is stored with the order that can be executed by least one processor, and the order is executed by least one processor,
So that at least one processor is able to carry out acquisition voice messaging;Acquired voice messaging is synchronously converted to first
Text information, and by first word-information display in the first area of display device;In response to the first operation, to be edited
The first text information carry out word processing, and by treated word-information display the display device second area;
In response to the second operation, first text information that do not changed is synchronized to the second area, and generate the second word
Information;It is operated in response to third, third text information is generated based on second text information, and by the third text information
It is shown in the third region of the display device.
Voice data processing method according to the ... of the embodiment of the present invention and processing system can be shown in same display equipment
Show that voice with record, check and correction notes and final notes, can also realize accomplice retrieval in Interrogation Procedure with screen, decorrelation letter
Breath, to greatly facilitate the making that scene is put down when procuratorial organs handle a case.
It should be understood that the voice data processing method and device of the present invention can not only be applied in case Interrogation Procedure,
He is any required to convert voice messaging to text information in real time, and the occasion having higher requirements to the precision of text information is equal
It can apply.
Description of the drawings
By referring to the drawings to the description of the embodiment of the present invention, the above and other purposes of the present invention, feature and
Advantage will be apparent from, in the accompanying drawings:
Fig. 1 is the flow chart according to a kind of voice data processing method of the specific embodiment of the invention;
Fig. 2 is that procuratorate according to the ... of the embodiment of the present invention handles a case the structure diagram of system;
Fig. 3 A are the schematic diagrames of the first text information shown in the first region;
Fig. 3 B are the schematic diagrames of the text information after second area display processing;
Fig. 4 schematically shows the flow charts of the query-related information in first, second and/or third text information;
And
Fig. 5 is the structure diagram of voice data processing apparatus according to the ... of the embodiment of the present invention.
Specific implementation mode
Illustrate the embodiment of the present invention with reference to the accompanying drawings.Described in the attached drawing of the present invention or a kind of embodiment
Elements and features can be combined with elements and features shown in one or more other accompanying drawings or embodiment.It should note
Portion unrelated to the invention, known to persons of ordinary skill in the art is omitted in attached drawing and explanation for purposes of clarity in meaning
The expression and description of part and processing.
It will be understood by those skilled in the art that the terms such as " first ", " second " in the present invention are only used for distinguishing different lists
Member, module or step etc., neither represent any particular technology meaning, also do not indicate that the inevitable logical order between them, also not
Embody the importance degree of different units, module or step defined by it.
In embodiments, those features different from other embodiment only are described emphatically, and be omitted and other implementations
The same or similar feature of example.
The inventors found that by same display device split screen display available voice with recording data, check and correction notes with
And put down according to the final scene that check and correction notes generate, greatly facilitate the editing of scene notes.
Referring to Fig. 1, the flow chart of voice data processing method according to the ... of the embodiment of the present invention, this method S100 packets are shown
Include following steps:Obtain voice messaging S110;Acquired voice messaging is synchronously converted into the first text information, and by institute
State the first word-information display display device first area S130;In response to the first operation, to the first word to be edited
Information carries out word processing, and the text information S150 after the second area display processing of the display device;In response to
Two operations, are synchronized to the second area, to generate the second text information S170 by first text information that do not changed;
It is operated in response to third, third text information is generated based on second text information, and by the third word-information display
In the third region S190 of the display device.
Hereinafter, will be put down when being inquested with procuratorate with record is that scene specifically describes at the voice data of the embodiment of the present invention
Reason method S100.It should be understood that the voice data processing method of the embodiment of the present invention is not limited to hearing scene, can be applied to
The various occasions that voice input is converted to editable text information.
Fig. 2 is that procuratorate according to the ... of the embodiment of the present invention handles a case the structure diagram of system.As shown in Fig. 2, the system of handling a case can
Including multiple harvesters in hearing room are arranged, to be respectively used to acquire image, the voice messaging of handle a case people and suspect.
Specifically, harvester can be provided in hearing room, be located at multiple video cameras of different location.
Interrogator can be turned on or off by being used as the hearing host computer control different video camera of controller, and
The voice data processing apparatus that the voice signal of acquisition is sent.
Voice data processing apparatus can convert the voice signal of acquisition, style of writing of going forward side by side word processing, finally by institute
The notes information needed shows on the display device.
In alternatively possible example, special voice server can be configured, with to collected voice signal into
Row processing.
With reference to figure 1, in step s 110, start to operate in response to operating personnel, voice data processing apparatus starts to connect
Receive the voice data of input.In the present embodiment, voice data can be by the acquisition in such as video camera of hearing room is arranged
The dialogue of device collected procurator and suspect in real time.
In a possible example, it can be operating personnel to click or touch manner is alternatively provided to start operation
Beginning virtual push button on control device.Control device can be the specialized control equipment for the system specially developed, and can also lead to
It crosses and the mode of software is installed on the existing equipments such as intelligent mobile terminal, tablet computer to realize.
After operating personnel click start button, setting is activated in the indoor harvester of hearing, starts to obtain hearing
Then the voice messaging got is transmitted to voice data processing apparatus by indoor image information and voice messaging.
In step S120, voice data processing apparatus can utilize the various speech recognition engines being stored therein that will be obtained
What the voice messaging taken synchronized is converted to text information.Hereinafter, for convenience's sake, it will be converted by speech recognition engine
Text information be known as the first text information.
For example, being referred to the relevant technologies, Markov speech recognition modeling, neural network algorithm, supporting vector are used
The various speech recognition engines of machine scheduling algorithm model construction.
Then, include in the specific region of display device in real time by the first text information.Hereinafter, it rises for convenience
See, which is become into first area.Meanwhile special mark (ID) and memory space use can be configured in processing unit
In storing the first text information.
In a possible example, a special window can be set for showing that the word of conversion is believed in real time
Breath passes through the modes such as the towing position that voluntarily selection window is placed as needed by operating personnel.In another possible example
In, it can be as needed, display area is divided into multiple regions, by different presentation of information in different regions.For example,
Can be 3 regions, left area, intermediate region and right area, and the word that will identify that by display area fixed partition
A presentation of information region wherein, in left area.
In the present embodiment, the record of trial can be shown based on the identity information for the personnel for participating in hearing.For example, when examining
When sentencing personnel and speaking, speech recognition engine starts to identify the voice messaging of speaker, the voice messaging that processing unit will identify that
And identity information (personnel A) display of speaker is in the first region, and when talker person becomes crime from interrogator
When suspect, speech recognition engine judges the diversification in role of speaker according to the variation of sound, and at this moment, switching paragraph is shown newly
The voice messaging of the other personnel (personnel B) identified.
In a possible example, the identity information of the personnel that speak identified can be set by operating personnel.For example,
" personnel A " in the first text information that operating personnel can would indicate that is revised as " hearing people ", and " personnel B " is revised as
" suspect ".And in another possible example, it can be according to the voice messaging acquired in different audio collecting devices
To distinguish different Role Informations.
In a possible embodiment, first text information includes the time of the acquired voice messaging
Stamp.Fig. 3 A are the schematic diagrames of the first text information shown in the first region.As shown in Figure 3A, voice data processing apparatus can
The timestamp for getting voice messaging to be recorded, and with the identity information and voice messaging of the personnel that speak identified
It shows together in the first region.
Then, word processing is carried out to the first text information in response to the first operation of operating personnel in step S150.
In a possible example, the first operation can be the double click operation chosen after the first information to be edited.
As an alternative, after choosing the first information to be edited edit operation can be realized by clicking preset button.When user selects
In after the first information to be edited, which can be presented on to the specific region of display device.Hereinafter, it is side
Just for the sake of, which is become into second area.
As set forth above, it is possible to which a special window is arranged for showing the text information to be edited chosen in real time.Together
Sample, the display mode in 3 panes described above may be used, by word-information display to be edited in display device
Intermediate region.
Then, operating personnel such as can increase the text information in second area, delete at the operations, and show modification
Text information afterwards.In a possible example, modification operation can call back office interface by first by processing unit
The mark (ID) of text information modifies to the word being stored in processing unit.
Fig. 3 B are the schematic diagrames of the text information after second area display processing.Come below in conjunction with Fig. 3 A and Fig. 3 B specific
Ground description carries out pretreated process to the hearing information recorded.
As shown in Figure 3A, the hearing pen of the interrogator and suspect of acquisition are shown in the first area of display device
Record.In the example shown in Fig. 3 A and 3B, need to carry out preliminary treatment to the notes of suspect, it is unrelated with content is inquested to remove
Information, i.e. " heartily " the two words.In order to modify to this text information, operating personnel are such as to touch or mouse
The modes such as click, choose corresponding text information, i.e., correspondent time " 10 in figure:50:55 " corresponding text informations.Work as operation
, can be by the first operation after personnel choose information to be edited, such as double-click the information so that the information, including timestamp
Full content including identity information etc., is extracted in second area.Then, operating personnel can delete in the second area
Except " heartily " the two words.As a result, as shown in Figure 3B, showing modified text information in the second area.
According to described above, operating personnel have carried out in pre- place text information to be edited by the first operation, and
These pretreated text informations are shown in the second area, at this point, the information of not selected processing of modifying is not
Display is in the second area.First text information that do not changed is synchronized in response to the second operation in step S170
The second area, and generate and completely pass through pretreated notes information.Hereinafter, in the second area complete will be presented
Whole pretreated text information becomes the second text information.
Second operation for example can click preset virtual push button on the control device by operating personnel to realize.
In a possible embodiment, the second text information is editable.After generating the second text information, behaviour
Editing and processing in detail can be carried out to the second text information by making personnel.
Then, it in step S190, is operated in response to the third of operating personnel, third is generated based on second text information
Text information, and by the third word-information display the display device third region S190.
As described above, third operation can click preset virtual push button on the control device come real by operating personnel
It is existing.
In a possible embodiment, third text information is editable.
For example, when operating personnel trigger third operation after, can in special region, i.e. third region, in create one
Window is to show the final notes of generation, i.e. third text information.Similarly, as set forth above, it is possible to using described above
The display mode of 3 panes exists the second word-information display by the first word-information display in the left area of display device
The intermediate region of display device, by third word-information display display device right area.
It should be understood that in the present embodiment, third region is simultaneously not exclusively suitable for display third text information.For example, generating the
Three text informations and after preserving, operating personnel can be by the closes for showing third text information, and by third area
Domain is for showing other information.
A voice data processing method according to the ... of the embodiment of the present invention is described above in association with Fig. 1-Fig. 3 B, using this reality
The method described in example is applied, voice can be shown in same display equipment with record, check and correction notes and final notes, to greatly
The making for facilitating scene when procuratorial organs handle a case and putting down.It should be understood that the voice data processing method described in the present embodiment is not
It can only apply in case Interrogation Procedure, other any need convert voice messaging to text information in real time, and to text
The occasion that the precision of word information has higher requirements can be applied.
In another embodiment of the present invention, relevant letter can be inquired in the case of split screen display available puts down information
Breath.Fig. 4 schematically shows the flow charts of the query-related information in first, second and/or third text information.In this reality
It applies in example, any sentence in the first text information, the second text information or third text information can be selected, in order to
To carry out the inquiry of relevant information in being put down in entire chapter.Hereinafter, by for selecting a certain sentence in the second text information,
Explain the operation of the query-related information in accomplice notes in detail in conjunction with Fig. 4.
Operating personnel can be in the second text information that second area is shown, as unit of sentence, and arbitrary selection it is expected
The text information of understanding.For example, in step S410, when operating personnel selected by way of click in second area it is " subsequent
Zhang San has turned 1,000,000 to the account of my Bank of Communications " after this sentence, in step S430, the sentence chosen can be divided
Word processing, and by default rule, the word of not practical significance is removed, has extracted heavy duty word, i.e. " Zhang San ", " traffic
Bank ", " account " and " 10,000,000 ".
Then, it in step S450, is inquired in first, second and/or third text information and refers to " Zhang San ", " traffic silver
The sentence of row " " account " or " 10,000,000 ", and these the associated word-information displays inquired are come out.For example,
In one possible example, all associated word-information displays retrieved can be shown in specific area in third region
Domain.In another possible example, it can will be inquired in first, second and/or third text information shown in same screen
Relevant information be highlighted.
In the present embodiment, accomplice retrieval analysis is presented by same screen, comprehensive note to case Interrogation Procedure may be implemented
Record, quick clear up a criminal case have reached best implementation result.It should be understood that the voice data processing method according to the present embodiment is answered
It when used in other word processing scenes, can rapidly analyze, positioning operation personnel it is expected all information understood.
It in another embodiment of the present invention, can be by preset word processing rule, to realize to the first word
The pretreatment of information, to generate the second text information.
Specifically, different words can be added in rule base by operating personnel, can when generating the second text information
Directly first information word is executed and the operations such as the matched replacement of preset rules, deletion according to default rule with elder generation.
For example, in a possible example, in response to the 4th operation of user, such as grasped by clicking, touching setting
Make interface execution preprocessing function virtual push button, can automatically to the first text information execute with it is each in rule file
The operation of kind rule match, to directly generate the second text information.For example, by preset rules, the word in being elected to exists
Such as " heartily " hip-hop word when, automatically these words can be deleted.
In another possible example, as described in foregoing embodiments, when operating personnel choose from the first text information
After sentence to be edited, first directly the word chosen can be executed and the word match in rule file according to default rule
Text information carries out the scheduled processing such as replacing, delete, then is confirmed whether to also need to carry out further by operating personnel
Modification.After the completion of to be modified, in the operation by operating personnel, in response to the second operation, first text that will do not changed
Word synchronizing information is to the second area, as described in the step S170 in foregoing embodiments.
In the present embodiment, the generation and display of voice messaging and the first text information and third text information are obtained,
Similar with manner described above, details are not described herein.
Voice data processing method according to the present embodiment, can according to the first text information of default rule pair into
Row pretreatment, to simplify the generation of the second text information.
The voice data processing method of the embodiment of the present invention is described above in association with Fig. 1 to Fig. 4.In fact, the present invention is also
Provide a kind of voice data processing apparatus for executing above-mentioned voice data processing method.Fig. 5 is implemented according to the present invention
The structure diagram of the voice data processing apparatus of example.With reference to figure 5, which includes:
Memory 53 and one or at least processor 51;
Wherein, the memory 53 is communicated to connect with one or more of processors 51, is stored in the memory 53
There are the instruction that can be executed by one or more of processors, described instruction to be executed by one or more of processors 51, with
One or more of processors 51 are made to execute:Obtain voice messaging;Acquired voice messaging is synchronously converted to first
Text information, and by first word-information display in the first area of display device;In response to the first operation, to be edited
The first text information carry out word processing, and by treated word-information display the display device second area;
In response to the second operation, first text information that do not changed is synchronized to the second area, and generate the second word
Information;It is operated in response to third, third text information is generated based on second text information, and by the third text information
It is shown in the third region of the display device.
Processor can execute the voice data processing method with reference to the one embodiment for combining Fig. 1 to Fig. 3 B descriptions, or
Person executes the voice data processing method of another embodiment described with reference to figure 4, and details are not described herein for detail.
It will be understood by those skilled in the art that the embodiment of the present invention can be provided as method, system or computer program production
Product.Therefore, the form of hardware embodiment, software implementation or embodiment combining software and hardware aspects can be used in the present invention.
Moreover, the present invention can be used in one or more wherein include computer usable program code computer-usable storage medium
The form for the computer program product implemented on (including but not limited to magnetic disk storage and optical memory etc.).
These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works so that instruction generation stored in the computer readable memory includes referring to
Enable the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one box of block diagram or
The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device so that count
Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, in computer or
The instruction executed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one
The step of function of being specified in a box or multiple boxes.
The foregoing is only a preferred embodiment of the present invention, is not intended to limit the scope of the present invention.
Finally, it is to be noted that, the terms "include", "comprise" or its any other variant be intended to it is non-exclusive
Property include so that including a series of elements process, method, article or equipment not only include those elements, but also
Further include other elements that are not explicitly listed, or further include for this process, method, article or equipment it is intrinsic
Element.In addition, in the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that wrapping
Include in the process, method, article or equipment of the element that there is also other identical elements.
Claims (6)
1. a kind of voice data processing method, including:
Obtain voice messaging;
Acquired voice messaging is synchronously converted into the first text information, and first word-information display is being shown
The first area of device;
In response to the first operation, word processing is carried out to the first text information to be edited, and text information is shown by treated
Show the second area in the display device;
In response to the second operation, first text information that do not changed is synchronized to the second area, and generate second
Text information;
It is operated in response to third, third text information is generated based on second text information, and by the third text information
It is shown in the third region of the display device.
2. voice data processing method according to claim 1, it is characterised in that:First text information includes obtaining
The timestamp of the voice messaging.
3. voice data processing method according to claim 1, it is characterised in that:
Second text information is editable.
4. voice data processing method according to claim 3, it is characterised in that:
In the case that specific character is selected in described first, second or third text information, to the selected text
Word carries out word segmentation processing, and participle inquiry is carried out in described first, second and/or third text information, and shows and inquire
All associated text informations.
5. a kind of voice data processing method, including:
Obtain voice messaging;
Acquired voice messaging is synchronously converted into the first text information, and first word-information display is being shown
The first area of device;
In response to the 4th operation, word processing is carried out to first text information to generate the second text information, and will be described
Second area of second word-information display in the display device;
It is operated in response to third, third text information is generated based on second text information, and by the third text information
It is shown in the third region of the display device.
6. a kind of voice data processing apparatus, including:
At least one processor;And
The memory being connect at least one processor communication;Wherein, be stored with can be by described at least one for the memory
The order that a processor executes, the order is executed by least one processor, so that at least one processor energy
Enough perform claims require the method described in any one of 1-5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810145265.9A CN108364653B (en) | 2018-02-12 | 2018-02-12 | Voice data processing method and processing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810145265.9A CN108364653B (en) | 2018-02-12 | 2018-02-12 | Voice data processing method and processing device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108364653A true CN108364653A (en) | 2018-08-03 |
CN108364653B CN108364653B (en) | 2021-08-13 |
Family
ID=63005689
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810145265.9A Active CN108364653B (en) | 2018-02-12 | 2018-02-12 | Voice data processing method and processing device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108364653B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109599114A (en) * | 2018-11-07 | 2019-04-09 | 重庆海特科技发展有限公司 | Method of speech processing, storage medium and device |
CN109597991A (en) * | 2018-11-27 | 2019-04-09 | 北京巅峰科技有限公司 | Vehicle remote detection method |
CN109634935A (en) * | 2018-11-07 | 2019-04-16 | 重庆海特科技发展有限公司 | Method of speech processing, storage medium and device |
CN109788132A (en) * | 2018-12-29 | 2019-05-21 | 努比亚技术有限公司 | A kind of message treatment method, mobile terminal and computer readable storage medium |
CN110070873A (en) * | 2019-05-07 | 2019-07-30 | 上海良相智能化工程有限公司 | A kind of supervision digital interrogation system and equipment |
Citations (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1394331A (en) * | 2000-09-08 | 2003-01-29 | 皇家菲利浦电子有限公司 | Speech recognition method with replace command |
CN1460246A (en) * | 2001-03-29 | 2003-12-03 | 皇家菲利浦电子有限公司 | Text editing for recognized speech during synchronous playback |
CN1568501A (en) * | 2001-10-12 | 2005-01-19 | 皇家飞利浦电子股份有限公司 | Correction device for marking parts of recognized text |
US20050038653A1 (en) * | 2001-09-05 | 2005-02-17 | Voice Signal Technologies, Inc. | Word recognition using word transformation commands |
JP2005070645A (en) * | 2003-08-27 | 2005-03-17 | Casio Comput Co Ltd | Text and voice synchronizing device and text and voice synchronization processing program |
US7526431B2 (en) * | 2001-09-05 | 2009-04-28 | Voice Signal Technologies, Inc. | Speech recognition using ambiguous or phone key spelling and/or filtering |
CN101763360A (en) * | 2008-12-08 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Method and device for converting voice message into text message in court trial process |
CN101763881A (en) * | 2008-12-08 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Method for recording remote court trial data in real time and device thereof |
CN101763364A (en) * | 2008-12-08 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Method and device for converting court trial voice |
CN101950199A (en) * | 2010-09-10 | 2011-01-19 | 中国联合网络通信集团有限公司 | Method, device and system for word processing |
CN102956231A (en) * | 2011-08-23 | 2013-03-06 | 上海交通大学 | Voice key information recording device and method based on semi-automatic correction |
CN203117958U (en) * | 2013-03-22 | 2013-08-07 | 祁磊 | Real-time recording device used for court criminal justice |
CN104217039A (en) * | 2014-10-10 | 2014-12-17 | 谭希韬 | Method and system for recording telephone conversations in real time and converting telephone conversations into declarative sentences |
US20150058007A1 (en) * | 2013-08-26 | 2015-02-26 | Samsung Electronics Co. Ltd. | Method for modifying text data corresponding to voice data and electronic device for the same |
CN104488027A (en) * | 2012-07-09 | 2015-04-01 | 独立行政法人情报通信研究机构 | Speech processing system and terminal device |
WO2015156443A1 (en) * | 2014-04-11 | 2015-10-15 | 네무스텍(주) | Cartoon-type mobile personal secretary service system |
CN105094833A (en) * | 2015-08-03 | 2015-11-25 | 联想(北京)有限公司 | Data Processing method and system |
CN105808197A (en) * | 2014-12-30 | 2016-07-27 | 联想(北京)有限公司 | Information processing method and electronic device |
CN106067310A (en) * | 2016-06-27 | 2016-11-02 | 乐视控股(北京)有限公司 | Recording data processing method and processing device |
CN106409296A (en) * | 2016-09-14 | 2017-02-15 | 安徽声讯信息技术有限公司 | Voice rapid transcription and correction system based on multi-core processing technology |
CN106952655A (en) * | 2017-02-23 | 2017-07-14 | 深圳市金立通信设备有限公司 | A kind of input method and terminal |
JP2017126252A (en) * | 2016-01-15 | 2017-07-20 | 株式会社リクルートライフスタイル | Device, method, and program for speech translation |
CN107172382A (en) * | 2017-06-29 | 2017-09-15 | 深圳双猴科技有限公司 | A kind of intelligent meeting system and method |
CN107644646A (en) * | 2017-09-27 | 2018-01-30 | 北京搜狗科技发展有限公司 | Method of speech processing, device and the device for speech processes |
-
2018
- 2018-02-12 CN CN201810145265.9A patent/CN108364653B/en active Active
Patent Citations (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1394331A (en) * | 2000-09-08 | 2003-01-29 | 皇家菲利浦电子有限公司 | Speech recognition method with replace command |
CN1460246A (en) * | 2001-03-29 | 2003-12-03 | 皇家菲利浦电子有限公司 | Text editing for recognized speech during synchronous playback |
US20050038653A1 (en) * | 2001-09-05 | 2005-02-17 | Voice Signal Technologies, Inc. | Word recognition using word transformation commands |
US7526431B2 (en) * | 2001-09-05 | 2009-04-28 | Voice Signal Technologies, Inc. | Speech recognition using ambiguous or phone key spelling and/or filtering |
CN1568501A (en) * | 2001-10-12 | 2005-01-19 | 皇家飞利浦电子股份有限公司 | Correction device for marking parts of recognized text |
JP2005070645A (en) * | 2003-08-27 | 2005-03-17 | Casio Comput Co Ltd | Text and voice synchronizing device and text and voice synchronization processing program |
CN101763360A (en) * | 2008-12-08 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Method and device for converting voice message into text message in court trial process |
CN101763881A (en) * | 2008-12-08 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Method for recording remote court trial data in real time and device thereof |
CN101763364A (en) * | 2008-12-08 | 2010-06-30 | 新奥特硅谷视频技术有限责任公司 | Method and device for converting court trial voice |
CN101950199A (en) * | 2010-09-10 | 2011-01-19 | 中国联合网络通信集团有限公司 | Method, device and system for word processing |
CN102956231A (en) * | 2011-08-23 | 2013-03-06 | 上海交通大学 | Voice key information recording device and method based on semi-automatic correction |
CN104488027A (en) * | 2012-07-09 | 2015-04-01 | 独立行政法人情报通信研究机构 | Speech processing system and terminal device |
CN203117958U (en) * | 2013-03-22 | 2013-08-07 | 祁磊 | Real-time recording device used for court criminal justice |
US20150058007A1 (en) * | 2013-08-26 | 2015-02-26 | Samsung Electronics Co. Ltd. | Method for modifying text data corresponding to voice data and electronic device for the same |
WO2015156443A1 (en) * | 2014-04-11 | 2015-10-15 | 네무스텍(주) | Cartoon-type mobile personal secretary service system |
CN104217039A (en) * | 2014-10-10 | 2014-12-17 | 谭希韬 | Method and system for recording telephone conversations in real time and converting telephone conversations into declarative sentences |
CN105808197A (en) * | 2014-12-30 | 2016-07-27 | 联想(北京)有限公司 | Information processing method and electronic device |
CN105094833A (en) * | 2015-08-03 | 2015-11-25 | 联想(北京)有限公司 | Data Processing method and system |
JP2017126252A (en) * | 2016-01-15 | 2017-07-20 | 株式会社リクルートライフスタイル | Device, method, and program for speech translation |
CN106067310A (en) * | 2016-06-27 | 2016-11-02 | 乐视控股(北京)有限公司 | Recording data processing method and processing device |
CN106409296A (en) * | 2016-09-14 | 2017-02-15 | 安徽声讯信息技术有限公司 | Voice rapid transcription and correction system based on multi-core processing technology |
CN106952655A (en) * | 2017-02-23 | 2017-07-14 | 深圳市金立通信设备有限公司 | A kind of input method and terminal |
CN107172382A (en) * | 2017-06-29 | 2017-09-15 | 深圳双猴科技有限公司 | A kind of intelligent meeting system and method |
CN107644646A (en) * | 2017-09-27 | 2018-01-30 | 北京搜狗科技发展有限公司 | Method of speech processing, device and the device for speech processes |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109599114A (en) * | 2018-11-07 | 2019-04-09 | 重庆海特科技发展有限公司 | Method of speech processing, storage medium and device |
CN109634935A (en) * | 2018-11-07 | 2019-04-16 | 重庆海特科技发展有限公司 | Method of speech processing, storage medium and device |
CN109597991A (en) * | 2018-11-27 | 2019-04-09 | 北京巅峰科技有限公司 | Vehicle remote detection method |
CN109597991B (en) * | 2018-11-27 | 2023-04-28 | 北京巅峰科技有限公司 | Automobile remote detection method |
CN109788132A (en) * | 2018-12-29 | 2019-05-21 | 努比亚技术有限公司 | A kind of message treatment method, mobile terminal and computer readable storage medium |
CN110070873A (en) * | 2019-05-07 | 2019-07-30 | 上海良相智能化工程有限公司 | A kind of supervision digital interrogation system and equipment |
Also Published As
Publication number | Publication date |
---|---|
CN108364653B (en) | 2021-08-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108364653A (en) | Voice data processing method and processing unit | |
CN103023961B (en) | Via the workspace collaboration of wall type computing device | |
Tang et al. | An observational methodology for studying group design activity | |
CN110166650B (en) | Video set generation method and device, computer equipment and readable medium | |
CN110610705B (en) | Voice interaction prompter based on artificial intelligence | |
CN107423363A (en) | Art generation method, device, equipment and storage medium based on artificial intelligence | |
DE112020005253T5 (en) | DISSOLUTION OF ANAPHERA | |
CN108399923A (en) | More human hairs call the turn spokesman's recognition methods and device | |
KR20090097891A (en) | Controlling a document based on user behavioral signals detected from a 3d captured image stream | |
CN106971009A (en) | Speech data library generating method and device, storage medium, electronic equipment | |
CN108629548A (en) | A kind of schedule processing method and processing device | |
CN1726496A (en) | System and method for annotating multi-modal characteristics in multimedia documents | |
CN110377908A (en) | Semantic understanding method, apparatus, equipment and readable storage medium storing program for executing | |
Baur et al. | eXplainable cooperative machine learning with NOVA | |
WO2022127486A1 (en) | Interface theme switching method and apparatus, terminal, and storage medium | |
CN109460891A (en) | Data processing method, device and computer equipment based on satisfaction evaluation | |
CN109920436A (en) | It is a kind of that the device and method of ancillary service is provided | |
CN112270918A (en) | Information processing method, device, system, electronic equipment and storage medium | |
CN109782997A (en) | A kind of data processing method, device and storage medium | |
JP2022042423A (en) | Remote work assisting system and remote work assisting method | |
CN112287082A (en) | Data processing method, device, equipment and storage medium combining RPA and AI | |
JPH0749695A (en) | Time sequential data recording and reproducing device | |
CN108055192A (en) | Group's generation method, apparatus and system | |
CN109033052A (en) | Page processing method and device, medium and calculating equipment | |
CN108255917A (en) | Image management method, equipment and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |