CN109559764A - The treating method and apparatus of audio file - Google Patents
The treating method and apparatus of audio file Download PDFInfo
- Publication number
- CN109559764A CN109559764A CN201710890678.5A CN201710890678A CN109559764A CN 109559764 A CN109559764 A CN 109559764A CN 201710890678 A CN201710890678 A CN 201710890678A CN 109559764 A CN109559764 A CN 109559764A
- Authority
- CN
- China
- Prior art keywords
- critical field
- audio file
- sound
- critical
- temporal information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 94
- 230000002123 temporal effect Effects 0.000 claims abstract description 100
- 238000012545 processing Methods 0.000 claims description 24
- 238000003860 storage Methods 0.000 claims description 21
- 238000003672 processing method Methods 0.000 claims description 20
- 230000005236 sound signal Effects 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 abstract description 7
- 238000010586 diagram Methods 0.000 description 11
- 238000004590 computer program Methods 0.000 description 10
- 230000006870 function Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 3
- 238000012552 review Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
Abstract
This application discloses a kind for the treatment of method and apparatus of audio file.This method comprises: carrying out sound collection by the sound card of multichannel in court trial process, wherein the corresponding sound collector of each sound channel on sound card, each sound collector are used to acquire the sound using object;Collected voice signal is parsed, multiple critical fielies in the corresponding text information of identification voice signal;The temporal information for obtaining each critical field in multiple critical fielies, obtains target audio file, wherein the temporal information of each critical field in multiple critical fielies and multiple critical fielies is carried in target audio file;Show the temporal information of each critical field and each critical field simultaneously when playing target audio file.By the application, solve the problems, such as that the efficiency for obtaining target information from the audio file of court's trial in the related technology is lower.
Description
Technical field
This application involves audio signal processing technique fields, in particular to a kind for the treatment of method and apparatus of audio file.
Background technique
The voice play-back technology in webpage substantially has audio player audio and video player video, traditional at present
Video web page player (such as JW FLV of AS (flash) programming plus third party control (such as ckplayer) and some open sources
Player, it is most popular, most flexible Web media player, it can play all formats that Flash is supported, including
FLV, MP4, MP3, AAC, JPG, PNG and GIF.RTMP, HTTP live media stream is also supported to support a variety of played column tables
Formula) etc..But they can only be fast by clicking the playing progress bar that respectively default or by defining in being respectively arranged on lower keyboard
Prompt key controls broadcasting speed or is played back.Had also been used in judicial court trial process various software recording audios so as to
Review operation after the court's trials such as judge.But what this method can only be respectively arranged by the playing progress bar respectively defaulted or press
Shortcut key carry out control audio playback to reach the review to court's trial content, such as want grab court's trial in as case main idea, case by,
Law court, region, using law, party, the side of telling party, judge, lawyer, the time of concluding, evidence, it is concerning foreign affairs, judgement the amount of money it is thin
The key messages such as item and lawyer's office must be listened since the audio file of court's trial, and efficiency is lower.
For the lower problem of efficiency for obtaining target information from the audio file of court's trial in the related technology, not yet mention at present
Effective solution scheme out.
Summary of the invention
The main purpose of the application is to provide a kind for the treatment of method and apparatus of audio file, to solve in the related technology
The lower problem of efficiency of target information is obtained from the audio file of court's trial.
To achieve the goals above, according to the one aspect of the application, a kind of processing method of audio file is provided.It should
Method includes: to carry out sound collection by the sound card of multichannel, wherein each sound channel on the sound card in court trial process
A corresponding sound collector, each sound collector are used to acquire the sound using object;To collected voice signal into
Row parsing, identifies multiple critical fielies in the corresponding text information of the voice signal;It obtains in the multiple critical field
The temporal information of each critical field, obtains target audio file, wherein multiple keys are carried in the target audio file
The temporal information of each critical field in field and the multiple critical field;It is shown simultaneously when playing the target audio file
Show the temporal information of each critical field and each critical field.
Further, in court trial process, by the sound card of multichannel carry out sound collection include: in court trial process,
Sound collection is carried out by the sound card of multichannel, obtains original audio file, wherein the original audio file includes multiple sound
Sound signal;Collected voice signal is parsed, identifies multiple keys in the corresponding text information of the voice signal
Field includes: to parse to multiple voice signals in the original audio file, identifies the corresponding text of the voice signal
Multiple critical fielies in this information;The temporal information for obtaining each critical field in the multiple critical field, obtains target
Audio file includes: the temporal information for obtaining each critical field in the multiple critical field;What be will acquire is the multiple
The temporal information of each critical field is added in the original audio file in critical field;Will execute addition treated just
Beginning audio file is as the target audio file.
Further, at the beginning of including each critical field in the temporal information of each critical field,
Show the temporal information of each critical field and each critical field simultaneously when playing the target audio file
Before, the method also includes: determine that each critical field is playing at the beginning of based on each critical field
Corresponding position in progress bar, wherein the playing progress bar be used for when playing the target audio file show play into
Degree;In each critical field, the corresponding label of each critical field is added in corresponding position in playing progress bar;
Show the temporal information of each critical field and each critical field simultaneously when playing the target audio file
It include: while to show the playing progress bar for carrying the corresponding label of each critical field when playing the target audio file.
Further, at the beginning of the temporal information of each critical field includes each critical field,
When playing the target audio file at the same show each critical field and each critical field temporal information it
Before, the method also includes: creation broadcast information table, wherein in the broadcast information table include each critical field and
At the beginning of each critical field;When playing the target audio file simultaneously show each critical field and
The temporal information of each critical field includes: while to show the broadcast information when playing the target audio file
Table.
Further, at the beginning of the temporal information of each critical field includes each critical field and institute
The end time for stating each critical field, in obtaining the multiple critical field after the temporal information of each critical field,
The method also includes: at the beginning of the original audio file, each critical field, each critical field
End time storage with each critical field is in the preset database.
Further, in court trial process, before carrying out sound collection by the sound card of multichannel, the method is also wrapped
It includes: configuring the corresponding relationship on the sound card between each sound channel and each court's trial object role;It, will according to the corresponding relationship
The corresponding sound collector of each court's trial object role is attached with each sound channel.
To achieve the goals above, according to the one aspect of the application, a kind of processing unit of audio file is provided, is wrapped
It includes: acquisition unit, for carrying out sound collection by the sound card of multichannel, wherein every on the sound card in court trial process
The corresponding sound collector of a sound channel, each sound collector are used to acquire the sound using object;Recognition unit, for pair
Collected voice signal is parsed, and identifies multiple critical fielies in the corresponding text information of the voice signal;It obtains
Unit obtains target audio file for obtaining the temporal information of each critical field in the multiple critical field, wherein
The time letter of each critical field in multiple critical fielies and the multiple critical field is carried in the target audio file
Breath;Broadcast unit, for showing each critical field and each pass simultaneously when playing the target audio file
The temporal information of key field.
Further, the acquisition unit is also used in court trial process, carries out sound collection by the sound card of multichannel,
Obtain original audio file, wherein the original audio file includes multiple voice signals;The recognition unit is also used to adopting
The voice signal collected is parsed, and identifies that multiple critical fielies in the corresponding text information of the voice signal include: pair
Multiple voice signals in the original audio file are parsed, and are identified more in the corresponding text information of the voice signal
A critical field;The acquiring unit further include: module is obtained, for obtaining each critical field in the multiple critical field
Temporal information;Adding module, the temporal information of each critical field adds in the multiple critical field for will acquire
It adds in the original audio file;Determining module adds treated original audio file as the mesh for that will execute
Mark with phonetic symbols frequency file.
To achieve the goals above, according to the another aspect of the application, a kind of storage medium, the storage medium are provided
Program including storage, wherein described program executes the processing method of audio file described in above-mentioned any one.
To achieve the goals above, according to the another aspect of the application, a kind of processor is provided, the processor is used for
Run program, wherein described program executes the processing method of audio file described in above-mentioned any one when running.
By the application, using following steps: in court trial process, sound collection is carried out by the sound card of multichannel,
In, the corresponding sound collector of each sound channel on sound card, each sound collector is used to acquire the sound using object;It is right
Collected voice signal is parsed, multiple critical fielies in the corresponding text information of identification voice signal;It obtains multiple
The temporal information of each critical field, obtains target audio file in critical field, wherein carries in target audio file more
The temporal information of each critical field in a critical field and multiple critical fielies;It is shown simultaneously when playing target audio file
The temporal information of each critical field and each critical field solves the collected voice signal in the related technology from court's trial
The middle lower problem of efficiency for obtaining target information.By showing each critical field and every simultaneously when playing target audio file
The temporal information of a critical field, so as to according to the temporal information of each critical field and each critical field shown
Prompt, target information is rapidly obtained from target audio file, so reached promotion obtained from trial audio file
Take the effect of the efficiency of target information.
Detailed description of the invention
The attached drawing constituted part of this application is used to provide further understanding of the present application, the schematic reality of the application
Example and its explanation are applied for explaining the application, is not constituted an undue limitation on the present application.In the accompanying drawings:
Fig. 1 is the flow chart according to the processing method of audio file provided by the embodiments of the present application;
Fig. 2 is the schematic diagram according to each character location distribution in court scene in the embodiment of the present application;
Fig. 3 is the schematic diagram of the sound card in the processing according to audio file provided by the embodiments of the present application;
Fig. 4 is the schematic diagram of voice signal in the processing according to audio file provided by the embodiments of the present application;And
Fig. 5 is the schematic diagram according to the processing unit of audio file provided by the embodiments of the present application.
Specific embodiment
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase
Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
In order to make those skilled in the art more fully understand application scheme, below in conjunction in the embodiment of the present application
Attached drawing, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described embodiment is only
The embodiment of the application a part, instead of all the embodiments.Based on the embodiment in the application, ordinary skill people
Member's every other embodiment obtained without making creative work, all should belong to the model of the application protection
It encloses.
It should be noted that the description and claims of this application and term " first " in above-mentioned attached drawing, "
Two " etc. be to be used to distinguish similar objects, without being used to describe a particular order or precedence order.It should be understood that using in this way
Data be interchangeable under appropriate circumstances, so as to embodiments herein described herein.In addition, term " includes " and " tool
Have " and their any deformation, it is intended that cover it is non-exclusive include, for example, containing a series of steps or units
Process, method, system, product or equipment those of are not necessarily limited to be clearly listed step or unit, but may include without clear
Other step or units listing to Chu or intrinsic for these process, methods, product or equipment.
According to an embodiment of the present application, a kind of processing method of audio file is provided.
Fig. 1 is the flow chart according to the processing method of the audio file of the embodiment of the present application.As shown in Figure 1, this method packet
Include following steps:
Step S101 carries out sound collection by the sound card of multichannel, wherein each of on sound card in court trial process
Sound channel corresponds to a sound collector, and each sound collector is used to acquire the sound using object.
In this application, in court trial process, the approximate location of each role in court scene is as shown in Figure 2.There are more sound in court
Road sound card, sound card are connected on clerk's computer, the corresponding microphone of each sound channel (corresponding above-mentioned sound collection on sound card
Device), sound channel is corresponding with court's trial role relation, and the microphone of each role is connected to sound card according to this corresponded manner, such as Fig. 3 institute
Show.When carrying out court's trial, by the sound of the multiple roles using microphone of the sound DAQ of multichannel, voice signal is obtained, is adopted
The voice signal collected is one section of Wave data, for example, as shown in Figure 4.
Optionally, in the processing method of audio file provided by the embodiments of the present application, in court trial process, pass through more sound
Before the sound card in road carries out sound collection, this method further include: on configuration sound card each sound channel and each court's trial object role it
Between corresponding relationship;According to corresponding relationship, the corresponding sound collector of each court's trial object role is connected with each sound channel
It connects.
Alternatively, by configuring the corresponding relationship between each court's trial object role and each sound collector;Then according to
Each sound collector sound channel corresponding with each court's trial object role is attached by corresponding relationship.
It should be noted that can be applied by the processing method of the audio file of the embodiment of the present application in court's trial software
In, that is, being embedded in the processing method of the audio file of the embodiment of the present application in court's trial software.Before starting court's trial, installation
The court's trial software, and the corresponding relationship of role's sound channel is set in software.
Step S102 parses collected voice signal, more in the corresponding text information of identification voice signal
A critical field.
It should be noted that above-mentioned parse to collected voice signal can be to adopt on one side in court trial process
Collect voice signal, on one side to acquisition to voice signal parse, or in court trial process voice signal is adopted
After collection finishes, the set of voice signal is parsed, this is not especially limited in this application.
It should be noted that above-mentioned multiple critical fielies can for case by, case main idea, concerning foreign affairs, evidence, the amount of money of sentencing
Thin item etc., it should be noted that the critical field in the application can be to the corresponding text information of audio signal
After habit, determine text information first object content be case by content, determine text information the second object content be case
Content of main idea etc..
Step S103 obtains the temporal information of each critical field in multiple critical fielies, obtains target audio file,
In, the temporal information of each critical field in multiple critical fielies and multiple critical fielies is carried in target audio file.
It should be noted that above-mentioned temporal information is the corresponding temporal information in voice signal of each critical field.
For example, some critical field is case by carrying out to the corresponding text information of audio signal in multiple critical fielies
After study, determine text information first object content be case by content, be accordingly matched in voice signal in first object
Hold corresponding starting and end time, for as case at the beginning of and the end time, case is at the beginning of and terminates
Time be case by temporal information.
The corresponding temporal information in voice signal of each critical field is got by the above method.
Step S104 shows the time of each critical field and each critical field when playing target audio file simultaneously
Information.
By showing the temporal information of each critical field and each critical field simultaneously when playing target audio file,
It is fast from audio file so as to according to the prompt of the temporal information of each critical field and each critical field shown
Speed gets target information, and then has achieved the effect that be promoted the efficiency that target information is obtained from trial audio file.
If browser end sends to server and requests when browser plays target audio file, by the audio text of upload
Part, critical field, the temporal information of critical field etc. identified during parsing audio file read and are shown from database,
The temporal information for showing each critical field and each critical field simultaneously when playing target audio file passes through the method reality
Positioning playback function of the present browser end to audio file.
Optionally, in the processing method of audio file provided by the embodiments of the present application, in court trial process, pass through more sound
It includes: to carry out sound collection in court trial process by the sound card of multichannel, obtain initial sound that the sound card in road, which carries out sound collection,
Frequency file, wherein original audio file includes multiple voice signals;Collected voice signal is parsed, identifies sound
Multiple critical fielies in the corresponding text information of signal include: to solve to multiple voice signals in original audio file
Analysis identifies multiple critical fielies in the corresponding text information of voice signal;Obtain each critical field in multiple critical fielies
Temporal information, obtaining target audio file includes: to obtain the temporal information of each critical field in multiple critical fielies;It will obtain
The temporal information of each critical field is added in original audio file in the multiple critical fielies got;Addition processing will be executed
Original audio file afterwards is as target audio file.
It should be noted that above-mentioned original audio file is to pass through the sound card carry out sound of multichannel in court trial process
The set of the collected voice signal of sound, that is, through the above scheme, will first acquiring the voice signal in court trial process, adopting
Collection finishes, and parses to original audio file, multiple critical fielies in original audio file is identified, to original audio file
The temporal information for adding each critical field in multiple critical fielies, obtains target audio file, is playing target to realize
The temporal information of each critical field and each critical field is shown when audio file simultaneously, so that the positioning to audio file is returned
Playing function.
Optionally, in the processing method of audio file provided by the embodiments of the present application, the time of each critical field believes
Breath includes the end time at the beginning of each critical field with each critical field, each in obtaining multiple critical fielies
After the temporal information of critical field, this method further include: by original audio file, each critical field, each critical field
At the beginning of and each critical field end time storage in the preset database.
In obtaining multiple critical fielies after the temporal information of each critical field, by original audio file, Mei Geguan
It is stored in the preset database at the beginning of key field, each critical field with the end time of each critical field, namely
Whole audio file in entire court trial process is stored in presetting database, to guarantee the integrality of file.
Optionally, in the processing method of audio file provided by the embodiments of the present application, the time of each critical field believes
In breath include each critical field at the beginning of, when playing target audio file simultaneously show each critical field and each
Before the temporal information of critical field, this method further include: each keyword is determined at the beginning of based on each critical field
Section in playing progress bar corresponding position, wherein playing progress bar be used for when playing target audio file show play into
Degree;In each critical field, the corresponding label of each critical field is added in corresponding position in playing progress bar;Playing mesh
The temporal information for showing each critical field and each critical field when mark with phonetic symbols frequency file simultaneously includes: to play target audio text
When part, while showing the playing progress bar for carrying the corresponding label of each critical field.
For example, case is the 14th second in audio file at the beginning of, the 14th second of corresponding playing progress bar
Corresponding label is added in position, when playing target audio file, while showing and carrying the corresponding label of each critical field
Playing progress bar.
Optionally, in the processing method of audio file provided by the embodiments of the present application, the time of each critical field believes
At the beginning of breath includes each critical field, each critical field and each pass are shown simultaneously when playing target audio file
Before the temporal information of key field, this method further include: creation broadcast information table, wherein include each pass in broadcast information table
At the beginning of key field and each critical field;Show each critical field and each simultaneously when playing target audio file
The temporal information of critical field includes: while to show broadcast information table when playing target audio file.
For example, multiple critical fielies in the embodiment of the present application can for case by, case main idea, concerning foreign affairs, evidence, gold of sentencing
Thin item of volume etc. is based on creating broadcast information table at the beginning of each critical field and each critical field, for example, such as following table
Shown in 1:
Table 1
Critical field | Case by | Case main idea | It is concerning foreign affairs | Evidence | Sentence the thin item of the amount of money |
Time started (S) | 14 | 23 | 35 | 60 | 74 |
When playing target audio file, while showing table 1, the position of display table 1 is not construed as limiting in this application, is passed through
Information in table 1, user can intuitively get the time point in audio file where multiple target informations, so as to straight
It connects and switches to the corresponding time, obtain target information, to improve the effect for obtaining target information from trial audio file
Rate.
To sum up, the processing method of audio file provided by the embodiments of the present application, by passing through multichannel in court trial process
Sound card carry out sound collection, obtain audio file, wherein include multiple voice signals in audio file, each of on sound card
Sound channel corresponds to a sound collector, and each sound collector is used to acquire the sound using object;To the sound in audio file
Sound signal is parsed, multiple critical fielies in the corresponding text information of identification voice signal;It obtains in multiple critical fielies
The temporal information of each critical field;Show each critical field and each critical field simultaneously when playing target audio file
Temporal information, solve the problems, such as in the related technology from the audio file of court's trial obtain target information efficiency it is lower.Pass through
Show the temporal information of each critical field and each critical field, simultaneously when playing target audio file so as to basis
The prompt of the temporal information of each critical field and each critical field that show, is rapidly obtained mesh from audio file
Information is marked, and then has achieved the effect that be promoted the efficiency for obtaining target information from trial audio file.
It should be noted that step shown in the flowchart of the accompanying drawings can be in such as a group of computer-executable instructions
It is executed in computer system, although also, logical order is shown in flow charts, and it in some cases, can be with not
The sequence being same as herein executes shown or described step.
The embodiment of the present application also provides a kind of processing units of audio file, it should be noted that the embodiment of the present application
The processing unit of audio file can be used for executing the processing method that audio file is used for provided by the embodiment of the present application.With
Under the processing unit of audio file provided by the embodiments of the present application is introduced.
Fig. 5 is the schematic diagram according to the processing unit of the audio file of the embodiment of the present application.As shown in figure 5, the device packet
It includes: acquisition unit 10, recognition unit 20, acquiring unit 30, broadcast unit 40.
Specifically, acquisition unit 10, for carrying out sound collection by the sound card of multichannel in court trial process, wherein
The corresponding sound collector of each sound channel on the sound card, each sound collector are used to acquire the sound using object.
Recognition unit 20 identifies the corresponding text of the voice signal for parsing to collected voice signal
Multiple critical fielies in information.
Acquiring unit 30 obtains target for obtaining the temporal information of each critical field in the multiple critical field
Audio file, wherein each pass in multiple critical fielies and the multiple critical field is carried in the target audio file
The temporal information of key field.
Broadcast unit 40, for showing each critical field and described simultaneously when playing the target audio file
The temporal information of each critical field.
The processing unit of audio file provided by the embodiments of the present application passes through through acquisition unit 10 in court trial process
The sound card of multichannel carries out sound collection, wherein the corresponding sound collector of each sound channel on sound card, each sound collection
Device is used to acquire the sound using object;Recognition unit 20 parses collected voice signal, identifies voice signal pair
Multiple critical fielies in the text information answered;Acquiring unit 30 obtains the time letter of each critical field in multiple critical fielies
Breath, obtains target audio file, wherein carries in target audio file each in multiple critical fielies and multiple critical fielies
The temporal information of critical field;Broadcast unit 40 shows each critical field and each pass when playing target audio file simultaneously
The temporal information of key field.
Optionally, in the processing unit of audio file provided by the embodiments of the present application, acquisition unit is also used in court's trial
In the process, sound collection is carried out by the sound card of multichannel, obtains original audio file, wherein original audio file includes more
A voice signal;Recognition unit is also used to parse collected voice signal, the corresponding text envelope of identification voice signal
Multiple critical fielies in breath include: to parse to multiple voice signals in original audio file, identify voice signal pair
Multiple critical fielies in the text information answered;Acquiring unit further include: module is obtained, it is every in multiple critical fielies for obtaining
The temporal information of a critical field;Adding module, the time of each critical field in multiple critical fielies for will acquire
Information is added in original audio file;Determining module adds treated original audio file as target for that will execute
Audio file.
Optionally, in the processing unit of audio file provided by the embodiments of the present application, the time of each critical field believes
At the beginning of including each critical field in breath, the device further include: determination unit, for when playing target audio file
Before showing the temporal information of each critical field and each critical field simultaneously, based on true at the beginning of each critical field
Fixed each critical field corresponding position in playing progress bar, wherein playing progress bar is used to play target audio file
When show playback progress;In each critical field, the corresponding mark of each critical field is added in corresponding position in playing progress bar
Label;Broadcast unit is also used to when playing target audio file, while being shown and being carried broadcasting for each corresponding label of critical field
Put progress bar.
Optionally, in the processing unit of audio file provided by the embodiments of the present application, the time of each critical field believes
At the beginning of breath includes each critical field, the device further include: creating unit, for same when playing target audio file
When show the temporal information of each critical field and each critical field before, create broadcast information table, wherein broadcast information table
In include each critical field and each critical field at the beginning of;Broadcast unit is also used to playing target audio file
When, while showing broadcast information table.
Optionally, in the processing unit of audio file provided by the embodiments of the present application, the time of each critical field believes
Breath includes the end time at the beginning of each critical field with each critical field, the device further include: storage unit is used
After the temporal information of each critical field in obtaining multiple critical fielies, by original audio file, each critical field,
It is stored in the preset database at the beginning of each critical field with the end time of each critical field.
Optionally, in the processing unit of audio file provided by the embodiments of the present application, the device further include: configuration is single
Member before carrying out sound collection by the sound card of multichannel, configures on sound card each sound channel and each in court trial process
Corresponding relationship between court's trial object role;Connection unit is used for according to corresponding relationship, and each court's trial object role is corresponding
Sound collector is attached with each sound channel.
The processing unit of audio file includes processor and memory, and above-mentioned acquisition unit 10, obtains list at recognition unit 20
Member 30, broadcast unit 40 etc. store in memory as program unit, are executed on stored in memory by processor
Program unit is stated to realize corresponding function.
Include kernel in processor, is gone in memory to transfer corresponding program unit by kernel.Kernel can be set one
Or more, audio file is handled by adjusting kernel parameter.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/
Or the forms such as Nonvolatile memory, if read-only memory (ROM) or flash memory (flash RAM), memory include that at least one is deposited
Store up chip.
The embodiment of the invention provides a kind of storage mediums, are stored thereon with program, real when which is executed by processor
The processing method of existing audio file.
The embodiment of the invention provides a kind of processor, processor is for running program, wherein program executes sound when running
The processing method of frequency file.
The embodiment of the invention provides a kind of equipment, equipment include processor, memory and storage on a memory and can
The program run on a processor, processor perform the steps of in court trial process when executing program, pass through the sound of multichannel
Card carries out sound collection, wherein the corresponding sound collector of each sound channel on sound card, each sound collector is for acquiring
Use the sound of object;Collected voice signal is parsed, it is multiple in the corresponding text information of identification voice signal
Critical field;The temporal information for obtaining each critical field in multiple critical fielies, obtains target audio file, wherein target
The temporal information of each critical field in multiple critical fielies and multiple critical fielies is carried in audio file;Playing target
The temporal information of each critical field and each critical field is shown when audio file simultaneously.
In court trial process, carrying out sound collection by the sound card of multichannel includes: to pass through multichannel in court trial process
Sound card carry out sound collection, obtain original audio file, wherein original audio file includes multiple voice signals;To acquisition
To voice signal parsed, multiple critical fielies in the corresponding text information of identification voice signal include: to initial sound
Multiple voice signals in frequency file are parsed, multiple critical fielies in the corresponding text information of identification voice signal;It obtains
The temporal information for taking each critical field in multiple critical fielies, obtaining target audio file includes: to obtain multiple critical fielies
In each critical field temporal information;The temporal information of each critical field is added in the multiple critical fielies that will acquire
In original audio file;Addition treated original audio file will be executed as target audio file.
At the beginning of including each critical field in the temporal information of each critical field, target audio file is being played
When simultaneously show the temporal information of each critical field and each critical field before, this method further include: be based on each key
Each critical field corresponding position in playing progress bar is determined at the beginning of field, wherein playing progress bar is used for
Playback progress is shown when playing target audio file;In each critical field, corresponding position addition is each in playing progress bar
The corresponding label of critical field;Shown simultaneously when playing target audio file each critical field and each critical field when
Between information include: while to show the playback progress for carrying the corresponding label of each critical field when playing target audio file
Item.
At the beginning of the temporal information of each critical field includes each critical field, when playing target audio file
Before showing the temporal information of each critical field and each critical field simultaneously, this method further include: creation broadcast information table,
Wherein, at the beginning of including each critical field and each critical field in broadcast information table;Playing target audio file
When simultaneously show that the temporal information of each critical field and each critical field includes: when playing target audio file, simultaneously
Show broadcast information table.
The temporal information of each critical field includes at the beginning of each critical field and the end of each critical field
Time, in obtaining multiple critical fielies after the temporal information of each critical field, this method further include: by initial audio text
At the beginning of part, each critical field, each critical field and the end time of each critical field is stored in preset data
In library.
In court trial process, before carrying out sound collection by the sound card of multichannel, this method further include: on configuration sound card
Corresponding relationship between each sound channel and each court's trial object role;It is according to corresponding relationship, each court's trial object role is corresponding
Sound collector be attached with each sound channel.Equipment herein can be server, PC, PAD, mobile phone etc..
Present invention also provides a kind of computer program products, when executing on data processing equipment, are adapted for carrying out just
The program of beginningization there are as below methods step: in court trial process, the sound card for passing through multichannel carries out sound collection, wherein sound card
On the corresponding sound collector of each sound channel, each sound collector is used to acquire the sound for using object;To collecting
Voice signal parsed, multiple critical fielies in the corresponding text information of identification voice signal;Obtain multiple keywords
The temporal information of each critical field, obtains target audio file, wherein multiple keys are carried in target audio file in section
The temporal information of each critical field in field and multiple critical fielies;Show each pass simultaneously when playing target audio file
The temporal information of key field and each critical field.
In court trial process, carrying out sound collection by the sound card of multichannel includes: to pass through multichannel in court trial process
Sound card carry out sound collection, obtain original audio file, wherein original audio file includes multiple voice signals;To acquisition
To voice signal parsed, multiple critical fielies in the corresponding text information of identification voice signal include: to initial sound
Multiple voice signals in frequency file are parsed, multiple critical fielies in the corresponding text information of identification voice signal;It obtains
The temporal information for taking each critical field in multiple critical fielies, obtaining target audio file includes: to obtain multiple critical fielies
In each critical field temporal information;The temporal information of each critical field is added in the multiple critical fielies that will acquire
In original audio file;Addition treated original audio file will be executed as target audio file.
At the beginning of including each critical field in the temporal information of each critical field, target audio file is being played
When simultaneously show the temporal information of each critical field and each critical field before, this method further include: be based on each key
Each critical field corresponding position in playing progress bar is determined at the beginning of field, wherein playing progress bar is used for
Playback progress is shown when playing target audio file;In each critical field, corresponding position addition is each in playing progress bar
The corresponding label of critical field;Shown simultaneously when playing target audio file each critical field and each critical field when
Between information include: while to show the playback progress for carrying the corresponding label of each critical field when playing target audio file
Item.
At the beginning of the temporal information of each critical field includes each critical field, when playing target audio file
Before showing the temporal information of each critical field and each critical field simultaneously, this method further include: creation broadcast information table,
Wherein, at the beginning of including each critical field and each critical field in broadcast information table;Playing target audio file
When simultaneously show that the temporal information of each critical field and each critical field includes: when playing target audio file, simultaneously
Show broadcast information table.
The temporal information of each critical field includes at the beginning of each critical field and the end of each critical field
Time, in obtaining multiple critical fielies after the temporal information of each critical field, this method further include: by initial audio text
At the beginning of part, each critical field, each critical field and the end time of each critical field is stored in preset data
In library.
In court trial process, before carrying out sound collection by the sound card of multichannel, this method further include: on configuration sound card
Corresponding relationship between each sound channel and each court's trial object role;It is according to corresponding relationship, each court's trial object role is corresponding
Sound collector be attached with each sound channel.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program
Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application
Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more,
The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces
The form of product.
The application is referring to method, the process of equipment (system) and computer program product according to the embodiment of the present application
Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions
The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs
Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce
A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real
The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates,
Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or
The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting
Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or
The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one
The step of function of being specified in a box or multiple boxes.
In a typical configuration, calculating equipment includes one or more processors (CPU), input/output interface, net
Network interface and memory.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/
Or the forms such as Nonvolatile memory, such as read-only memory (ROM) or flash memory (flash RAM).Memory is computer-readable Jie
The example of matter.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be by any method
Or technology come realize information store.Information can be computer readable instructions, data structure, the module of program or other data.
The example of the storage medium of computer includes, but are not limited to phase change memory (PRAM), static random access memory (SRAM), moves
State random access memory (DRAM), other kinds of random access memory (RAM), read-only memory (ROM), electric erasable
Programmable read only memory (EEPROM), flash memory or other memory techniques, read-only disc read only memory (CD-ROM) (CD-ROM),
Digital versatile disc (DVD) or other optical storage, magnetic cassettes, tape magnetic disk storage or other magnetic storage devices
Or any other non-transmission medium, can be used for storage can be accessed by a computing device information.As defined in this article, it calculates
Machine readable medium does not include temporary computer readable media (transitory media), such as the data-signal and carrier wave of modulation.
It should also be noted that, the terms "include", "comprise" or its any other variant are intended to nonexcludability
It include so that the process, method, commodity or the equipment that include a series of elements not only include those elements, but also to wrap
Include other elements that are not explicitly listed, or further include for this process, method, commodity or equipment intrinsic want
Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including element
There is also other identical elements in process, method, commodity or equipment.
It will be understood by those skilled in the art that embodiments herein can provide as method, system or computer program product.
Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the application
Form.It is deposited moreover, the application can be used to can be used in the computer that one or more wherein includes computer usable program code
The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.)
Formula.
The above is only embodiments herein, are not intended to limit this application.To those skilled in the art,
Various changes and changes are possible in this application.It is all within the spirit and principles of the present application made by any modification, equivalent replacement,
Improve etc., it should be included within the scope of the claims of this application.
Claims (10)
1. a kind of processing method of audio file characterized by comprising
In court trial process, sound collection is carried out by the sound card of multichannel, wherein each sound channel corresponding one on the sound card
A sound collector, each sound collector are used to acquire the sound using object;
Collected voice signal is parsed, identifies multiple keywords in the corresponding text information of the voice signal
Section;
The temporal information for obtaining each critical field in the multiple critical field, obtains target audio file, wherein the mesh
The temporal information of each critical field in multiple critical fielies and the multiple critical field is carried in mark with phonetic symbols frequency file;
Show the time of each critical field and each critical field simultaneously when playing the target audio file
Information.
2. the method according to claim 1, wherein
In court trial process, carrying out sound collection by the sound card of multichannel includes: to pass through the sound of multichannel in court trial process
Card carries out sound collection, obtains original audio file, wherein the original audio file includes multiple voice signals;
Collected voice signal is parsed, identifies multiple critical fielies in the corresponding text information of the voice signal
Include: that multiple voice signals in the original audio file are parsed, identifies the corresponding text envelope of the voice signal
Multiple critical fielies in breath;
The temporal information for obtaining each critical field in the multiple critical field, obtaining target audio file includes:
Obtain the temporal information of each critical field in the multiple critical field;
The temporal information of each critical field is added to the original audio file in the multiple critical field that will acquire
In;
Addition treated original audio file will be executed as the target audio file.
3. method according to claim 1 or 2, which is characterized in that include in the temporal information of each critical field
At the beginning of each critical field,
Show the time of each critical field and each critical field simultaneously when playing the target audio file
Before information, the method also includes: determine that each critical field exists at the beginning of based on each critical field
Corresponding position in playing progress bar, wherein the playing progress bar is broadcast for showing when playing the target audio file
Degree of putting into;In each critical field, the corresponding mark of each critical field is added in corresponding position in playing progress bar
Label;
Show the time of each critical field and each critical field simultaneously when playing the target audio file
Information include: when playing the target audio file, while show carry the broadcasting of the corresponding label of each critical field into
Spend item.
4. method according to claim 1 or 2, which is characterized in that the temporal information of each critical field includes institute
At the beginning of stating each critical field,
Show the time of each critical field and each critical field simultaneously when playing the target audio file
Before information, the method also includes: creation broadcast information table, wherein include each key in the broadcast information table
At the beginning of field and each critical field;
Show the time of each critical field and each critical field simultaneously when playing the target audio file
Information includes: while to show the broadcast information table when playing the target audio file.
5. according to the method described in claim 2, it is characterized in that, the temporal information of each critical field includes described every
It is each in obtaining the multiple critical field at the beginning of a critical field and the end time of each critical field
After the temporal information of critical field, the method also includes:
It will be at the beginning of the original audio file, each critical field, each critical field and described each
The end time storage of critical field is in the preset database.
6. method according to claim 1 or 2, which is characterized in that in court trial process, carried out by the sound card of multichannel
Before sound collection, the method also includes:
Configure the corresponding relationship on the sound card between each sound channel and each court's trial object role;
According to the corresponding relationship, the corresponding sound collector of each court's trial object role is attached with each sound channel.
7. a kind of processing unit of audio file characterized by comprising
Acquisition unit, for carrying out sound collection by the sound card of multichannel, wherein on the sound card in court trial process
The corresponding sound collector of each sound channel, each sound collector are used to acquire the sound using object;
Recognition unit identifies in the corresponding text information of the voice signal for parsing to collected voice signal
Multiple critical fielies;
Acquiring unit obtains target audio text for obtaining the temporal information of each critical field in the multiple critical field
Part, wherein each critical field in multiple critical fielies and the multiple critical field is carried in the target audio file
Temporal information;
Broadcast unit, for showing each critical field and each pass simultaneously when playing the target audio file
The temporal information of key field.
8. device according to claim 7, which is characterized in that
The acquisition unit is also used in court trial process, is carried out sound collection by the sound card of multichannel, is obtained initial audio
File, wherein the original audio file includes multiple voice signals;
The recognition unit is also used to parse collected voice signal, identifies the corresponding text envelope of the voice signal
Multiple critical fielies in breath include: to parse to multiple voice signals in the original audio file, identify the sound
Multiple critical fielies in the corresponding text information of sound signal;
The acquiring unit further include:
Module is obtained, for obtaining the temporal information of each critical field in the multiple critical field;
Adding module, the temporal information of each critical field is added to described in the multiple critical field for will acquire
In original audio file;
Determining module adds treated original audio file as the target audio file for that will execute.
9. a kind of storage medium, which is characterized in that the storage medium includes the program of storage, wherein described program right of execution
Benefit require any one of 1 to 6 described in audio file processing method.
10. a kind of processor, which is characterized in that the processor is for running program, wherein right of execution when described program is run
Benefit require any one of 1 to 6 described in audio file processing method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710890678.5A CN109559764A (en) | 2017-09-27 | 2017-09-27 | The treating method and apparatus of audio file |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710890678.5A CN109559764A (en) | 2017-09-27 | 2017-09-27 | The treating method and apparatus of audio file |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109559764A true CN109559764A (en) | 2019-04-02 |
Family
ID=65864033
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710890678.5A Pending CN109559764A (en) | 2017-09-27 | 2017-09-27 | The treating method and apparatus of audio file |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109559764A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112333554A (en) * | 2020-10-27 | 2021-02-05 | 腾讯科技(深圳)有限公司 | Multimedia data processing method and device, electronic equipment and storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101482880A (en) * | 2008-01-09 | 2009-07-15 | 索尼株式会社 | Video searching apparatus, editing apparatus, video searching method, and program |
CN101996195A (en) * | 2009-08-28 | 2011-03-30 | 中国移动通信集团公司 | Searching method and device of voice information in audio files and equipment |
CN104078044A (en) * | 2014-07-02 | 2014-10-01 | 深圳市中兴移动通信有限公司 | Mobile terminal and sound recording search method and device of mobile terminal |
CN105653729A (en) * | 2016-01-28 | 2016-06-08 | 努比亚技术有限公司 | Device and method for indexing sound recording file |
CN105913838A (en) * | 2016-05-19 | 2016-08-31 | 努比亚技术有限公司 | Device and method of audio management |
US20160364102A1 (en) * | 2015-06-11 | 2016-12-15 | Yaron Galant | Method and apparatus for using gestures during video capture |
US20170186465A1 (en) * | 2015-12-23 | 2017-06-29 | Bryant E. Walters | System for playing files associated with tagged interest items |
-
2017
- 2017-09-27 CN CN201710890678.5A patent/CN109559764A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101482880A (en) * | 2008-01-09 | 2009-07-15 | 索尼株式会社 | Video searching apparatus, editing apparatus, video searching method, and program |
CN101996195A (en) * | 2009-08-28 | 2011-03-30 | 中国移动通信集团公司 | Searching method and device of voice information in audio files and equipment |
CN104078044A (en) * | 2014-07-02 | 2014-10-01 | 深圳市中兴移动通信有限公司 | Mobile terminal and sound recording search method and device of mobile terminal |
US20160364102A1 (en) * | 2015-06-11 | 2016-12-15 | Yaron Galant | Method and apparatus for using gestures during video capture |
US20170186465A1 (en) * | 2015-12-23 | 2017-06-29 | Bryant E. Walters | System for playing files associated with tagged interest items |
CN105653729A (en) * | 2016-01-28 | 2016-06-08 | 努比亚技术有限公司 | Device and method for indexing sound recording file |
CN105913838A (en) * | 2016-05-19 | 2016-08-31 | 努比亚技术有限公司 | Device and method of audio management |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112333554A (en) * | 2020-10-27 | 2021-02-05 | 腾讯科技(深圳)有限公司 | Multimedia data processing method and device, electronic equipment and storage medium |
CN112333554B (en) * | 2020-10-27 | 2024-02-06 | 腾讯科技(深圳)有限公司 | Multimedia data processing method and device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210287662A1 (en) | Methods and apparatus to segment audio and determine audio segment similarities | |
US9092531B2 (en) | Customized content consumption interface | |
CN109478195A (en) | The method and system of selection and optimization for search engine | |
CN103974143B (en) | A kind of method and apparatus for generating media data | |
CN112511854B (en) | Live video highlight generation method, device, medium and equipment | |
US11669296B2 (en) | Computerized systems and methods for hosting and dynamically generating and providing customized media and media experiences | |
US11176194B2 (en) | User configurable radio | |
CN109565621A (en) | Video segmentation in system for managing video | |
US10832700B2 (en) | Sound file sound quality identification method and apparatus | |
Hujran et al. | Big data and its effect on the music industry | |
CN108259985A (en) | Live audio sound mixing method, device, readable storage medium storing program for executing and equipment | |
CN109561339A (en) | The treating method and apparatus of video file | |
US20140129571A1 (en) | Electronic media signature based applications | |
CN106909567B (en) | Data processing method and device | |
CN107680584B (en) | Method and device for segmenting audio | |
WO2016171900A1 (en) | Gapless media generation | |
CN109388740A (en) | A kind of monitoring method and device of spreading network information effect | |
CN109559764A (en) | The treating method and apparatus of audio file | |
CN110019923A (en) | The lookup method and device of speech message | |
CN112349303B (en) | Audio playing method, device and storage medium | |
CN109213971A (en) | The generation method and device of court's trial notes | |
CN110046263A (en) | Multimedia recommendation method, device, server and storage medium | |
CN107799138A (en) | The method and device of audio recording | |
CN108874815A (en) | The search method and device of audio-video | |
Narayana et al. | Effect of noise-in-speech on mfcc parameters |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing Applicant after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd. Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A Applicant before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd. |
|
CB02 | Change of applicant information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190402 |
|
RJ01 | Rejection of invention patent application after publication |