CN109559764A - Method and apparatus for processing an audio file - Google Patents

Info

Publication number
CN109559764A
Authority
CN
China
Prior art keywords
critical field
audio file
sound
critical
temporal information
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710890678.5A
Other languages
Chinese (zh)
Inventor
张珍心
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Application filed by Beijing Gridsum Technology Co Ltd
Priority to CN201710890678.5A
Publication of CN109559764A
Legal status: Pending

Classifications

    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10 Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Stereophonic System (AREA)

Abstract

This application discloses a method and apparatus for processing an audio file. The method comprises: during a court trial, collecting sound through a multi-channel sound card, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it; parsing the collected sound signal and identifying multiple critical fields in the text information corresponding to the sound signal; obtaining the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field; and, when the target audio file is played, displaying each critical field together with its temporal information. The application thereby solves the problem in the related art that obtaining target information from court trial audio files is inefficient.

Description

Method and apparatus for processing an audio file
Technical field
This application relates to the field of audio processing technology, and in particular to a method and apparatus for processing an audio file.
Background art
Current audio playback on web pages relies mainly on the audio player (audio) and the video player (video), on traditional web video players written in ActionScript (Flash) combined with third-party controls such as ckplayer, and on some open-source players such as JW FLV Player, a popular and flexible web media player that can play all formats supported by Flash, including FLV, MP4, MP3, AAC, JPG, PNG and GIF, and that also supports RTMP and HTTP live media streams and a variety of playlist formats. These players, however, can only control playback speed or replay audio through their default progress bars or through keyboard shortcuts that each player defines. Various software tools are also used to record audio during judicial court trials so that judges and others can review the proceedings afterwards. The review is again limited to controlling playback through the default progress bar or the configured shortcut keys. To pick out key information from a trial, such as the case summary, cause of action, court, region, applicable law, parties, litigating parties, judges, lawyers, date of conclusion, evidence, foreign-related matters, judgment amount details and law firms, one has to listen to the trial audio file from the beginning, which is inefficient.
No effective solution has yet been proposed for the problem in the related art that obtaining target information from court trial audio files is inefficient.
Summary of the invention
The main purpose of this application is to provide a method and apparatus for processing an audio file, so as to solve the problem in the related art that obtaining target information from court trial audio files is inefficient.
To achieve the above purpose, according to one aspect of this application, a method for processing an audio file is provided. The method comprises: during a court trial, collecting sound through a multi-channel sound card, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it; parsing the collected sound signal and identifying multiple critical fields in the text information corresponding to the sound signal; obtaining the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field; and displaying each critical field together with its temporal information when the target audio file is played.
Further, collecting sound through the multi-channel sound card during the court trial comprises: collecting sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file contains multiple sound signals. Parsing the collected sound signal and identifying the critical fields comprises: parsing the multiple sound signals in the original audio file and identifying multiple critical fields in the text information corresponding to the sound signals. Obtaining the temporal information of each critical field to produce the target audio file comprises: obtaining the temporal information of each of the critical fields; adding the obtained temporal information of each critical field to the original audio file; and taking the original audio file after the addition as the target audio file.
Further, the temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed during playback of the target audio file, the method further comprises: determining, from the start time of each critical field, the position corresponding to that critical field on the playback progress bar, wherein the playback progress bar shows the playback progress while the target audio file is played; and adding a label for each critical field at its corresponding position on the playback progress bar. Displaying each critical field together with its temporal information when the target audio file is played then comprises: displaying, while the target audio file is played, the playback progress bar carrying the label of each critical field.
Further, the temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed during playback of the target audio file, the method further comprises: creating a playback information table, wherein the playback information table contains each critical field and the start time of each critical field. Displaying each critical field together with its temporal information when the target audio file is played then comprises: displaying the playback information table while the target audio file is played.
Further, the temporal information of each critical field includes the start time and the end time of that critical field. After the temporal information of each of the critical fields is obtained, the method further comprises: storing the original audio file, each critical field, and the start time and end time of each critical field in a preset database.
Further, before sound is collected through the multi-channel sound card during the court trial, the method further comprises: configuring the correspondence between each channel of the sound card and each court trial role; and connecting the sound collector of each court trial role to its channel according to that correspondence.
To achieve the above purpose, according to another aspect of this application, an apparatus for processing an audio file is provided, comprising: an acquisition unit, configured to collect sound through a multi-channel sound card during a court trial, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it; a recognition unit, configured to parse the collected sound signal and identify multiple critical fields in the text information corresponding to the sound signal; an acquiring unit, configured to obtain the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field; and a playback unit, configured to display each critical field together with its temporal information when the target audio file is played.
Further, the acquisition unit is also configured to collect sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file contains multiple sound signals. The recognition unit is also configured to parse the multiple sound signals in the original audio file and identify multiple critical fields in the text information corresponding to the sound signals. The acquiring unit further comprises: an obtaining module, configured to obtain the temporal information of each of the critical fields; an adding module, configured to add the obtained temporal information of each critical field to the original audio file; and a determining module, configured to take the original audio file after the addition as the target audio file.
To achieve the above purpose, according to another aspect of this application, a storage medium is provided. The storage medium includes a stored program, wherein the program performs the method for processing an audio file described in any one of the above.
To achieve the above purpose, according to another aspect of this application, a processor is provided. The processor is configured to run a program, wherein the program, when run, performs the method for processing an audio file described in any one of the above.
This application uses the following steps: during a court trial, collecting sound through a multi-channel sound card, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it; parsing the collected sound signal and identifying multiple critical fields in the text information corresponding to the sound signal; obtaining the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field; and displaying each critical field together with its temporal information when the target audio file is played. This solves the problem in the related art that obtaining target information from the sound signals collected in a court trial is inefficient. Because each critical field and its temporal information are displayed while the target audio file is played, target information can be located quickly in the target audio file from the displayed critical fields and their temporal information, which improves the efficiency of obtaining target information from trial audio files.
Brief description of the drawings
The accompanying drawings, which form a part of this application, are provided for further understanding of the application. The illustrative embodiments of the application and their descriptions are used to explain the application and do not unduly limit it. In the drawings:
Fig. 1 is a flowchart of the method for processing an audio file according to an embodiment of this application;
Fig. 2 is a schematic diagram of the positions of the roles in a courtroom according to an embodiment of this application;
Fig. 3 is a schematic diagram of the sound card used in processing an audio file according to an embodiment of this application;
Fig. 4 is a schematic diagram of a sound signal in processing an audio file according to an embodiment of this application; and
Fig. 5 is a schematic diagram of the apparatus for processing an audio file according to an embodiment of this application.
Detailed description of the embodiments
It should be noted that, where no conflict arises, the embodiments of this application and the features in the embodiments may be combined with one another. The application is described in detail below with reference to the drawings and in conjunction with the embodiments.
To help those skilled in the art better understand the solution of this application, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings of the embodiments. The described embodiments are only some, not all, of the embodiments of this application. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments in this application without creative effort shall fall within the scope of protection of this application.
It should be noted that the terms "first", "second" and the like in the description, the claims and the above drawings of this application are used to distinguish similar objects and do not describe a particular order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments of the application described here can be implemented. In addition, the terms "include" and "have" and any variations of them are intended to cover non-exclusive inclusion; for example, a process, method, system, product or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product or device.
According to an embodiment of this application, a method for processing an audio file is provided.
Fig. 1 is a flowchart of the method for processing an audio file according to an embodiment of this application. As shown in Fig. 1, the method includes the following steps:
Step S101: during a court trial, collect sound through a multi-channel sound card, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it.
In this application, the approximate position of each role in the courtroom during a court trial is shown in Fig. 2. The courtroom has a multi-channel sound card connected to the clerk's computer. Each channel of the sound card corresponds to one microphone (the sound collector mentioned above), each channel corresponds to one court trial role, and the microphone of each role is connected to the sound card according to this correspondence, as shown in Fig. 3. During the trial, the multi-channel sound card captures the voices of the roles speaking into the microphones and produces sound signals. A collected sound signal is a segment of waveform data, as shown for example in Fig. 4.
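As an illustration of how one waveform per role could be recovered from a multi-channel capture, the sketch below splits an interleaved buffer into per-channel sample lists. The application does not state the sample format, so interleaved 16-bit little-endian PCM is an assumption, and `split_channels` is a hypothetical helper name.

```python
import struct


def split_channels(pcm: bytes, num_channels: int) -> list:
    """Split interleaved 16-bit little-endian PCM into one sample list per channel.

    Assumption: the sound card delivers frames as ch0, ch1, ..., chN-1 repeated.
    """
    frame_size = 2 * num_channels  # two bytes per sample, one sample per channel
    if num_channels < 1 or len(pcm) % frame_size:
        raise ValueError("PCM length is not a whole number of frames")
    samples = struct.unpack(f"<{len(pcm) // 2}h", pcm)
    # Every num_channels-th sample, starting at offset ch, belongs to channel ch.
    return [list(samples[ch::num_channels]) for ch in range(num_channels)]
```

Each returned list can then be handled as the sound signal of the role bound to that channel.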
Optionally, in the method for processing an audio file provided by the embodiments of this application, before sound is collected through the multi-channel sound card during the court trial, the method further comprises: configuring the correspondence between each channel of the sound card and each court trial role; and connecting the sound collector of each court trial role to its channel according to the correspondence.
Alternatively, the correspondence between each court trial role and each sound collector is configured, and each sound collector is then connected to the channel corresponding to its court trial role according to that correspondence.
It should be noted that the method for processing an audio file of the embodiments of this application can be applied in court trial software, that is, embedded in the court trial software. Before the trial begins, the court trial software is installed and the correspondence between roles and channels is set in the software.
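The role-to-channel configuration could be represented roughly as below. This is only a sketch: the `ChannelBinding` type, the `build_channel_map` function and the microphone labels are hypothetical, since the application does not describe how the court trial software stores the correspondence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChannelBinding:
    channel: int      # zero-based channel index on the sound card
    role: str         # court trial role assigned to the channel
    microphone: str   # hypothetical label of the sound collector


def build_channel_map(bindings: list) -> dict:
    """Validate the bindings and index them by channel.

    One channel serves exactly one role, so duplicates on either side are rejected.
    """
    channel_map = {}
    seen_roles = set()
    for binding in bindings:
        if binding.channel in channel_map:
            raise ValueError(f"channel {binding.channel} is bound twice")
        if binding.role in seen_roles:
            raise ValueError(f"role {binding.role!r} is bound twice")
        channel_map[binding.channel] = binding
        seen_roles.add(binding.role)
    return channel_map
```

Rejecting duplicate bindings up front keeps a later per-channel transcript attributable to a single role.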
Step S102: parse the collected sound signal and identify multiple critical fields in the text information corresponding to the sound signal.
It should be noted that the collected sound signal may be parsed while collection is still in progress during the trial, or the full set of sound signals may be parsed after collection has finished. This application does not specifically limit this.
It should be noted that the critical fields may be the cause of action, the case summary, foreign-related matters, evidence, judgment amount details, and so on. A critical field in this application may be determined by learning from the text information corresponding to the audio signal: for example, a first target content of the text information is determined to be the content of the cause of action, a second target content is determined to be the content of the case summary, and so on.
Step S103: obtain the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field.
It should be noted that the temporal information is the time at which each critical field occurs in the sound signal.
For example, suppose one of the critical fields is the cause of action. After learning from the text information corresponding to the audio signal, the first target content of the text information is determined to be the content of the cause of action. The start time and end time corresponding to the first target content are then matched in the sound signal and taken as the start time and end time of the cause of action; together they form the temporal information of the cause of action.
The temporal information of each critical field in the sound signal is obtained in this way.
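Matching a critical field to its start and end time can be sketched as follows. The application does not say how the text information is learned from, so plain phrase matching stands in for that step here; the `Segment` and `FieldSpan` types, the cue phrases and the timed transcript are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the start of the recording
    end: float
    text: str     # recognised text for this stretch of audio


@dataclass
class FieldSpan:
    field: str
    start: float
    end: float


def locate_fields(segments: list, cues: dict) -> list:
    """Return one span per critical field whose cue phrase occurs in the transcript.

    cues maps a field name to phrases that signal it (a stand-in for the learned model).
    A span runs from the first matching segment to the end of the last matching one.
    """
    spans = {}
    for seg in segments:
        for field, phrases in cues.items():
            if any(phrase in seg.text for phrase in phrases):
                if field in spans:
                    spans[field].end = max(spans[field].end, seg.end)
                else:
                    spans[field] = FieldSpan(field, seg.start, seg.end)
    return sorted(spans.values(), key=lambda span: span.start)
```

A real system would replace the phrase lookup with whatever classifier identifies the target content; the time bookkeeping would stay the same.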
Step S104: when the target audio file is played, display each critical field together with its temporal information.
Because each critical field and its temporal information are displayed while the target audio file is played, target information can be located quickly in the audio file from the displayed critical fields and their temporal information, which improves the efficiency of obtaining target information from trial audio files.
If the target audio file is played in a browser, the browser sends a request to the server; the uploaded audio file, the critical fields identified while the audio file was parsed, the temporal information of the critical fields and so on are read from the database and displayed. Each critical field and its temporal information are shown while the target audio file is played, which gives the browser a positioned playback function for the audio file.
Optionally, in the method for processing an audio file provided by the embodiments of this application, collecting sound through the multi-channel sound card during the court trial comprises: collecting sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file contains multiple sound signals. Parsing the collected sound signal and identifying the critical fields comprises: parsing the multiple sound signals in the original audio file and identifying multiple critical fields in the text information corresponding to the sound signals. Obtaining the temporal information of each critical field to produce the target audio file comprises: obtaining the temporal information of each of the critical fields; adding the obtained temporal information of each critical field to the original audio file; and taking the original audio file after the addition as the target audio file.
It should be noted that the original audio file is the set of sound signals collected through the multi-channel sound card during the court trial. In this scheme, the sound signals of the trial are collected first. Once collection is complete, the original audio file is parsed, the critical fields in it are identified, and the temporal information of each critical field is added to the original audio file to produce the target audio file. Each critical field and its temporal information can then be displayed while the target audio file is played, which provides positioned playback of the audio file.
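One conceivable way to add the temporal information to the original audio file is sketched below. The application does not name a container format or an embedding scheme, so everything here is an assumption: the file is taken to be RIFF/WAVE, the fields are serialised as JSON, and `ckfd` is a hypothetical, unregistered chunk id.

```python
import io
import json
import struct
import wave
from typing import Optional

CHUNK_ID = b"ckfd"  # hypothetical four-byte chunk id, not a registered RIFF chunk


def add_field_chunk(wav_bytes: bytes, fields: list) -> bytes:
    """Return a copy of a WAV file with the critical fields appended as a JSON chunk."""
    if wav_bytes[:4] != b"RIFF" or wav_bytes[8:12] != b"WAVE":
        raise ValueError("not a RIFF/WAVE file")
    payload = json.dumps(fields, ensure_ascii=False).encode("utf-8")
    chunk = CHUNK_ID + struct.pack("<I", len(payload)) + payload
    if len(payload) % 2:
        chunk += b"\x00"  # RIFF chunks are padded to an even length
    body = wav_bytes[8:] + chunk
    # The RIFF size field counts every byte after the first eight.
    return b"RIFF" + struct.pack("<I", len(body)) + body


def read_field_chunk(wav_bytes: bytes) -> Optional[list]:
    """Walk the top-level chunks and decode the critical-field chunk if present."""
    pos = 12  # skip "RIFF", the size field and "WAVE"
    while pos + 8 <= len(wav_bytes):
        chunk_id = wav_bytes[pos:pos + 4]
        (size,) = struct.unpack("<I", wav_bytes[pos + 4:pos + 8])
        if chunk_id == CHUNK_ID:
            return json.loads(wav_bytes[pos + 8:pos + 8 + size].decode("utf-8"))
        pos += 8 + size + (size % 2)
    return None


def make_silent_wav(frames: int = 80) -> bytes:
    """Build a tiny mono 16-bit WAV in memory to stand in for the original audio file."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)
        writer.setframerate(8000)
        writer.writeframes(b"\x00\x00" * frames)
    return buf.getvalue()
```

Appending the chunk after the audio data leaves the samples untouched, so the tagged file still opens with Python's standard `wave` reader while a field-aware player can read the extra chunk.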
Optionally, in the method for processing an audio file provided by the embodiments of this application, the temporal information of each critical field includes the start time and the end time of that critical field. After the temporal information of each of the critical fields is obtained, the method further comprises: storing the original audio file, each critical field, and the start time and end time of each critical field in a preset database.
After the temporal information of each of the critical fields is obtained, the original audio file, each critical field, and the start time and end time of each critical field are stored in the preset database. In other words, the complete audio file of the whole court trial is kept in the preset database, which preserves the integrity of the file.
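A minimal sketch of such a store, using SQLite from the Python standard library, is shown below. The application only says "preset database", so the engine, the table and column names, and the helper functions are assumptions made for illustration.

```python
import sqlite3


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open the database and create the two hypothetical tables if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS audio_file ("
        "id INTEGER PRIMARY KEY, name TEXT NOT NULL, data BLOB NOT NULL)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS critical_field ("
        "audio_id INTEGER NOT NULL REFERENCES audio_file(id), "
        "field TEXT NOT NULL, start_s REAL NOT NULL, end_s REAL NOT NULL)"
    )
    return conn


def save_recording(conn: sqlite3.Connection, name: str, data: bytes, fields: list) -> int:
    """Store the original audio and its (field, start, end) rows in one transaction."""
    with conn:  # commits on success, rolls back on error
        cursor = conn.execute(
            "INSERT INTO audio_file (name, data) VALUES (?, ?)", (name, data)
        )
        audio_id = cursor.lastrowid
        conn.executemany(
            "INSERT INTO critical_field VALUES (?, ?, ?, ?)",
            [(audio_id, field, start, end) for field, start, end in fields],
        )
    return audio_id


def load_fields(conn: sqlite3.Connection, audio_id: int) -> list:
    """Read back the critical fields of one recording, ordered by start time."""
    rows = conn.execute(
        "SELECT field, start_s, end_s FROM critical_field "
        "WHERE audio_id = ? ORDER BY start_s",
        (audio_id,),
    )
    return rows.fetchall()
```

A server answering the browser's request could call `load_fields` and return the rows alongside the audio itself.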
Optionally, in the method for processing an audio file provided by the embodiments of this application, the temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed during playback of the target audio file, the method further comprises: determining, from the start time of each critical field, the position corresponding to that critical field on the playback progress bar, wherein the playback progress bar shows the playback progress while the target audio file is played; and adding a label for each critical field at its corresponding position on the playback progress bar. Displaying each critical field together with its temporal information when the target audio file is played then comprises: displaying, while the target audio file is played, the playback progress bar carrying the label of each critical field.
For example, if the start time of the cause of action is the 14th second of the audio file, the corresponding label is added at the 14-second position of the playback progress bar, and the progress bar carrying the label of each critical field is displayed while the target audio file is played.
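The mapping from a start time to a position on the progress bar is a simple proportion, sketched below. The function name, the pixel-based bar width and the total duration are illustrative assumptions; only the 14-second start time comes from the example above.

```python
def marker_position(start_s: float, duration_s: float, bar_width_px: int) -> int:
    """Return the horizontal pixel offset of a critical-field label on the progress bar."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    if not 0 <= start_s <= duration_s:
        raise ValueError("start time lies outside the recording")
    # The label sits at the same fraction of the bar as the start time is of the audio.
    return round(start_s / duration_s * bar_width_px)
```

With a hypothetical 140-second recording and a 600-pixel bar, the label for a field starting at second 14 lands one tenth of the way along the bar.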
Optionally, in the method for processing an audio file provided by the embodiments of this application, the temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed during playback of the target audio file, the method further comprises: creating a playback information table, wherein the playback information table contains each critical field and the start time of each critical field. Displaying each critical field together with its temporal information when the target audio file is played then comprises: displaying the playback information table while the target audio file is played.
For example, the critical fields in the embodiments of this application may be the cause of action, the case summary, foreign-related matters, evidence, judgment amount details, and so on. A playback information table is created from each critical field and its start time, for example as shown in Table 1:
Table 1
Critical field | Cause of action | Case summary | Foreign-related matters | Evidence | Judgment amount details
Start time (s) | 14 | 23 | 35 | 60 | 74
Table 1 is displayed while the target audio file is played; this application does not limit where Table 1 is displayed. From the information in Table 1, the user can see directly at which time points in the audio file the various pieces of target information are located and can jump straight to the corresponding time to obtain the target information, which improves the efficiency of obtaining target information from trial audio files.
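The playback information table and the jump it enables can be sketched as below, using the start times from Table 1 with English renderings of the field names. The function names are hypothetical, and `current_field` is an extra convenience that the application does not describe.

```python
import bisect
from typing import Optional


def build_playback_table(start_times: dict) -> list:
    """Return (field, start time in seconds) rows ordered by start time."""
    return sorted(start_times.items(), key=lambda row: row[1])


def seek_time(table: list, field: str) -> float:
    """Look up the time the player should jump to for a given critical field."""
    for name, start in table:
        if name == field:
            return start
    raise KeyError(field)


def current_field(table: list, position_s: float) -> Optional[str]:
    """Return the most recently started critical field at a playback position."""
    starts = [start for _, start in table]
    index = bisect.bisect_right(starts, position_s) - 1
    return table[index][0] if index >= 0 else None


TABLE_1 = build_playback_table({
    "Cause of action": 14,
    "Case summary": 23,
    "Foreign-related matters": 35,
    "Evidence": 60,
    "Judgment amount details": 74,
})
```

Clicking a row would pass `seek_time(TABLE_1, field)` to the player, while `current_field` could highlight the row that matches the current playback position.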
In summary, in the method for processing an audio file provided by the embodiments of this application, sound is collected through a multi-channel sound card during a court trial to obtain an audio file, wherein the audio file contains multiple sound signals, each channel of the sound card corresponds to one sound collector, and each sound collector captures the voice of the person using it; the sound signals in the audio file are parsed and multiple critical fields in the corresponding text information are identified; the temporal information of each of the critical fields is obtained; and each critical field is displayed together with its temporal information when the target audio file is played. This solves the problem in the related art that obtaining target information from court trial audio files is inefficient. Because each critical field and its temporal information are displayed while the target audio file is played, target information can be located quickly in the audio file from the displayed critical fields and their temporal information, which improves the efficiency of obtaining target information from trial audio files.
It should be noted that the steps shown in the flowcharts of the drawings may be executed in a computer system, for example as a set of computer-executable instructions. Although a logical order is shown in the flowcharts, in some cases the steps shown or described may be executed in a different order.
The embodiments of this application also provide an apparatus for processing an audio file. It should be noted that the apparatus of the embodiments of this application can be used to execute the method for processing an audio file provided by the embodiments of this application. The apparatus is described below.
Fig. 5 is a schematic diagram of the apparatus for processing an audio file according to an embodiment of this application. As shown in Fig. 5, the apparatus includes an acquisition unit 10, a recognition unit 20, an acquiring unit 30 and a playback unit 40.
Specifically, the acquisition unit 10 is configured to collect sound through a multi-channel sound card during a court trial, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it.
The recognition unit 20 is configured to parse the collected sound signal and identify multiple critical fields in the text information corresponding to the sound signal.
The acquiring unit 30 is configured to obtain the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field.
The playback unit 40 is configured to display each critical field together with its temporal information when the target audio file is played.
In the apparatus for processing an audio file provided by the embodiments of this application, the acquisition unit 10 collects sound through a multi-channel sound card during a court trial, wherein each channel of the sound card corresponds to one sound collector and each sound collector captures the voice of the person using it; the recognition unit 20 parses the collected sound signal and identifies multiple critical fields in the text information corresponding to the sound signal; the acquiring unit 30 obtains the temporal information of each of the critical fields to produce a target audio file, wherein the target audio file carries the critical fields and the temporal information of each critical field; and the playback unit 40 displays each critical field together with its temporal information when the target audio file is played.
Optionally, in the audio file processing apparatus provided by the embodiments of the present application, the acquisition unit is further configured to collect sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file includes multiple sound signals. The recognition unit is further configured to parse the multiple sound signals in the original audio file and identify multiple critical fields in the text information corresponding to the sound signals. The acquiring unit further includes: an obtaining module, configured to obtain the temporal information of each of the multiple critical fields; an adding module, configured to add the obtained temporal information of each critical field to the original audio file; and a determining module, configured to take the original audio file on which the adding has been performed as the target audio file.
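The application does not say how the temporal information is embedded in the original audio file. The sketch below therefore models an audio file as an in-memory record with a marker list, which is purely an assumption for illustration; `AudioFile` and `add_temporal_information` are hypothetical names.

```python
from dataclasses import dataclass, replace
from typing import Iterable, Tuple

Marker = Tuple[str, float]  # (critical field, start time in seconds)


@dataclass(frozen=True)
class AudioFile:
    samples: bytes                    # audio payload, left untouched
    markers: Tuple[Marker, ...] = ()  # critical fields with their start times


def add_temporal_information(original: AudioFile,
                             critical_fields: Iterable[Marker]) -> AudioFile:
    """Return a target audio file: the original audio plus sorted markers."""
    combined = original.markers + tuple(critical_fields)
    ordered = tuple(sorted(combined, key=lambda marker: marker[1]))
    # replace() copies the record, so the original file object is not modified.
    return replace(original, markers=ordered)
```

Returning a new record rather than mutating the original mirrors the distinction the text draws between the original audio file and the target audio file.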
Optionally, in the audio file processing apparatus provided by the embodiments of the present application, the temporal information of each critical field includes the start time of that critical field, and the apparatus further includes a determination unit. Before each critical field and its temporal information are displayed while the target audio file is played, the determination unit determines, based on the start time of each critical field, the position corresponding to that critical field on a playback progress bar, wherein the playback progress bar shows playback progress while the target audio file is played, and adds a label for each critical field at its corresponding position on the playback progress bar. The broadcast unit is further configured to display, while the target audio file is played, the playback progress bar carrying the label of each critical field.
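One plausible way to turn a start time into a position on the playback progress bar is linear scaling by the total duration. The sketch below assumes a bar measured in pixels and a known duration in seconds; neither detail comes from the application, and `label_positions` is a hypothetical name.

```python
from typing import List, Sequence, Tuple


def label_positions(critical_fields: Sequence[Tuple[str, float]],
                    duration_s: float,
                    bar_width_px: int) -> List[Tuple[str, int]]:
    """Map each (critical field, start time) to a pixel offset on the bar."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    positions = []
    for text, start_s in critical_fields:
        # Clamp so that a label never falls outside the bar.
        clamped = min(max(start_s, 0.0), duration_s)
        positions.append((text, round(clamped / duration_s * bar_width_px)))
    return positions
```

For a 120-second recording shown on a 400-pixel bar, a critical field starting at 30 seconds lands a quarter of the way along the bar.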
Optionally, in the audio file processing apparatus provided by the embodiments of the present application, the temporal information of each critical field includes the start time of that critical field, and the apparatus further includes a creating unit configured to create a broadcast information table before each critical field and its temporal information are displayed while the target audio file is played, wherein the broadcast information table contains each critical field and the start time of each critical field. The broadcast unit is further configured to display the broadcast information table while the target audio file is played.
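A broadcast information table of this kind could be built as a list of rows ordered by start time. The following is a minimal sketch; the `hh:mm:ss` display format and the names `format_timestamp` and `build_information_table` are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def format_timestamp(seconds: float) -> str:
    """Format a time offset as hh:mm:ss, discarding fractions of a second."""
    hours, remainder = divmod(int(seconds), 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def build_information_table(
        critical_fields: Sequence[Tuple[str, float]]) -> List[Tuple[str, str]]:
    """Return (critical field, start time) rows sorted by start time."""
    ordered = sorted(critical_fields, key=lambda item: item[1])
    return [(text, format_timestamp(start_s)) for text, start_s in ordered]
```

Sorting by start time lets a listener scan the table in the same order in which the critical fields occur during playback.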
Optionally, in the audio file processing apparatus provided by the embodiments of the present application, the temporal information of each critical field includes the start time and the end time of that critical field, and the apparatus further includes a storage unit configured to, after the temporal information of each of the multiple critical fields is obtained, store the original audio file, each critical field, the start time of each critical field, and the end time of each critical field in a preset database.
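The storage step might look like the following sketch, which uses SQLite from the Python standard library as a stand-in for the preset database. The table layout, the choice to store a path to the original audio file rather than the audio itself, and the function names are all assumptions.

```python
import sqlite3
from typing import Iterable, List, Tuple


def store_critical_fields(conn: sqlite3.Connection,
                          audio_path: str,
                          critical_fields: Iterable[Tuple[str, float, float]]) -> None:
    """Store (critical field, start, end) rows for one original audio file."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS critical_fields ("
        "audio_path TEXT NOT NULL, field TEXT NOT NULL, "
        "start_s REAL NOT NULL, end_s REAL NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO critical_fields VALUES (?, ?, ?, ?)",
        [(audio_path, text, start, end) for text, start, end in critical_fields],
    )
    conn.commit()


def fields_between(conn: sqlite3.Connection, audio_path: str,
                   t0: float, t1: float) -> List[Tuple[str, float, float]]:
    """Return the critical fields of a file that start within [t0, t1)."""
    cursor = conn.execute(
        "SELECT field, start_s, end_s FROM critical_fields "
        "WHERE audio_path = ? AND start_s >= ? AND start_s < ? ORDER BY start_s",
        (audio_path, t0, t1),
    )
    return cursor.fetchall()
```

Keeping both the start time and the end time makes it possible to look up which critical fields fall inside a given stretch of the recording.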
Optionally, in the audio file processing apparatus provided by the embodiments of the present application, the apparatus further includes: a configuration unit, configured to configure, before sound is collected through the multi-channel sound card during the court trial, the correspondence between each channel on the sound card and each court trial participant role; and a connection unit, configured to connect, according to the correspondence, the sound collector of each court trial participant role to the corresponding channel.
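The correspondence between channels and court trial participant roles can be represented as a simple mapping. The sketch below assigns roles to channels in order and rejects configurations that cannot be one-to-one; the role names and the function name `configure_channels` are hypothetical.

```python
from typing import Dict, Sequence


def configure_channels(roles: Sequence[str], channel_count: int) -> Dict[int, str]:
    """Assign each role to its own channel index, starting from channel 0."""
    if len(roles) > channel_count:
        raise ValueError("more roles than channels on the sound card")
    if len(set(roles)) != len(roles):
        raise ValueError("each role must appear only once")
    return {channel: role for channel, role in enumerate(roles)}
```

With such a mapping, the sound signal recorded on a given channel can be attributed to the role whose sound collector is connected to that channel.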
The audio file processing apparatus includes a processor and a memory. The acquisition unit 10, the recognition unit 20, the acquiring unit 30, the broadcast unit 40, and so on are stored in the memory as program units, and the processor executes these program units stored in the memory to implement the corresponding functions.
The processor contains a kernel, which retrieves the corresponding program unit from the memory. One or more kernels may be provided, and audio files are processed by adjusting kernel parameters.
The memory may include volatile memory in a computer-readable medium, random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM); the memory includes at least one memory chip.
An embodiment of the present invention provides a storage medium on which a program is stored, and the program implements the audio file processing method when executed by a processor.
An embodiment of the present invention provides a processor configured to run a program, wherein the program performs the audio file processing method when run.
An embodiment of the present invention provides a device including a processor, a memory, and a program stored in the memory and executable on the processor. When executing the program, the processor performs the following steps: collecting sound through a multi-channel sound card during a court trial, wherein each channel on the sound card corresponds to one sound collector, and each sound collector is used to collect the sound of the person using it; parsing the collected sound signals and identifying multiple critical fields in the text information corresponding to the sound signals; obtaining the temporal information of each of the multiple critical fields to obtain a target audio file, wherein the target audio file carries the multiple critical fields and the temporal information of each of them; and displaying each critical field together with its temporal information while the target audio file is played.
Collecting sound through the multi-channel sound card during the court trial includes: collecting sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file includes multiple sound signals. Parsing the collected sound signals and identifying multiple critical fields in the corresponding text information includes: parsing the multiple sound signals in the original audio file and identifying multiple critical fields in the text information corresponding to the sound signals. Obtaining the temporal information of each of the multiple critical fields to obtain the target audio file includes: obtaining the temporal information of each of the multiple critical fields; adding the obtained temporal information of each critical field to the original audio file; and taking the original audio file on which the adding has been performed as the target audio file.
The temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed while the target audio file is played, the method further includes: determining, based on the start time of each critical field, the position corresponding to that critical field on a playback progress bar, wherein the playback progress bar shows playback progress while the target audio file is played; and adding a label for each critical field at its corresponding position on the playback progress bar. Displaying each critical field and its temporal information while the target audio file is played includes: displaying, while the target audio file is played, the playback progress bar carrying the label of each critical field.
The temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed while the target audio file is played, the method further includes: creating a broadcast information table, wherein the broadcast information table contains each critical field and the start time of each critical field. Displaying each critical field and its temporal information while the target audio file is played includes: displaying the broadcast information table while the target audio file is played.
The temporal information of each critical field includes the start time and the end time of that critical field. After the temporal information of each of the multiple critical fields is obtained, the method further includes: storing the original audio file, each critical field, the start time of each critical field, and the end time of each critical field in a preset database.
Before sound is collected through the multi-channel sound card during the court trial, the method further includes: configuring the correspondence between each channel on the sound card and each court trial participant role; and connecting, according to the correspondence, the sound collector of each court trial participant role to the corresponding channel. The device here may be a server, a PC, a PAD, a mobile phone, or the like.
The present application also provides a computer program product which, when executed on a data processing device, is adapted to execute a program initialized with the following method steps: collecting sound through a multi-channel sound card during a court trial, wherein each channel on the sound card corresponds to one sound collector, and each sound collector is used to collect the sound of the person using it; parsing the collected sound signals and identifying multiple critical fields in the text information corresponding to the sound signals; obtaining the temporal information of each of the multiple critical fields to obtain a target audio file, wherein the target audio file carries the multiple critical fields and the temporal information of each of them; and displaying each critical field together with its temporal information while the target audio file is played.
Collecting sound through the multi-channel sound card during the court trial includes: collecting sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file includes multiple sound signals. Parsing the collected sound signals and identifying multiple critical fields in the corresponding text information includes: parsing the multiple sound signals in the original audio file and identifying multiple critical fields in the text information corresponding to the sound signals. Obtaining the temporal information of each of the multiple critical fields to obtain the target audio file includes: obtaining the temporal information of each of the multiple critical fields; adding the obtained temporal information of each critical field to the original audio file; and taking the original audio file on which the adding has been performed as the target audio file.
The temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed while the target audio file is played, the method further includes: determining, based on the start time of each critical field, the position corresponding to that critical field on a playback progress bar, wherein the playback progress bar shows playback progress while the target audio file is played; and adding a label for each critical field at its corresponding position on the playback progress bar. Displaying each critical field and its temporal information while the target audio file is played includes: displaying, while the target audio file is played, the playback progress bar carrying the label of each critical field.
The temporal information of each critical field includes the start time of that critical field. Before each critical field and its temporal information are displayed while the target audio file is played, the method further includes: creating a broadcast information table, wherein the broadcast information table contains each critical field and the start time of each critical field. Displaying each critical field and its temporal information while the target audio file is played includes: displaying the broadcast information table while the target audio file is played.
The temporal information of each critical field includes the start time and the end time of that critical field. After the temporal information of each of the multiple critical fields is obtained, the method further includes: storing the original audio file, each critical field, the start time of each critical field, and the end time of each critical field in a preset database.
Before sound is collected through the multi-channel sound card during the court trial, the method further includes: configuring the correspondence between each channel on the sound card and each court trial participant role; and connecting, according to the correspondence, the sound collector of each court trial participant role to the corresponding channel.
Those skilled in the art should understand that embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, and the like) containing computer-usable program code.
The present application is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present application. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, and the instruction means implement the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps are performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include volatile memory in a computer-readable medium, random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media include permanent and non-permanent, removable and non-removable media, which can store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape or disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
It should also be noted that the terms "include", "comprise", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, commodity, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, commodity, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of other identical elements in the process, method, commodity, or device that includes the element.
The above are merely embodiments of the present application and are not intended to limit it. Various modifications and changes to the present application are possible for those skilled in the art. Any modification, equivalent replacement, improvement, or the like made within the spirit and principles of the present application shall fall within the scope of the claims of the present application.

Claims (10)

1. An audio file processing method, characterized by comprising:
collecting sound through a multi-channel sound card during a court trial, wherein each channel on the sound card corresponds to one sound collector, and each sound collector is used to collect the sound of the person using it;
parsing the collected sound signals and identifying multiple critical fields in the text information corresponding to the sound signals;
obtaining the temporal information of each of the multiple critical fields to obtain a target audio file, wherein the target audio file carries the multiple critical fields and the temporal information of each of the multiple critical fields; and
displaying each critical field and the temporal information of each critical field simultaneously while the target audio file is played.
2. The method according to claim 1, characterized in that:
collecting sound through the multi-channel sound card during the court trial comprises: collecting sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file includes multiple sound signals;
parsing the collected sound signals and identifying multiple critical fields in the text information corresponding to the sound signals comprises: parsing the multiple sound signals in the original audio file and identifying multiple critical fields in the text information corresponding to the sound signals; and
obtaining the temporal information of each of the multiple critical fields to obtain the target audio file comprises:
obtaining the temporal information of each of the multiple critical fields;
adding the obtained temporal information of each of the multiple critical fields to the original audio file; and
taking the original audio file on which the adding has been performed as the target audio file.
3. The method according to claim 1 or 2, characterized in that the temporal information of each critical field includes the start time of that critical field,
before each critical field and the temporal information of each critical field are displayed simultaneously while the target audio file is played, the method further comprises: determining, based on the start time of each critical field, the position corresponding to that critical field on a playback progress bar, wherein the playback progress bar shows playback progress while the target audio file is played; and adding a label for each critical field at its corresponding position on the playback progress bar; and
displaying each critical field and the temporal information of each critical field simultaneously while the target audio file is played comprises: displaying, while the target audio file is played, the playback progress bar carrying the label of each critical field.
4. The method according to claim 1 or 2, characterized in that the temporal information of each critical field includes the start time of that critical field,
before each critical field and the temporal information of each critical field are displayed simultaneously while the target audio file is played, the method further comprises: creating a broadcast information table, wherein the broadcast information table contains each critical field and the start time of each critical field; and
displaying each critical field and the temporal information of each critical field simultaneously while the target audio file is played comprises: displaying the broadcast information table while the target audio file is played.
5. The method according to claim 2, characterized in that the temporal information of each critical field includes the start time and the end time of that critical field, and after the temporal information of each of the multiple critical fields is obtained, the method further comprises:
storing the original audio file, each critical field, the start time of each critical field, and the end time of each critical field in a preset database.
6. The method according to claim 1 or 2, characterized in that before sound is collected through the multi-channel sound card during the court trial, the method further comprises:
configuring the correspondence between each channel on the sound card and each court trial participant role; and
connecting, according to the correspondence, the sound collector of each court trial participant role to the corresponding channel.
7. An audio file processing apparatus, characterized by comprising:
an acquisition unit, configured to collect sound through a multi-channel sound card during a court trial, wherein each channel on the sound card corresponds to one sound collector, and each sound collector is used to collect the sound of the person using it;
a recognition unit, configured to parse the collected sound signals and identify multiple critical fields in the text information corresponding to the sound signals;
an acquiring unit, configured to obtain the temporal information of each of the multiple critical fields to obtain a target audio file, wherein the target audio file carries the multiple critical fields and the temporal information of each of the multiple critical fields; and
a broadcast unit, configured to display each critical field and the temporal information of each critical field simultaneously while the target audio file is played.
8. The apparatus according to claim 7, characterized in that:
the acquisition unit is further configured to collect sound through the multi-channel sound card during the court trial to obtain an original audio file, wherein the original audio file includes multiple sound signals;
the recognition unit is further configured to parse the multiple sound signals in the original audio file and identify multiple critical fields in the text information corresponding to the sound signals; and
the acquiring unit further comprises:
an obtaining module, configured to obtain the temporal information of each of the multiple critical fields;
an adding module, configured to add the obtained temporal information of each of the multiple critical fields to the original audio file; and
a determining module, configured to take the original audio file on which the adding has been performed as the target audio file.
9. A storage medium, characterized in that the storage medium includes a stored program, wherein the program performs the audio file processing method according to any one of claims 1 to 6.
10. A processor, characterized in that the processor is configured to run a program, wherein the program, when run, performs the audio file processing method according to any one of claims 1 to 6.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710890678.5A 2017-09-27 2017-09-27 The treating method and apparatus of audio file

Publications (1)

Publication Number Publication Date
CN109559764A 2019-04-02

Family

ID=65864033

Country Status (1)

Country Link
CN (1) CN109559764A (en)




Legal Events

Code Title
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing
Applicant after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.
Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A
Applicant before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.
RJ01 Rejection of invention patent application after publication
Application publication date: 20190402