CN111312295A - Holographic sound recording method and device and recording equipment - Google Patents

Holographic sound recording method and device and recording equipment Download PDF

Info

Publication number
CN111312295A
CN111312295A CN201811519792.8A CN201811519792A CN111312295A CN 111312295 A CN111312295 A CN 111312295A CN 201811519792 A CN201811519792 A CN 201811519792A CN 111312295 A CN111312295 A CN 111312295A
Authority
CN
China
Prior art keywords
sound source
recording
microphone
microphone array
target sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811519792.8A
Other languages
Chinese (zh)
Other versions
CN111312295B (en
Inventor
姜德军
吴海全
张恩勤
曹磊
师瑞文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Grandsun Electronics Co Ltd
Original Assignee
Shenzhen Grandsun Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Grandsun Electronics Co Ltd filed Critical Shenzhen Grandsun Electronics Co Ltd
Priority to CN201811519792.8A priority Critical patent/CN111312295B/en
Publication of CN111312295A publication Critical patent/CN111312295A/en
Application granted granted Critical
Publication of CN111312295B publication Critical patent/CN111312295B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C7/00Arrangements for writing information into, or reading information out from, a digital store
    • G11C7/16Storage of analogue signals in digital stores using an arrangement comprising analogue/digital [A/D] converters, digital memories and digital/analogue [D/A] converters 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/027Spatial or constructional arrangements of microphones, e.g. in dummy heads
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10546Audio or video recording specifically adapted for audio data
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10592Audio or video recording specifically adapted for recording or reproducing multichannel signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention is suitable for the technical field of multi-channel recording, and provides a holographic sound recording method, a holographic sound recording device and terminal equipment, wherein the method comprises the following steps: in the microphone array recording process, the angle of a target sound source is obtained through the microphone array; recording and saving the angle and audio data of the target sound source. The invention solves the problem that the sound information can not be accurately recorded for a continuous or mobile sound source in the time-varying process of the sound.

Description

Holographic sound recording method and device and recording equipment
Technical Field
The invention belongs to the technical field of multi-channel recording, and particularly relates to a holographic sound recording method, a holographic sound recording device and recording equipment.
Background
The use of microphone-based recording technology is now widespread, and microphone arrays are commonly used in many digital devices. In the prior art, stereo or multi-channel recording is carried out through a microphone array, the process that the amplitude of sound changes along with time is recorded, only sound intensity information exists, partial data of a sound source is lost, and recorded audio data are not comprehensive enough; in addition, in the prior art, the relative time difference of the sound source signals reaching the microphone array and the sound propagation speed are estimated through an algorithm, the sound source information is estimated, and accurate recording of target information cannot be realized.
Disclosure of Invention
In view of this, embodiments of the present invention provide a method and an apparatus for recording a holographic sound, and a recording device, so as to solve the problem in the prior art that sound information cannot be accurately recorded for a continuous or moving sound source.
A first aspect of an embodiment of the present invention provides a method for recording a holographic sound, including:
in the microphone array recording process, the angle of a target sound source is obtained through the microphone array;
recording and saving the angle and audio data of the target sound source.
A second aspect of embodiments of the present invention provides a holographic sound recording apparatus, including:
the microphone array is used for acquiring the angle of a target sound source in the microphone array recording process;
and the data storage unit is used for recording and saving the angle and the audio data of the target sound source.
A third aspect of an embodiment of the present invention provides a sound recording apparatus, including: a memory, a processor and a computer program stored in the memory and executable on the processor, the processor implementing the steps of the above-described holographic sound recording method when executing the computer program.
A fourth aspect of embodiments of the present invention provides a computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the above-described holographic sound recording method.
Compared with the prior art, the embodiment of the invention has the following beneficial effects: by the embodiment of the invention, the angle of the target sound source can be obtained by the microphone array in the process of microphone array recording, the angle of the target sound source and audio data are recorded and stored, and more sound information can be obtained and recorded without comparison of sound transmission of multiple sound channels; the method can also record continuous or moving sound more comprehensively, realize the recording of more information of holographic sound, and ensure that the sound is better restored, and has stronger usability and practicability.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a schematic flow chart of an implementation of a holographic sound recording method provided by an embodiment of the present invention;
fig. 2 is a schematic diagram of a geometric principle of a single microphone array recording method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a geometric principle of a recording method with two sets of microphone arrays according to an embodiment of the present invention;
fig. 4 is a schematic geometric schematic diagram of a three-set microphone array recording method according to an embodiment of the present invention;
fig. 5 is a schematic view of a holographic sound recording apparatus provided by an embodiment of the present invention;
fig. 6 is a schematic diagram of a terminal device according to an embodiment of the present invention.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to and includes any and all possible combinations of one or more of the associated listed items.
In order to explain the technical means of the present invention, the following description will be given by way of specific examples.
Referring to fig. 1, a schematic flow chart of an implementation of the method for recording holographic sound according to the embodiment of the present invention is shown, where the method may be applied to a smart speaker or other recording devices. As shown, the method may include the steps of:
and step S101, acquiring the angle of a target sound source through a microphone array in the microphone array recording process.
In the embodiment of the invention, the microphone array for sound recording can be a group of microphone arrays or a plurality of groups of microphone arrays; the target sound source can be a sound source for making voice, a sound source for making continuous voice with time change or a moving sound source; in the recording process, the microphone array can directly acquire the angle of the voice sound source, and the microphone array can provide information such as the frequency and amplitude of sound and can also provide direction information of the sound source.
In addition, if the set of microphone arrays is used for planar recording, the provided azimuth information is the angle information of the target sound source, and as shown in fig. 2, the geometric principle schematic diagram of the single microphone array recording method provided by the embodiment of the present invention can provide the angle a of the target sound source while the microphone array provides the audio data V, and can record the set of azimuth information as S, the azimuth information S can be recorded as { a } in the case of recording by the set of microphone arrays.
And step S102, recording and storing the angle and the audio data of the target sound source.
In the embodiment of the present invention, the recording component collects and stores the original data of the voice during recording, where the original data includes: audio data of the voice and azimuth information of the target sound source; when the single microphone array is used for planar recording, the recorded original data comprise audio data of voice and the angle of a target sound source; when two or more sets of microphone arrays record, the azimuth information of the target sound source comprises the angle of the target sound source relative to each set of microphone arrays and the relative position between the multiple sets of microphone arrays, and the collected and stored original data of the voice comprises the audio data, the angle and the distance of the voice.
In addition, the audio data includes information such as sound amplitude, frequency, multi-channel, and stereo.
Further, as shown in fig. 3, a schematic diagram of a geometric principle of two sets of microphone arrays recording methods provided by the embodiment of the present invention is provided, where the microphone arrays are two sets of microphone arrays whose center points are on the same plane, and the method further includes:
a1, obtaining the distance between the two microphone arrays and obtaining two angles of the target sound source relative to the two microphone arrays;
a2, recording and saving the distance between the two microphone arrays and the two angles, and determining the position coordinates of the target sound source according to the distance between the two microphone arrays and the two angles.
In the embodiment of the present invention, the distance between two sets of microphone arrays having center points in the same plane may be obtained by measuring, as shown in fig. 3, the distance between the microphone array 1 and the microphone array 2 is measured to be x, and the position of the microphone array 1 is taken as an origin, then in a two-dimensional coordinate system, the angle between the target sound source and the microphone array 1 is a1, the angle between the target sound source and the microphone array 2 is a2, and the azimuth information S of the target sound source is recorded as a set { a1, a2, x }; the azimuth information and the audio data of the target sound source are stored as original information of voice, the position information of the voice is not lost, and the calculation can be carried out according to the original information when the voice needs to be restored to obtain the specific position coordinates of the target sound source; in addition, the angle may be expressed in a polar coordinate system.
Further, as shown in fig. 4, a schematic diagram of a geometric principle of a three-set microphone array recording method provided by an embodiment of the present invention is provided, where the microphone array is a three-dimensional three-set or more than three-set microphone array, and the method further includes:
b1, acquiring the relative positions of three or more groups of microphone arrays, and acquiring a plurality of angles of the target sound source relative to the multiple groups of microphone arrays;
b2, recording and saving the relative position and the angles between the microphone arrays, and determining the position coordinates of the target sound source according to the relative position and the angles between the microphone arrays.
In the embodiment of the present invention, the microphone array may be three or more than three stereo microphone arrays, and first obtain the relative position between the microphone arrays by measurement, where the relative position includes: relative distance and angle between microphone arrays. If the centers of the multiple microphone arrays are in the same plane, the relative positions are represented by distances, and if the centers of the multiple microphone arrays are not in the same plane, the relative positions are represented by distances and angles. As shown in fig. 4, if three sets of microphone arrays perform stereo recording, the center points of the three sets of microphone arrays are located on the same plane, in the three-dimensional coordinate system, the microphone array 1 is used as the origin of the three-dimensional coordinate system, the microphone array 1 is perpendicular to the z-axis, the microphone array 2 is located on the x-axis and perpendicular to the z-axis, and the microphone array 3 is perpendicular to the y-axis; if the target sound source and the microphone array are not in the same plane, the microphone array acquires the angle of the vertical projection point of the target sound source in the plane of the microphone array. As shown in fig. 4, the microphone array 1 can provide an angle a1 of a target sound source in an xy plane perpendicular to the z axis while recording the voice data V1; the microphone array 2 can provide an angle a2 of a target sound source in an xy plane perpendicular to the z axis while recording the voice data V2; the microphone array 3 can provide an angle a3 of a target sound source in an xz plane perpendicular to the y axis while recording the voice data V3; meanwhile, when the distance between the microphone array 1 and the microphone array 2 is Mx and the distance between the microphone array 1 and the microphone array 3 is My, the direction information S of the target sound source can be recorded as a set { a1, a2, a3, Mx, My }. And storing the original information which is not calculated, the angle directly acquired by each group of microphone arrays, the relative position between the microphone arrays obtained by measurement and the audio data acquired by each microphone.
Further, the recording and storing the relative position and the plurality of angles between the microphone arrays and determining the position information of the target sound source according to the relative position and the plurality of angles between the microphone arrays includes:
if three-dimensional sound recording is carried out on the three microphone arrays, the central points of the three microphone arrays are located on the same plane, in a three-dimensional coordinate system, the microphone array 1 is used as the origin of the three-dimensional coordinate system, the microphone array 1 is vertical to the z axis, the microphone array 2 is located on the x axis and is vertical to the z axis, and the microphone array 3 is vertical to the y axis; the first angle of the target sound source obtained by the microphone array 1 is a1, the second angle of the target sound source obtained by the microphone array 2 is a2, the angle of the target sound source obtained by the microphone array 3 is a3, the distance between the first microphone array and the second microphone array is Mx, and the distance between the second microphone array and the third microphone array is My, then according to the formula:
Figure BDA0001902929750000061
and calculating to obtain the space coordinates (x, y, z) of the target sound source.
Other arrangements of microphone array arrangements may be solved in a similar manner.
In the embodiment of the invention, the microphone array can store original data of voice in the recording process, based on the direction information S of the target sound source, simplified data can be obtained through positioning operation to represent the space position coordinate information of the target sound source, and the obtained simplified data is stored instead of the direction information S. As shown in fig. 4, the spatial position coordinates (x, y, z) of the sound source with respect to the origin of the coordinate system (microphone array 1), that is, the coordinates of the intersection of the target sound source and the three planes formed by the microphones, are calculated from the set { a1, a2, a3, Mx, My } of the azimuth information S of the target sound source.
Different layouts of the microphone arrays and different solving processes are adopted, if the angle a1 of the microphone array 1 is obtained, the target sound source can be determined to be in a plane passing through the z-axis and the angle and the x-axis are a1, and then the plane equation is as follows:
-tan(a1)×x+y=0;
from the angles a2 and Mx of the microphone array 2, it can be determined that the target sound source is in the plane with the straight line x being Mx, y being 0, and the angle and x axis being a2, then the plane equation is:
-tan(a2)×(x-Mx)+y=0;
similarly, the plane equation through the y-axis and at an angle a3 with the x-axis is:
-tan(a3)×x+z=0;
and solving the three plane equations to obtain the space position coordinates (x, y, z) of the target sound source.
Optionally, in the embodiment of the present invention, a GPS module may be further added to the microphone array, so as to record the GPS location information of the recording point while recording the relative location of the sound source with respect to the recording point.
In addition, in the embodiment of the invention, the microphone array is used as the origin to record the angle and distance information of the sound source; and through data operation, the sound source can be used as an origin to identify the angle and distance information of the microphone array (recording point).
Optionally, the obtained spatial position coordinates (x, y, z) of the target sound source may also be replaced by a spatial angle plus a distance, the angle may be described by a euler angle (Pitch, Yaw, Roll), the distance may be described by a quadruple { x, y, z, w }, and a rotation matrix.
Further, the position coordinates of the target sound source obtained through the calculation are stored according to the corresponding relation between the position coordinates of the target sound source and the audio data.
For example, for the raw data acquired: the position coordinates of the target sound source are obtained through positioning operation according to the position relation among the angle information and the plurality of microphone arrays, the position coordinates are directly in one-to-one correspondence with the audio data, the position coordinates obtained through operation directly replace original data to be stored, and a data storage mode is simplified.
Furthermore, the acquisition frequency of the audio data is a first frequency, the acquisition frequency of the angle is a second frequency, and the one-to-one correspondence relationship between the angle and the audio data is established according to the second frequency; wherein the first frequency is greater than the second frequency.
In the embodiment of the present invention, the sampling frequency of the audio data and the sampling frequency of the angle information may not coincide, for example, the first frequency of the collected audio data of the voice is 48kHz, and the second frequency of the angle information collection may be set to be sampled once per second, and all sounds in this second are determined to be from the same position. The correspondence of time to audio data V and angle is shown in table 1.
Time of day Audio data V Angle of rotation
1s "good family" 0 degree
2s "welcome to Shenzhen" 15 degrees
3s Today " 30 degree
4s "weather is very good" 45 degree
TABLE 1
Further, storing the audio data, and the angle information corresponding to the audio data or the relative position information between the multiple microphone arrays to the same file or different files;
if the audio data, the angle information corresponding to the audio data and the relative position information among the plurality of microphone arrays are stored in different files, establishing a time index corresponding to the audio data;
and inquiring angle information corresponding to the audio data or relative position information among a plurality of microphone arrays in a specified time according to the time index.
In the embodiment of the present invention, the audio data and the position information may be stored in the same file and correspond to each other one by one, such as the corresponding relationship between the audio data V and the position information shown in table 2.
Figure BDA0001902929750000081
TABLE 2
In addition, the audio data V and the position information may also be stored in different files, and the corresponding relationship between the audio data V and the position information S is ensured by indexing with time, where the position information S includes angle information corresponding to the audio data or relative position information between multiple microphone arrays. The corresponding relation is the appointed section of sound, and the position information butted with the appointed section of sound can be inquired. The collected audio data V may be audio in MP3 or WAV format.
The query for location information of the speech may also be obtained by a location file query with a time tag, such as the storage form in table 3. At a specified time T, if Timestampn≤T<Timestampn+1If the corresponding position information is Sn
TABLE 3
Timestamp1 S1(Timestamp1Corresponding position information)
Timestamp2 S2(Timestamp1Corresponding position information)
Timestamp3 S3(Timestamp1Corresponding position information)
....... .......
In addition, the storage form shown in table 4 may be such that the storage position of the position information at a predetermined time is obtained by calculation without a time stamp for sampling the fixed azimuth sampling frequency, and the position information is obtained. If time T is specified, according to n ═ T × SR]+1, n is obtained, and the position information S corresponding to T time is obtainednWhere SR is the sampling frequency in Hz.
TABLE 4
SR (sampling frequency, unit Hz)
S1(the first sampling is the position information corresponding to the sound source)
S2(the second sampling is the position information corresponding to the sound source)
S3(the third sampling is the position information corresponding to the sound source)
...
By the embodiment of the invention, the audio data can be recorded and the position information of the audio can be recorded at the same time, the position information comprises angle distance and the like, so that the problems that the common audio recording only records the processed data, part of information is lost and the common audio recording cannot be suitable for a continuous mobile recording scene are avoided; and the sound field can be better restored through the recorded original data.
It should be noted that, within the technical scope of the present disclosure, other sequencing schemes that can be easily conceived by those skilled in the art should also be within the protection scope of the present disclosure, and detailed description is omitted here.
It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present invention.
Referring to fig. 5, a schematic block diagram of a mobile terminal according to a fourth embodiment of the present invention is shown, and for convenience of description, only the parts related to the embodiment of the present invention are shown.
The holographic sound recording apparatus includes:
the microphone array 51 is used for acquiring the angle of a target sound source in the microphone array recording process;
a data storage unit 52 for recording and saving the angle and audio data of the target sound source.
Further, the apparatus further comprises:
and the data processing unit is used for determining the position coordinates of the target sound source according to the distance between the two groups of microphone arrays and the two angles, or determining the position coordinates of the target sound source according to the relative positions between the multiple microphone arrays and the multiple angles.
It will be apparent to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely illustrated, and in practical applications, the above function distribution may be performed by different functional units and modules as needed, that is, the internal structure of the mobile terminal is divided into different functional units or modules to perform all or part of the above described functions. Each functional module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional modules are only used for distinguishing one functional module from another, and are not used for limiting the protection scope of the application. The specific working process of the module in the mobile terminal may refer to the corresponding process in the foregoing method embodiment, and is not described herein again.
Fig. 6 is a schematic diagram of a sound recording apparatus according to an embodiment of the present invention. As shown in fig. 6, the sound recording apparatus 6 of this embodiment includes: a processor 60, a memory 61 and a computer program 62, such as a Java program, stored in said memory 61 and executable on said processor 60. The processor 60, when executing the computer program 62, implements the steps in the above-described respective holographic sound recording method embodiments, such as the steps 101 to 102 shown in fig. 1. Alternatively, the processor 60, when executing the computer program 62, implements the functions of the modules/units in the above-mentioned device embodiments, such as the functions of the modules 51 to 52 shown in fig. 5.
Illustratively, the computer program 62 may be partitioned into one or more modules/units that are stored in the memory 61 and executed by the processor 60 to implement the present invention. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which are used to describe the execution of the computer program 62 in the sound recording device 6.
The recording device 6 can be a desktop computer, a notebook computer, a palm computer, a mobile phone, an intelligent sound box and other recordable devices. The recording device may include, but is not limited to, a processor 60, a memory 61. Those skilled in the art will appreciate that fig. 6 is merely an example of the sound recording apparatus 6, and does not constitute a limitation on the sound recording apparatus 6, and may include more or less components than those shown, or some components in combination, or different components, for example, the sound recording apparatus may further include an input-output device, a network access device, a bus, etc.
The Processor 60 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The memory 61 may be an internal storage unit of the sound recording apparatus 6, such as a hard disk or a memory of the sound recording apparatus 6. The memory 61 may also be an external storage device of the sound recording apparatus 6, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like provided on the sound recording apparatus 6. Further, the memory 61 may also include both an internal storage unit and an external storage device of the audio recording apparatus 6. The memory 61 is used for storing the computer programs and other programs and data required by the sound recording apparatus. The memory 61 may also be used to temporarily store data that has been output or is to be output.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the above-mentioned division of the functional units and modules is illustrated, and in practical applications, the above-mentioned function distribution may be performed by different functional units and modules according to needs, that is, the internal structure of the apparatus is divided into different functional units or modules to perform all or part of the above-mentioned functions. Each functional unit and module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention. The specific working processes of the units and modules in the system may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiments provided in the present invention, it should be understood that the disclosed apparatus/terminal device and method may be implemented in other ways. For example, the above-described embodiments of the apparatus/terminal device are merely illustrative, and for example, the division of the modules or units is only one logical division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution medium, and the like. It should be noted that the computer readable medium may contain other components which may be suitably increased or decreased as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media which may not include electrical carrier signals and telecommunications signals in accordance with legislation and patent practice.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present invention, and are intended to be included within the scope of the present invention.

Claims (11)

1. A method of recording a holographic sound, comprising:
in the microphone array recording process, the angle of a target sound source is obtained through the microphone array;
recording and saving the angle and audio data of the target sound source.
2. The holographic sound recording method of claim 1, wherein the microphone arrays are two sets of microphone arrays having center points in a same plane, the method further comprising:
acquiring the distance between the two groups of microphone arrays and acquiring two angles of a target sound source relative to the two groups of microphone arrays;
and recording and storing the distance between the two groups of microphone arrays and the two angles, and determining the position coordinates of the target sound source according to the distance between the two groups of microphone arrays and the two angles.
3. The holographic sound recording method of claim 1, wherein the microphone array is a three-dimensional three or more sets of microphone arrays, the method further comprising:
acquiring relative positions among three or more groups of microphone arrays, and acquiring a plurality of angles of a target sound source relative to the three or more groups of microphone arrays;
recording and storing the relative position and the plurality of angles between the microphone arrays, and determining the position coordinates of the target sound source according to the relative position and the plurality of angles between the microphone arrays.
4. The holographic sound recording method of claim 3, wherein the recording and saving the relative position between the microphone arrays and the plurality of angles and determining the position information of the target sound source according to the relative position between the microphone arrays and the plurality of angles comprises:
if three-dimensional sound recording is carried out on the three microphone arrays, the central points of the three microphone arrays are located on the same plane, in a three-dimensional coordinate system, the microphone array 1 is used as the origin of the three-dimensional coordinate system, the microphone array 1 is vertical to the z axis, the microphone array 2 is located on the x axis and is vertical to the z axis, and the microphone array 3 is vertical to the y axis; the first angle of the target sound source obtained by the microphone array 1 is a1, the second angle of the target sound source obtained by the microphone array 2 is a2, the angle of the target sound source obtained by the microphone array 3 is a3, the distance between the first microphone array and the second microphone array is Mx, and the distance between the second microphone array and the third microphone array is My, then according to the formula:
Figure FDA0001902929740000021
and calculating to obtain the space coordinates (x, y, z) of the target sound source.
5. Holographic sound recording method according to any of claims 2-4,
and storing the position coordinates of the target sound source obtained through calculation according to the corresponding relation between the position coordinates of the target sound source and the audio data.
6. The holographic sound recording method of claim 1,
the acquisition frequency of the audio data is a first frequency, the acquisition frequency of the angle is a second frequency, and the one-to-one correspondence relationship between the angle and the audio data is established according to the second frequency; wherein the first frequency is greater than the second frequency.
7. Holographic sound recording method according to any of claims 1-3, characterized in that
Storing the audio data, and the angle information corresponding to the audio data or the relative position information between the multiple microphone arrays to the same file or different files;
if the audio data and the angle information corresponding to the audio data or the relative position information among the multiple microphone arrays are stored in different files, establishing a time index corresponding to the audio data;
and inquiring angle information corresponding to the audio data or relative position information among a plurality of microphone arrays in a specified time according to the time index.
8. A holographic sound recording apparatus, comprising:
the microphone array is used for acquiring the angle of a target sound source in the microphone array recording process;
and the data storage unit is used for recording and saving the angle and the audio data of the target sound source.
9. The holographic sound recording apparatus of claim 7, further comprising:
and the data processing unit is used for determining the position coordinates of the target sound source according to the distance between the two groups of microphone arrays and the two angles, or determining the position coordinates of the target sound source according to the relative positions between the multiple microphone arrays and the multiple angles.
10. Recording device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the steps of the method according to any of claims 1 to 7 are implemented when the computer program is executed by the processor.
11. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.
CN201811519792.8A 2018-12-12 2018-12-12 Holographic sound recording method and device and recording equipment Active CN111312295B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811519792.8A CN111312295B (en) 2018-12-12 2018-12-12 Holographic sound recording method and device and recording equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811519792.8A CN111312295B (en) 2018-12-12 2018-12-12 Holographic sound recording method and device and recording equipment

Publications (2)

Publication Number Publication Date
CN111312295A true CN111312295A (en) 2020-06-19
CN111312295B CN111312295B (en) 2022-06-28

Family

ID=71146581

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811519792.8A Active CN111312295B (en) 2018-12-12 2018-12-12 Holographic sound recording method and device and recording equipment

Country Status (1)

Country Link
CN (1) CN111312295B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112151080A (en) * 2020-10-28 2020-12-29 成都启英泰伦科技有限公司 Method for recording and processing training corpus
CN114442039A (en) * 2020-11-05 2022-05-06 中国移动通信集团山东有限公司 Sound source positioning method and device and electronic equipment
WO2022111579A1 (en) * 2020-11-27 2022-06-02 华为技术有限公司 Voice wakeup method and electronic device
WO2023212879A1 (en) * 2022-05-05 2023-11-09 北京小米移动软件有限公司 Object audio data generation method and apparatus, electronic device, and storage medium

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN202362450U (en) * 2011-12-04 2012-08-01 东华大学 Sound source positioning and visualizing system based on microphone array
CN104898091A (en) * 2015-05-29 2015-09-09 复旦大学 Microphone array self-calibration sound source positioning system based on iterative optimization algorithm
CN104991573A (en) * 2015-06-25 2015-10-21 北京品创汇通科技有限公司 Locating and tracking method and apparatus based on sound source array
CN105227743A (en) * 2015-08-25 2016-01-06 努比亚技术有限公司 A kind of method for recording, device and mobile terminal
CN105263093A (en) * 2015-10-12 2016-01-20 深圳东方酷音信息技术有限公司 Omnibearing audio acquisition apparatus, omnibearing audio editing apparatus, and omnibearing audio acquisition and editing system
CN106291469A (en) * 2016-10-18 2017-01-04 武汉轻工大学 A kind of three dimensions source of sound localization method and system
CN106872944A (en) * 2017-02-27 2017-06-20 海尔优家智能科技(北京)有限公司 A kind of sound localization method and device based on microphone array
CN107040843A (en) * 2017-03-06 2017-08-11 联想(北京)有限公司 The method and collecting device of same source of sound are obtained by two microphones
CN107820037A (en) * 2016-09-14 2018-03-20 南京中兴新软件有限责任公司 The methods, devices and systems of audio signal, image procossing
CN108802690A (en) * 2018-05-30 2018-11-13 大连民族大学 A kind of robot sonic location system and device based on microphone array

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN202362450U (en) * 2011-12-04 2012-08-01 东华大学 Sound source positioning and visualizing system based on microphone array
CN104898091A (en) * 2015-05-29 2015-09-09 复旦大学 Microphone array self-calibration sound source positioning system based on iterative optimization algorithm
CN104991573A (en) * 2015-06-25 2015-10-21 北京品创汇通科技有限公司 Locating and tracking method and apparatus based on sound source array
CN105227743A (en) * 2015-08-25 2016-01-06 努比亚技术有限公司 A kind of method for recording, device and mobile terminal
CN105263093A (en) * 2015-10-12 2016-01-20 深圳东方酷音信息技术有限公司 Omnibearing audio acquisition apparatus, omnibearing audio editing apparatus, and omnibearing audio acquisition and editing system
CN107820037A (en) * 2016-09-14 2018-03-20 南京中兴新软件有限责任公司 The methods, devices and systems of audio signal, image procossing
CN106291469A (en) * 2016-10-18 2017-01-04 武汉轻工大学 A kind of three dimensions source of sound localization method and system
CN106872944A (en) * 2017-02-27 2017-06-20 海尔优家智能科技(北京)有限公司 A kind of sound localization method and device based on microphone array
CN107040843A (en) * 2017-03-06 2017-08-11 联想(北京)有限公司 The method and collecting device of same source of sound are obtained by two microphones
CN108802690A (en) * 2018-05-30 2018-11-13 大连民族大学 A kind of robot sonic location system and device based on microphone array

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112151080A (en) * 2020-10-28 2020-12-29 成都启英泰伦科技有限公司 Method for recording and processing training corpus
CN112151080B (en) * 2020-10-28 2021-08-03 成都启英泰伦科技有限公司 Method for recording and processing training corpus
CN114442039A (en) * 2020-11-05 2022-05-06 中国移动通信集团山东有限公司 Sound source positioning method and device and electronic equipment
WO2022111579A1 (en) * 2020-11-27 2022-06-02 华为技术有限公司 Voice wakeup method and electronic device
WO2023212879A1 (en) * 2022-05-05 2023-11-09 北京小米移动软件有限公司 Object audio data generation method and apparatus, electronic device, and storage medium

Also Published As

Publication number Publication date
CN111312295B (en) 2022-06-28

Similar Documents

Publication Publication Date Title
CN111312295B (en) Holographic sound recording method and device and recording equipment
US8989401B2 (en) Audio zooming process within an audio scene
CN106872944B (en) Sound source positioning method and device based on microphone array
US20190278802A1 (en) Constructing an audio filter database using head-tracking data
CN107105384B (en) The synthetic method of near field virtual sound image on a kind of middle vertical plane
CN101401450A (en) Sound collection/reproduction method and device
CN113821190B (en) Audio playing method, device, equipment and storage medium
US20200217919A1 (en) Sound source distance estimation
CN111077496A (en) Voice processing method and device based on microphone array and terminal equipment
US7116788B1 (en) Efficient head related transfer function filter generation
CN110956973A (en) Echo cancellation method and device and intelligent terminal
JP4293986B2 (en) Method and system for representing a sound field
JP5000297B2 (en) System and method for determining a representation of a sound field
CN113314138B (en) Sound source monitoring and separating method and device based on microphone array and storage medium
CN111615045B (en) Audio processing method, device, equipment and storage medium
CN113889140A (en) Audio signal playing method and device and electronic equipment
CN110583030B (en) Incoherent idempotent ambisonics rendering
US11252525B2 (en) Compressing spatial acoustic transfer functions
CN113704374B (en) Spacecraft trajectory fitting method, device and terminal
CN112946576B (en) Sound source positioning method and device and electronic equipment
CN114765712A (en) Audio processing method, device, terminal and computer readable storage medium
CN113312971B (en) Parameter calibration method and device for microphone array, terminal equipment and storage medium
CN112565979B (en) Speaker frequency response numerical value calculation method and device, electronic equipment and storage medium
CN116228991B (en) Coordinate conversion method and device, electronic equipment and storage medium
US20220329961A1 (en) Methods and apparatus to expand acoustic rendering ranges

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant