WO2021102855A1 - Mobile platform, terminal device and control method therefor, and storage medium - Google Patents

Mobile platform, terminal device and control method therefor, and storage medium Download PDF

Info

Publication number
WO2021102855A1
WO2021102855A1 PCT/CN2019/121766 CN2019121766W WO2021102855A1 WO 2021102855 A1 WO2021102855 A1 WO 2021102855A1 CN 2019121766 W CN2019121766 W CN 2019121766W WO 2021102855 A1 WO2021102855 A1 WO 2021102855A1
Authority
WO
WIPO (PCT)
Prior art keywords
platform
terminal
sound
terminal device
sound file
Prior art date
Application number
PCT/CN2019/121766
Other languages
French (fr)
Chinese (zh)
Inventor
舒路
苏冠华
Original Assignee
深圳市大疆创新科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市大疆创新科技有限公司 filed Critical 深圳市大疆创新科技有限公司
Priority to PCT/CN2019/121766 priority Critical patent/WO2021102855A1/en
Priority to CN201980040354.XA priority patent/CN112292867A/en
Publication of WO2021102855A1 publication Critical patent/WO2021102855A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q5/00Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange
    • H04Q5/24Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange for two-party-line systems

Definitions

  • This specification relates to the field of movable platforms, and in particular to a movable platform, a terminal device, a control method thereof, and a storage medium.
  • Movable platforms such as mobile robots and unmanned aerial vehicles can usually communicate with terminal devices such as mobile phones and remote controls to realize functions such as command transmission and image transmission. But these functions are only limited to realize the interactive mode between the terminal equipment and the mobile platform, which is relatively simple.
  • this specification provides a mobile platform, a terminal device and its control method, and a storage medium, which can realize the interactive mode of voice transmission between the terminal device and the mobile platform, such as realizing processes such as voice intercom.
  • this specification provides a control method applied to a system composed of a terminal device and a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers;
  • the method includes:
  • the terminal device obtains a terminal sound file according to the environmental sound of the terminal device, and sends the terminal sound file to the mobile platform;
  • the mobile platform receives the terminal sound file sent by the terminal device, decodes the terminal sound file, and plays it;
  • the movable platform generates a platform sound file according to the environmental sound of the movable platform
  • the terminal device After receiving the platform sound file sent by the mobile platform, the terminal device decodes the platform sound file to obtain platform audio information, and plays the platform audio information.
  • this specification provides a control method for a terminal device, the terminal device is used to communicate with a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers;
  • the method includes:
  • this specification provides a control method, which is applied to a movable platform, the movable platform is used to communicate with a terminal device, and both the terminal device and the movable platform are provided with audio sensors and speakers ;
  • the method includes:
  • the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
  • this specification provides a terminal device, including an audio sensor, a speaker, a memory, and a processor;
  • the audio sensor is used to collect environmental sounds of the terminal device, and the speaker is used to play audio information;
  • the memory is used to store a computer program
  • the processor is configured to execute the computer program and, when executing the computer program, implement the following steps:
  • this specification provides a movable platform, including an audio sensor, a speaker, a memory, and a processor;
  • the audio sensor is used to collect environmental sounds of the movable platform, and the speaker is used to play audio information;
  • the memory is used to store a computer program
  • the processor is configured to execute the computer program and, when executing the computer program, implement the following steps:
  • the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
  • this specification provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the processor implements the above-mentioned control method.
  • the embodiments of this specification provide a movable platform, a terminal device and a control method thereof, and a storage medium.
  • the terminal device collects audio data of its surrounding environment and sends the audio data to the corresponding movable platform, so that the user can A sound is made through the movable platform at a place far away from the movable platform, for example, a person near the movable platform or other movable platforms are called.
  • collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform.
  • the sound scene of the environment where the platform is located can facilitate the user to interact with the surrounding environment more conveniently and intuitively, facilitate the user's control of the movable platform, and meet the user's purpose of transmitting voice information.
  • FIG. 1 is a schematic flowchart of a control method provided by an embodiment of this specification
  • FIG. 2 is a schematic diagram of an embodiment of a system composed of a terminal device and a movable platform
  • FIG. 3 is a schematic diagram of another embodiment of a system composed of a terminal device and a movable platform
  • Fig. 5 is a schematic diagram of an embodiment of a platform control interface of a terminal device
  • Figure 6 is a schematic diagram of the display mode of the intercom button in the platform control interface
  • FIG. 7 is a schematic diagram of the processing state of the environmental sound displayed on the terminal device.
  • FIG. 8 is a schematic diagram of another implementation manner of a platform control interface of a terminal device
  • FIG. 9 is a schematic diagram of an embodiment of a recording record list
  • FIG. 10 is a schematic flowchart of a control method provided by another embodiment of this specification.
  • FIG. 11 is a schematic block diagram of a terminal device according to an embodiment of the present specification.
  • Fig. 12 is a schematic block diagram of a movable platform provided by an embodiment of the present specification.
  • FIG. 1 is a schematic flowchart of a control method provided by an embodiment of this specification.
  • the control method can be applied to a system composed of a terminal device and a movable platform, and is used to implement processes such as voice intercom between the terminal device and the movable platform.
  • both the terminal device and the movable platform are provided with audio sensors and speakers.
  • terminal devices may include, for example, remote controls, mobile phones, tablet computers, notebook computers, desktop computers, personal digital assistants, and wearable devices, such as virtual reality (Virtual Reality, VR) glasses, FPV (First Person View, first-person view) At least one item of glasses, etc.;
  • the movable platform may be, for example, a movable robot, a robotic vehicle, an unmanned aerial vehicle, etc., and a movable robot is used as an example for schematic illustration.
  • the mobile robot 11 and the terminal device 13 may communicate, and the communication method may be wired communication or wireless communication. This embodiment takes wireless communication as an example.
  • the mobile platform and the terminal device may be directly connected in communication, or may be connected to each other through communication connections such as routers, servers, base stations, etc.
  • the mobile robot includes:
  • the robot body 110 includes a chassis main body 111 and a pan/tilt main body 112 provided on the chassis main body 111.
  • the pan/tilt main body 112 is used to carry the camera 101;
  • the power device 120 is arranged on the chassis body 111 and is used to provide moving power to the robot body 110;
  • the audio sensor and the speaker are arranged on the robot body 110, the audio sensor is used to collect environmental sound, and the speaker is used to play audio;
  • the communication device is provided on the robot body 110 and is used to communicate with the terminal device.
  • a launching device is provided on the mobile robot, and the launching device can be used to launch projectiles, and the size and shape of the projectiles are not specifically limited.
  • multiple mobile robots such as the mobile robot 11 and the mobile robot 12, launch projectiles or light beams through their respective launching devices to compete.
  • each mobile robot may also correspond to one terminal device, or multiple mobile robots correspond to one terminal device.
  • the mobile robot 11 corresponds to the terminal device 13 and the mobile robot 12 corresponds to the terminal device 14.
  • control method applied to a system composed of a terminal device and a movable platform may include step S110 to step S140.
  • the terminal device obtains a terminal sound file according to the environmental sound of the terminal device, and sends the terminal sound file to the movable platform.
  • the mobile platform receives the terminal sound file sent by the terminal device, decodes the terminal sound file, and plays it.
  • the movable platform generates a platform sound file according to the environmental sound of the movable platform.
  • the terminal device After receiving the platform sound file sent by the movable platform, the terminal device decodes the platform sound file to obtain platform audio information, and plays the platform audio information.
  • the mobile robot can be provided with an audio sensor, which can be used to collect audio data in the surrounding environment of the mobile robot.
  • the audio sensor can be a microphone. After the mobile robot collects audio data in the surrounding environment through a microphone, the audio data can be sent to the terminal device that communicates with the mobile robot.
  • the mobile robot 11 is in communication connection with the terminal device 13, and the mobile robot 11 collects audio data in the surrounding environment through a microphone, and then sends the audio data to the terminal device 13.
  • the audio data in the surrounding environment collected by the mobile robot 11 through a microphone may be derived from other mobile robots, for example, the mobile robot 12.
  • the audio data in the surrounding environment collected by the microphone on the mobile robot 11 may come from users of other mobile robot terminal devices.
  • the mobile robot 12 is in communication connection with the terminal device 14, and the mobile robot 11
  • the audio data collected by the microphone is the audio data of the terminal device 14 or the user who controls the terminal device 14.
  • the mobile robot 11 may send the collected audio data emitted by the mobile robot 12 and/or the audio data of the terminal device 14 corresponding to the mobile robot 12 or the user of the terminal device 14 to the terminal device 13.
  • the mobile robot 12 may also send the collected audio data from the mobile robot 11 and/or the terminal device 13 corresponding to the mobile robot 11 or the user's audio data of the terminal device 13 to the terminal device 14.
  • the terminal device may be provided with an audio sensor, and the audio sensor may be used to collect audio data in the surrounding environment of the terminal device, for example, audio data of a user of the terminal device. Further, the terminal device sends the user's audio data to the mobile robot that communicates with the terminal device.
  • the mobile robot 11 is in communication connection with the terminal device 13, and the terminal device 13 is provided with a microphone.
  • the microphone can be used to collect the audio data of the user of the terminal device 13.
  • the terminal device 13 sends the user's audio data to the user Mobile robot 11.
  • the terminal device 14 may also send the audio data of the user of the terminal device 14 to the mobile robot 12.
  • both the terminal device and the mobile robot are provided with audio sensors.
  • the mobile robot 11 is in communication connection with the terminal device 13, and the mobile robot 11 and the terminal device 13 are respectively provided with microphones, and the mobile robot 11 collects the audio data around the mobile robot 11 in real time through the microphone on the mobile robot 11, and The audio data is sent to the terminal device 13, and at the same time, the terminal device 13 collects the audio data of the user of the terminal device 13 in real time through the microphone on the terminal device 13 and sends the audio data to the mobile robot 11.
  • the mobile robot 11 is in communication connection with the terminal device 13, and the mobile robot 12 is in communication connection with the terminal device 14.
  • the mobile robot 11, the mobile robot 12, the terminal device 13 and the terminal device 14 are respectively A microphone is provided, and the terminal device 13 collects the audio data of the user of the terminal device 13 through the microphone on the terminal device 13, and sends the audio data to the mobile robot 11.
  • the mobile robot 11 may also be provided with a speaker.
  • the speaker can be used to play audio data of the user of the terminal device 13.
  • the microphone on the mobile robot 12 collects the audio data of the user of the terminal device 13 and moves to the position of the terminal device 14 to play the audio data of the user of the terminal device 13 to the user of the terminal device 14.
  • the mobile robot 12 can also receive the audio data of the user of the terminal device 14, and play the audio data of the user of the terminal device 14 through the speaker on the mobile robot 12. At the same time, the mobile robot 11 collects the terminal device 14 14 of the user’s audio data, and sent to the terminal device 13.
  • the control method provided by the above-mentioned embodiments of this specification collects the audio data of its surrounding environment through the terminal device and sends the audio data to the corresponding movable platform, so that the user can pass through the movable platform far away from the movable platform.
  • the platform emits sounds, for example, shouting to people near the movable platform or other movable platforms.
  • collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform. The sound scene of the environment where the platform is located.
  • the user of the terminal device can still feel the atmosphere of the battle scene, and the user of the terminal device of the mobile platform can send it back according to the mobile platform.
  • the audio accurately controls the movable platform. It is convenient for users to interact with the surrounding environment more conveniently and intuitively, which is conducive to the user's control of the movable platform and meets the user's purpose of transmitting voice information. In the networked competition of multiple mobile robots, the real-time audio-visual effects are increased, and the voice interaction fun of the players is improved.
  • FIG. 4 is a schematic flowchart of a control method provided by an embodiment of this specification.
  • the control method can be applied to a terminal device that is used to communicate with a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers.
  • control method of the embodiment of this specification includes step S210 to step S240.
  • S210 Acquire a terminal sound file according to the environmental sound of the terminal device.
  • the terminal device collects sound in the environment through an audio sensor, and encodes the collected sound data to obtain a terminal sound file. Reduce the amount of data sent by encoding, and improve the real-time performance of sound file sending.
  • the terminal device encodes the currently collected environmental sound in real time, and generates a terminal sound file in the corresponding encoding format after the recording ends. This improves the real-time performance of the terminal sound file sending to the mobile platform.
  • the terminal device encodes the currently collected sound data into OPUS data in real time, and generates a terminal sound file in OPUS format after the recording is finished.
  • control method further includes: displaying a platform control interface, and the platform control interface includes an intercom button.
  • the intercom button may also be a physical button set on a terminal device, such as a remote control, VR glasses, or FPV glasses.
  • the camera device mounted on the movable platform can transmit the collected images to the terminal device, and the terminal device can display the image taken by the movable platform on the platform control interface as shown in FIG. 5, so that the user can understand the movable platform.
  • the environment can be any convenient location.
  • an intercom button named megaphone is displayed on the upper right of the terminal equipment platform control interface, and the environmental sound of the terminal device can be controlled and collected through the intercom button.
  • the intercom control operation of the intercom button by the user can be enabled or disabled, and the display mode of the intercom button can be adjusted.
  • display mode A is the default display mode of the intercom button.
  • the user can press and hold the intercom button to trigger the terminal device to collect ambient sound.
  • the button can be displayed in display mode C.
  • the intercom button can be displayed in display mode B to prompt the user; after the user clicks the intercom button and releases it, adjust the display mode of the intercom button to display mode D, And close the user's intercom control operation on the intercom button; in this display mode D, even if the user presses the intercom button for a long time, the terminal device will not be triggered to collect environmental sounds.
  • the user can also click on the intercom button of display mode C to trigger the terminal device to display the intercom button in display mode A, so as to enable the user to control the intercom operation of the intercom button, such as allowing triggering by long pressing the intercom button
  • the terminal equipment collects environmental sounds.
  • the obtaining the terminal sound file according to the environmental sound of the terminal device includes: obtaining the environmental sound of the terminal device according to the intercom control operation of the intercom button by the user, and encoding the terminal sound file.
  • the user's intercom control operation on the intercom button is enabled, if it is detected that the duration of the user's pressing of the intercom button exceeds a preset threshold, such as 0.5 seconds, then start to obtain the environment of the terminal device The sound is encoded to obtain the terminal sound file.
  • a preset threshold such as 0.5 seconds
  • the environmental sound of the terminal device may be acquired during the time period after the intercom button is pressed to before the intercom button is released, and the terminal sound file may be obtained by encoding.
  • the terminal device can collect the environmental sound when the intercom button is pressed, and the user releases the intercom button to make the terminal device end the collection of the environmental sound.
  • the acquisition of the ambient sound of the terminal device may be stopped, and the acquired sound may be encoded to obtain the terminal sound file. It can limit the data volume of the terminal sound file, and prevent the transmission to the mobile platform from occupying too much time and affecting other operations, such as controlling the movement of the mobile robot.
  • the terminal device ends collecting environmental sounds.
  • the user can cancel the collection, encoding, and transmission of ambient sound by operating the intercom button. For example, when the user long presses the intercom button to record, if the terminal device detects the user's drag operation of the intercom button, for example, when the intercom button is pressed, the finger touches the position and slides away from the intercom button, then After collecting environmental sounds, you can also clear the encoded environmental sounds.
  • the terminal device when acquiring the environmental sound of the terminal device, the terminal device may display the sound spectrogram of the environmental sound on the platform control interface. So that the user can intuitively understand the current recording status.
  • the sound spectrogram of the environmental sound may be displayed on the platform control interface, and the sound may not be displayed when the environmental sound is not collected.
  • a preset threshold such as 0.5 seconds
  • the user can control the mobile robot 11 to move in front of other people or animals through the terminal device 13, and the mobile robot 11 can send the terminal sound file sent by the terminal device 13 after receiving and decoding the terminal sound file.
  • the mobile robot 11 plays to other people or animals, and realizes the function of transmitting sounds to other people or animals.
  • the mobile robot 11 is located within the sound transmission range of the mobile robot 12. After the mobile robot 11 receives and decodes the terminal sound file sent by the terminal device 13, the mobile robot 11 can play it to the mobile robot 12. Realize the function of transmitting sound to other mobile robots.
  • the mobile robot 12 may collect and send the sound played by the mobile robot 11, and then send the collected sound to the terminal device 14 communicatively connected with the mobile robot 12, and the terminal device 14 broadcasts the sound to the terminal device 14 user.
  • control method further includes: displaying the processing status of the environmental sound on the platform control interface, the processing status including at least one of silent recording, recording, transmission, and transmission completed . It can prompt the user of the processing status of the environment sound.
  • an image or animation of the processing state may be displayed in a certain area of the platform control interface.
  • the terminal device displays the sound spectrogram of the environmental sound on the platform control interface, prompting the user that the processing state of the environmental sound is recording.
  • the processing status of the environmental sound is not detected, the processing status of the environmental sound being collected, the processing status of the terminal sound file of the environmental sound being sent to the mobile platform, and the terminal sound file The processing status of the sending.
  • the platform control interface further includes a platform control button.
  • the platform control buttons may include, for example, a launch button, a camera button, a video button, a pedestrian follow button, a custom skill button, and the like.
  • the control method may further include: generating and sending a corresponding platform control instruction to the movable platform according to a button trigger operation of the platform control button by the user, so that the movable platform executes according to the platform control instruction Preset tasks.
  • a launching device is provided on the mobile robot, and the launching device can be used to launch projectiles. If the terminal device detects that the user triggers an operation on the launch button, it sends a launch instruction to the movable platform, and the movable robot can launch the projectile according to the launch instruction.
  • the user can operate the camera button and the video button to enable the terminal device to load the mobile platform with buttons and video instructions, so that the mobile platform can perform tasks such as photography and video recording through the mounted camera device.
  • the terminal device when the terminal device detects that the user triggers the operation of the pedestrian follow button, it sends a pedestrian follow instruction to the movable platform, and the movable platform follows the pedestrian target in the captured image according to the pedestrian follow instruction, and then shoots The image is sent to the terminal device.
  • the user can define tasks of the movable platform corresponding to the custom skill button, such as drift.
  • the terminal device detects that the user triggers an operation on the button of the custom skill button, it sends a drift instruction to the movable platform, and the movable platform implements the drift task according to the drift instruction.
  • the custom skill button can also be defined as a stun skill.
  • the tasks of the corresponding movable platform include releasing the skill to a certain movable platform and hitting the movable platform, and can control the hitting movable platform in the original The ground rotates and lasts for 1.5 seconds.
  • the custom skill button can also be defined as a blinding skill
  • the task of the corresponding movable platform is included within a preset time threshold
  • the display interface of the remote control terminal corresponding to the movable platform being hit is adjusted to The animation effect corresponding to the blinding skill.
  • Animation effects such as blurred, black or snowflake screens, block the image transmission screen, making it impossible for users to view the image transmission screen normally.
  • the preset time threshold may specifically be 1.5 seconds.
  • the custom skill button can also be defined as an electromagnetic interference skill, which can be launched by an infrared transmitter, and the image transmission of the movable platform that is hit is interfered for 2.5 seconds, and it can also be expressed as the FPV interface displayed as a flower screen effect.
  • the custom skill button can also be defined as a speed skill, and the tasks of the corresponding movable platform include obtaining a faster moving speed and lasting for 3 seconds.
  • the custom skill button can also be defined as an invincible skill.
  • the tasks of the corresponding movable platform include automatically canceling the skill effect released by the opponent, and obtaining a shield for 3 seconds so that the opponent cannot cause damage to it.
  • the platform control button may include a focusing button, such as the button at the bottom right of the interface in FIG. 5, which indicates that the focal length of the camera device of the current movable platform is 4 times.
  • the user can switch the focal length of the camera device of the movable platform to, for example, 1x, 2x, etc. by operating the focus button.
  • the platform control button may include a sound return button.
  • the terminal device detects that the user triggers an operation on the sound return button, it sends a sound return instruction to the movable platform, which can move The platform obtains the sound around the movable platform according to the sound return instruction and transmits the collected sound back to the terminal device.
  • the terminal device can, for example, play the sound around the movable platform.
  • control method may further include: acquiring a user's control voice, generating and sending a corresponding platform control instruction to the movable platform according to the control voice, so that the movable platform can be controlled according to the The platform control commands execute preset tasks.
  • the terminal device stores the mapping relationship data between the control voice and the platform control instruction, so that the user can send the corresponding platform control instruction to the movable platform through the voice control terminal device, so that the movable platform can perform the preset task. For example, if the terminal device detects the user's "launch" control instruction, it sends a launch instruction to the movable platform, and the movable platform can launch projectiles according to the launch instruction.
  • the obtaining the control voice of the user may include: obtaining the environmental sound of the terminal device, and detecting the control voice in the environmental sound.
  • the terminal device continuously monitors the sound of the environment in which it is located, and detects whether there is a control voice in the environment sound. Users can quickly input control commands by voice.
  • the obtaining the control voice of the user may include: obtaining the control voice uttered by the user when the user triggers the voice control function.
  • the display interface of the terminal device also displays a voice control button, and the user can make a control voice when pressing the voice control button. It can prevent the wrong detection of the control voice when the user does not need the voice control.
  • the mobile platform can collect the environmental sound of the mobile platform independently or according to the control of the terminal device, generate the platform sound file and send it to the terminal device, so that the terminal device can play the environmental sound of the mobile platform, which is convenient for
  • the user understands the environment in which the mobile platform is located. For example, the user of the terminal device can feel the atmosphere of the battle scene even if the user of the terminal device is not on the battle scene of the mobile robot; it can also make the user of the terminal device of the mobile robot accurately control the mobile robot according to the audio data. robot.
  • the terminal device is provided with a sound return button.
  • the terminal device detects that the user triggers an operation on the sound return button, it sends a sound return instruction to the movable platform, which can move
  • the platform collects the environmental sound of the movable platform according to the sound return instruction, generates the platform sound file and sends it to the terminal device. Therefore, the user can control to listen to or not listen to the ambient sound of the mobile platform through the terminal device.
  • the platform sound file is generated by the movable platform according to the voice of a person near the movable platform.
  • the user can control the mobile robot 11 to move in front of other people or animals through the terminal device 13, then the mobile robot 11 can collect the messages sent by the other people or animals that the mobile robot 11 faces.
  • the sound can be sent to the terminal device 13 after the platform sound file is generated, and the terminal device will be played to other people or animals that the mobile robot 11 faces.
  • the platform sound file is generated by the movable platform according to a sound originating from at least another movable platform.
  • another movable platform can make a sound such as a roaring sound autonomously or according to the control of a corresponding terminal device, and the movable platform can generate a platform sound file according to the sound made by the another movable platform.
  • the platform sound file is generated by the mobile platform according to a user's voice from a terminal device of at least another mobile platform.
  • the mobile robot 11 is located within the sound transmission range of the mobile robot 12, and the mobile robot 11 can play to the mobile robot 12 after receiving and decoding the terminal sound file sent by the terminal device 13;
  • the mobile robot 12 can collect and send the sound played by the mobile robot 11, and then send the collected sound to the terminal device 14 communicatively connected with the mobile robot 12, and the terminal device 14 will play the sound to the user of the terminal device 14.
  • control method may further include: the terminal device determines the playback target of the terminal sound file according to the user's object setting operation.
  • the sending the terminal sound file to the mobile platform so that the mobile platform decodes the terminal sound file and then plays it includes: sending the terminal sound file and the playback target The information is sent to the movable platform, so that the movable platform plays the terminal sound file when the playback object is recognized. In this way, the user can control the movable platform to play sound to the specified object.
  • the user can specify the playback object of the terminal sound file on the terminal device.
  • the playback object includes, for example, other movable platforms with designated marks, such as a QR code pattern or a team pattern, people or animals with designated facial features, etc. .
  • the designated mark and the like are also sent to the movable platform.
  • the movable platform can detect whether there is a playback object in the captured image, and if there is a playback object, it will play the corresponding terminal sound file to the playback object.
  • the terminal device may obtain an image taken by the movable platform from the movable platform, display the image, and then determine the playback object according to a user's selection operation of the playback object in the image.
  • the mobile platform sends the captured image to the terminal device for display, and the user can perform a selection operation in the image. For example, by clicking or box selecting a certain area in the image, the terminal device and the mobile platform can determine the playback object With some information, the movable platform can detect whether there is a playback object in the field of view of the camera device according to the selected area.
  • the terminal device may display the local image selected by the user according to the user's selection operation on the local image of the terminal device, and then determine the playback object according to the user's determining operation on the playback object in the local image.
  • the terminal device locally stores the team pattern of its own team or an image containing the team pattern
  • the team pattern can be selected so that the terminal device determines the pattern that the playback object needs to have; and the team pattern is sent to the mobile platform.
  • the mobile platform can detect whether there is a playback object in the field of view of the camera device based on the team pattern.
  • control method may further include: displaying a recording record list, and acquiring a terminal sound file corresponding to the recording record according to a user's playback control operation of the recording record in the recording record list. Then, the terminal sound file can be sent to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it. In this way, users can send terminal sound files to the mobile platform more quickly.
  • the terminal device can display a list of recording records on the platform control interface.
  • the user can control the terminal device to display the recording record list by clicking the corresponding button on the platform control interface.
  • the terminal device may also display the recording record list on other interfaces, which is not limited in this embodiment.
  • the corresponding close button is displayed in the upper right corner of the recording record list. When the user clicks the close button, the recording record list is closed.
  • the recording record list may include one or more recording records, and each recording record corresponds to a corresponding terminal sound file.
  • the terminal device stores a corresponding terminal sound file corresponding to each recording record in the recording record list.
  • the recording duration can be displayed at the corresponding position of each recording record, such as 10", etc., and the playback button can also be displayed on the left side of the recording duration. If the terminal device detects that the user has selected the playback button corresponding to a certain recording record The playback control operation is to obtain the terminal sound file corresponding to the recording record.
  • the length of the icon of each recording record in the recording record list can be adjusted according to the recording duration, for example, the longer the recording duration, the longer the icon.
  • the playback status and/or playback progress of the recording record may also be displayed in the recording record list.
  • the playback status and/or playback progress of the first recording can be displayed in the form of a progress bar.
  • the user can add a new recording record to the recording record list.
  • the terminal device may obtain and process the newly recorded sound according to the user's newly added recording operation to obtain a new terminal sound file, and update the corresponding recording record in the recording record list.
  • the user can press the "press and hold recording” button to trigger the terminal device to acquire and process the newly recorded sound to obtain a new terminal sound file, and update the corresponding recording record in the recording record list.
  • control method further includes: acquiring a terminal sound file corresponding to the recording record according to a user's loop playback operation of the recording record in the recording record list.
  • the loop button can be displayed on the right side of each recording. If the terminal device detects the user's playback control operation on the loop button corresponding to a certain recording record, it acquires the terminal sound file corresponding to the recording record. Then, the terminal sound file and the loop instruction may be sent to the movable platform, so that the movable platform decodes the terminal sound file and plays it in a loop. Therefore, the user can control the movable platform to repeatedly play the specified recording through the loop playback operation.
  • control method may further include: displaying a list of recording records; sending the information of the recording records to the mobile platform according to a user's playback control operation on the recording records in the recording record list, So that the mobile platform can play the terminal audio information corresponding to the recording record.
  • the mobile platform may pre-store a terminal sound file corresponding to each recording record in the recording record list, or terminal audio information obtained by decoding the terminal sound file. Therefore, the mobile platform can determine the terminal sound file or terminal audio information to be played according to the information recorded in the recording selected by the user on the terminal device.
  • control method may further include: according to a user's loop playback operation of the recording records in the recording record list, sending the recording record information and loop instructions to the movable platform, so that the The mobile platform cyclically plays the terminal audio information corresponding to the recording record. Therefore, the user can control the movable platform to repeatedly play the specified recording through the loop playback operation.
  • control method may further include: acquiring and processing the newly recorded sound to obtain a new terminal sound file according to the new recording operation of the user, and updating the corresponding recording record in the recording record list;
  • the terminal sound file is sent to the mobile platform, so that the mobile platform stores terminal audio information obtained by decoding the terminal sound file.
  • the terminal device may immediately send the terminal sound file to the mobile platform after obtaining the new terminal sound file.
  • the terminal device may send the terminal sound file to the mobile platform according to the user's playback control operation of the recording record corresponding to the new terminal sound file. For example, when the user performs a playback control operation or a loop playback operation on a new recording record for the first time, the terminal device sends the terminal sound file corresponding to the new recording record to the mobile platform.
  • the user can edit the recording records in the recording record list, and can also send the terminal sound file corresponding to the newly added recording record to the mobile platform so that the mobile platform can store the terminal sound file or terminal audio information corresponding to the newly added recording record.
  • the playback status and/or playback progress of the recording record may also be displayed in the recording record list.
  • the playback status and/or playback progress of the first recording can be displayed in the form of a progress bar.
  • FIG. 10 is a schematic flowchart of a control method provided by an embodiment of this specification.
  • the control method can be applied to a movable platform, the movable platform is used to communicate with a terminal device, and both the terminal device and the movable platform are provided with audio sensors and speakers.
  • control method of the embodiment of this specification includes step S310 to step S330.
  • the terminal sound file is generated by the terminal device after collecting the environmental sound of the terminal device.
  • the terminal device collects the sound in the environment through the audio sensor, encodes the collected sound data to obtain the terminal sound file, and then sends the terminal sound file to the mobile platform.
  • the terminal device may display a list of recording records, and then obtain the terminal sound file corresponding to the recording record according to the user's playback control operation on the recording record in the recording record list, so as to send the terminal sound file to the mobile platform .
  • control method further includes: if acquiring the information of the recording record sent by the terminal device, determining the terminal audio information corresponding to the recording record, and playing the terminal audio information corresponding to the recording record.
  • the mobile platform may pre-store the terminal sound file sent by the terminal device or the terminal audio information generated by decoding the terminal sound file. After receiving the playback instruction or the loop instruction sent by the terminal device according to the user's control operation of the recording record in the recording record list, the terminal audio information corresponding to the recording record is played.
  • the movable platform is equipped with a camera device
  • the control method further includes: if the information of the playback object sent by the terminal device is acquired, identifying in the image taken by the camera device according to the information The play object; if the play object is recognized, the corresponding terminal audio information is played to the play object.
  • the movable platform sends the image taken by the camera device to the terminal device, so that the terminal device determines the playback object according to the user's selection operation of the playback object in the image.
  • the mobile platform sends the captured image to the terminal device for display, and the user can perform a selection operation in the image. For example, by clicking or box selecting a certain area in the image, the terminal device and the mobile platform can determine the playback object With some information, the movable platform can detect whether there is a playback object in the field of view of the camera device according to the selected area.
  • the playing the terminal audio information in step S310 includes: if a loop instruction corresponding to the terminal audio information is obtained, playing the terminal audio information in a loop.
  • the terminal device detects that the user performs a playback control operation on a loop button corresponding to a certain recording record, it acquires the terminal sound file corresponding to the recording record. Then, the terminal sound file and the loop instruction may be sent to the movable platform, so that the movable platform decodes the terminal sound file and plays it in a loop. Therefore, the user can control the movable platform to repeatedly play the specified recording through the loop playback operation.
  • the terminal device sends the information of the recording record and the loop instruction to the mobile platform according to the user's circular playback operation of the recording record in the recording record list, so that the mobile platform can play the recording record in a loop Corresponding terminal audio information.
  • control method further includes: sending the playback status and/or playback progress of the terminal audio information to the terminal device.
  • the terminal device may display the playback status and/or playback progress of the terminal audio information on the interface, for example, display the playback status and/or playback progress of the corresponding recording record in the recording record list.
  • control method further includes: when playing the terminal audio information, adjusting the display parameters of the display device of the movable platform according to the terminal audio information.
  • the movable platform such as the movable robot, includes a display device, and the display device may include, for example, a ring-shaped light bar.
  • the mobile platform may adjust the display brightness of the display device according to the sound intensity of the terminal audio information; and/or adjust the flicker frequency of the display device according to the sound frequency of the terminal audio information. For example, when the sound intensity of the terminal audio information becomes stronger at a certain moment, the display brightness of the display device is brightened; the higher the sound frequency of the terminal audio information in a certain period of time, the higher the flicker frequency of the display device.
  • a display device such as a light bar, intuitively reminds the user that the mobile platform is playing terminal audio information.
  • S320 Generate a platform sound file according to the environmental sound of the movable platform.
  • the mobile platform can collect the environmental sound of the mobile platform independently or according to the control of the terminal device, generate the platform sound file and send it to the terminal device, so that the terminal device can play the environmental sound of the mobile platform, which is convenient for
  • the user understands the environment in which the mobile platform is located. For example, the user of the terminal device can feel the atmosphere of the battle scene even if the user of the terminal device is not on the battle scene of the mobile robot; it can also make the user of the terminal device of the mobile robot accurately control the mobile robot according to the audio data. robot.
  • the terminal device is provided with a sound return button.
  • the terminal device detects that the user triggers an operation on the sound return button, it sends a sound return instruction to the movable platform, which can move
  • the platform collects the environmental sound of the movable platform according to the sound return instruction, generates the platform sound file and sends it to the terminal device. Therefore, the user can control to listen to or not listen to the ambient sound of the mobile platform through the terminal device.
  • the movable platform acquires the environmental sound of the movable platform when the terminal audio information is not played. For example, when the mobile platform is playing the terminal audio information, it may not first acquire the environmental sound of the mobile platform, so as to avoid the interference of transmitting the played terminal audio information back to the terminal device for playback.
  • the mobile platform obtains the environmental sound of the mobile platform, and filters the played terminal audio information from the environmental sound, so as to avoid transmitting the played terminal audio information back to the terminal device. Interference caused by playback.
  • the platform sound file is generated by the movable platform according to the voice of a person near the movable platform.
  • the user can control the mobile robot 11 to move in front of other people or animals through the terminal device 13, then the mobile robot 11 can collect the messages sent by the other people or animals that the mobile robot 11 faces.
  • the sound can be sent to the terminal device 13 after the platform sound file is generated, and the terminal device will be played to other people or animals that the mobile robot 11 faces.
  • the movable platform may generate a platform sound file based on the sound of at least another movable platform and/or the sound of a user of at least another terminal device of the movable platform.
  • another movable platform can make a sound such as a roaring sound autonomously or according to the control of a corresponding terminal device, and the movable platform can generate a platform sound file according to the sound made by the another movable platform.
  • the mobile robot 11 is located within the sound transmission range of the mobile robot 12, and the mobile robot 11 can play to the mobile robot 12 after receiving and decoding the terminal sound file sent by the terminal device 13.
  • the mobile robot 12 can collect and send the sound played by the mobile robot 11, and then send the collected sound to the terminal device 14 communicatively connected with the mobile robot 12, which is played by the terminal device 14 to the user of the terminal device 14.
  • control method further includes: if the platform control instruction sent by the terminal device is acquired, execute a preset task according to the platform control instruction; wherein, the platform control instruction is the terminal device according to the user It is sent by the button trigger operation of the platform control button of the terminal device, or sent by the terminal device according to the control voice of the user.
  • a launching device is provided on the mobile robot, and the launching device can be used to launch projectiles. If the terminal device detects that the user triggers an operation on the launch button, it sends a launch instruction to the movable platform, and the movable robot can launch the projectile according to the launch instruction.
  • the terminal device stores the mapping relationship data between the control voice and the platform control instruction, so that the user can send the corresponding platform control instruction to the movable platform through the voice control terminal device, so that the movable platform can perform the preset task. For example, if the terminal device detects the user's "launch" control instruction, it sends a launch instruction to the movable platform, and the movable platform can launch the projectile according to the launch instruction.
  • the control method provided by the above-mentioned embodiments of this specification collects audio data of its surrounding environment through a terminal device and sends the audio data to the corresponding movable platform, so that the user can pass through the movable platform far away from the movable platform.
  • the platform emits sounds, for example, shouting to people near the movable platform or other movable platforms.
  • collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform.
  • the sound scene of the environment where the platform is located can facilitate the user to interact with the surrounding environment more conveniently and intuitively, facilitate the user's control of the movable platform, and meet the user's purpose of transmitting voice information.
  • Acquiring environmental sound in the above embodiments can be realized by collecting sound signals by a sound pickup device, and one of the collecting methods is as follows:
  • the sound signal collected by the sound pickup device is a sound signal generated by a standard source.
  • the standard sound source can play the sound signal generated by the standard sound source through its configured sound playback device, so as to transmit the sound signal generated by the standard sound source to the pickup device through the transmission medium in the space.
  • the sound pickup device collects the sound signal generated by the standard sound source, and converts the collected sound signal into an electrical signal. Further, the sound pickup device transmits the converted electric signal to the signal processing device of the sound pickup device.
  • the signal processing device may include an analog-to-digital conversion device. The analog-to-digital conversion device converts the electric signal transmitted by the sound pickup device into a first digital signal. signal.
  • the standard sound source and the sound pickup device in order to reduce the energy loss in the sound signal transmission process, can be placed in the same closed cavity; the distance between the standard sound source and the sound pickup device can also be limited to a certain Within the distance.
  • the certain distance range is, for example, [30cm, 1.5m].
  • sound insulation materials can be used to construct the sealed cavity.
  • the external sound signal refers to the sound signal produced by other sound sources other than the standard sound source.
  • the second digital signal may be pre-stored in a storage medium, and the signal processing device of the sound pickup device directly obtains the second digital signal from the storage medium.
  • the second digital signal may also be pre-stored in other smart terminals or servers, and the signal processing device of the sound pickup device obtains the second digital signal from other smart terminals or servers after establishing a communication connection with other smart terminals or servers.
  • the sound signal generated by the standard sound source includes one or more groups of sound signals, and each group of sound signals corresponds to one or more sound intensity values.
  • the multiple sound intensity values may be linearly distributed.
  • the signal processing device of the sound pickup device first obtains the first sampling point set corresponding to the first digital signal, and obtains the second sampling point set corresponding to the second digital signal; and then according to the first sampling point set and the first sampling point set A set of two sampling points to determine the conversion relationship between the first digital signal and the second digital signal.
  • the signal processing device of the sound pickup device determines the conversion relationship between the first digital signal and the second digital signal according to the first sampling point set and the second sampling point set: first according to the first set of sampling points and the second set of sampling points.
  • the sampling points in the sampling point set determine the first fitting curve; and the second fitting curve is determined according to the sampling points in the second sampling point set.
  • the signal processing device of the sound pickup device uses a polynomial fitting method to fit the sampling points in the first sampling point set to obtain the first fitting function and the first fitting curve.
  • the first fitting function is a function expression corresponding to the first fitting curve;
  • the first fitting curve is a curve with the best goodness of fit when fitting sampling points in the first sampling point set.
  • the polynomial fitting method can be used to fit the sampling points in the second sampling point set to obtain the second fitting function and the second fitting curve.
  • the second fitting function is a function expression corresponding to the second fitting curve;
  • the second fitting curve is a curve with the best goodness of fit when fitting sampling points in the second sampling point set.
  • the fitting method is not limited to polynomial fitting, and those skilled in the art can set the fitting method according to actual needs.
  • the signal processing device of the sound pickup device obtains the first target conversion relationship between the first fitting curve and the second fitting curve, and determines the first target conversion relationship as the one between the first digital signal and the second digital signal The conversion relationship between.
  • the first target conversion relationship makes the first fitting curve approach or coincide with the second fitting curve.
  • the first target conversion relationship may be determined according to the function conversion relationship between the first fitting function corresponding to the first fitting curve and the second fitting function corresponding to the second fitting curve.
  • the signal calibration parameters of the sound pickup device can be determined according to the function conversion relationship between the first fitting function and the second fitting function.
  • the signal processing device of the sound pickup device determines the signal calibration parameter of the sound pickup device according to the conversion relationship between the first digital signal and the second digital signal, and then saves the signal calibration parameter for the sound pickup device. After the device subsequently collects the sound signal, the digital signal corresponding to the sound signal collected by the sound pickup device is calibrated by using the signal calibration parameters to calibrate the sound signal collected by the sound pickup device.
  • the calibration accuracy of the sound pickup device can be improved, so that different sound pickup devices have the same or similar output levels for the same sound signal.
  • the following processing may be performed after the environmental sound is acquired in the above embodiment, which may specifically include:
  • Step S401 Use multiple pre-processing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, where each of the multiple pre-processing circuits includes an amplifier and an analog-to-digital converter, and each pre-processing circuit includes an amplifier and an analog-digital converter.
  • the analog gains of the amplifiers of the processing circuit are different from each other;
  • each pre-processing circuit includes an amplifier and an analog-to-digital converter.
  • the multiple pre-processing circuits include at least two pre-processing circuits; wherein the amplifier can be used for analog
  • the audio signal is power amplified, and the amplification factor is usually expressed by gain.
  • the analog gains of the amplifiers of the preprocessing circuits are different.
  • the dynamic range of the two preprocessing circuits with adjacent analog gains is at least partially Overlap; and analog-to-digital converters can be used to convert analog audio signals into digital audio signals to facilitate subsequent signal processing.
  • the analog audio signal x to be processed can be collected by a microphone, and then input into multiple pre-processing circuits in parallel, respectively, through power amplification with different gains, and converted into digital audio signals x1,...xi...,xI, that is, each The input of a pre-processing circuit is the same analog audio signal to be processed, and the digital audio signal output by each pre-processing circuit is different, thereby obtaining multiple digital audio signals.
  • the analog audio signal to be processed can be intercepted into different segments with a predetermined duration as a frame signal, or a predetermined number of sampled data can be used as a frame signal after analog-to-digital conversion.
  • the subsequent audio signal processing procedures are all It can be processed in units of one frame of signal. In order to ensure signal continuity, there may be a certain overlap between adjacent frame signals, that is, the tail of the previous frame signal and the head of the next frame signal have an overlap amount, thereby establishing the correlation between adjacent frames.
  • Step S402 Perform frequency domain conversion on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data.
  • frequency domain conversion is performed on each channel of digital audio signal in the multiple channels of digital audio signal, so as to obtain frequency domain data corresponding to each channel of digital audio signal.
  • the multiple channels of digital audio signals are subjected to frequency domain conversion to obtain multiple channels of frequency domain data, and the fusion of multiple channels of frequency domain data in the frequency domain can be realized.
  • the frequency domain conversion method can adopt Fourier transform (such as discrete Fourier transform), Laplace transform, Z transform, etc. The specific frequency domain conversion process will not be repeated here.
  • Step S403 Determine frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data.
  • each pre-processing circuit since each pre-processing circuit amplifies the same analog audio signal to be processed with different gains, each channel of audio signal has a different maximum and minimum value, and the maximum value of the audio signal with a larger gain. And the minimum value are relatively large, and the maximum and minimum values of the audio signal with a smaller gain are relatively small, and the dynamic range is the ratio of the maximum and minimum values of the audio signal without distortion.
  • the louder sound can be adjusted by A pre-processing circuit with a relatively large gain is provided, and a smaller sound can be provided by a pre-processing circuit with a relatively small gain, so that the dynamic system of the recording system can be improved, with higher sensitivity and lower noise at the same time.
  • the size of the sound can be measured by the energy feature information of the audio signal, such as the sound pressure level of the analog audio signal or digital audio signal, or the amplitude of the analog audio signal or digital audio signal, and so on.
  • Step S404 Convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
  • the frequency domain fusion data can be converted into a time domain audio signal.
  • the conversion method can adopt the inverse transform of the Fourier transform (such as the discrete Fourier transform), the Lap The inverse transform of the Lass transform, the inverse transform of the Z transform, etc., will not be repeated here.
  • the output audio signal can be obtained according to the time domain audio signal.
  • operations such as compression and noise reduction can be performed on the time domain audio signal.
  • the process of obtaining the output audio signal according to the time domain audio signal also needs to be spliced between the frame signals to establish the correlation between adjacent frames. Specifically, the time domain audio signal of the current frame and the previous frame time domain audio signal of the previous frame can be superimposed.
  • the audio signal processing method of this embodiment uses multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, wherein each of the multiple preprocessing circuits includes an amplifier and an analog audio signal. Digital converter, and the analog gains of the amplifiers of each preprocessing circuit are different; frequency domain conversion is performed on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data; according to the multiple channels of frequency domain data One or at least two channels of target frequency domain data determine frequency domain fusion data; convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
  • the method of this embodiment can effectively improve the dynamic range of the recording system, has high sensitivity, and can reduce the noise floor and meet the requirements of high signal-to-noise ratio.
  • the audio signal processing method further includes:
  • the energy characteristic information of the analog audio signal to be processed may be the sound pressure level of the analog audio signal or the amplitude of the analog audio signal, etc., and specifically may be the maximum, minimum, or instantaneous amplitude of the analog audio signal.
  • the intermediate value can also be the maximum, minimum, or intermediate value of the average amplitude of the analog audio signal in a short time (preset duration).
  • determining frequency domain fusion data according to one or at least two channels of frequency domain data among the multiple channels of frequency domain data includes:
  • Step S501 Determine one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information
  • Step S502 Determine frequency domain fusion data according to the one or at least two channels of target frequency domain data.
  • one or at least two channels of target frequency domain data to be fused can be determined from multiple channels of frequency domain data according to the energy characteristic information of the analog audio signal to be processed, for example, according to the simulation to be processed
  • the size of the energy feature information of the audio signal determines the number of target frequency domain data. For example, the greater the energy feature information, the greater the number of target frequency domain data; in addition, the reference energy feature parameters of the processing circuit can also be preset according to the energy feature information Compare and determine one or at least two channels of target frequency domain data to be fused.
  • Each of the multiple preset processing circuits corresponds to a different reference energy characteristic parameter.
  • the reference energy characteristic parameter is included by the preprocessing circuit.
  • the reference energy characteristic parameter and the energy characteristic information belong to the same parameter, that is, the sound pressure level or amplitude of the digital audio signal output by the preprocessing circuit, which can be specified without distortion
  • the maximum, minimum or intermediate value of the instantaneous amplitude of the digital audio signal, or the maximum, minimum or intermediate value of the average amplitude of the digital audio signal in a short time (preset duration), when the preprocessing circuit includes the amplifier circuit The larger the simulation gain of, the larger the corresponding reference energy characteristic parameter.
  • the determining one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information includes:
  • the first target frequency domain data and the second target frequency domain data are determined from the multiple channels of frequency domain data according to the energy feature information and multiple reference energy feature parameters, where the multiple reference energy feature parameters are based on the multiple frequency domain data.
  • the analog gain of the amplifying circuit included in each preprocessing circuit is determined.
  • At least two channels of target frequency domain data are determined from multiple channels of frequency domain data, where the first target frequency domain data may be only one channel of target frequency domain data, and of course, it may also be more than one channel of target frequency domain data; Similarly, the second target frequency domain data may also be only one channel of target frequency domain data or more than one channel of target frequency domain data.
  • the determining the first target frequency domain data and the second target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information may specifically include:
  • Step S601 Determine a first reference energy characteristic parameter and a second reference energy characteristic parameter adjacent to the energy characteristic information from a plurality of reference energy characteristic parameters;
  • Step S602 Determine first target frequency domain data and second target frequency domain data from the multiple channels of frequency domain data according to the first reference energy characteristic parameter and the second reference energy characteristic parameter;
  • the first target frequency domain data and the second target frequency domain data are obtained by performing the frequency domain conversion on the first digital audio signal and the second digital audio signal in the multi-channel digital audio signal, respectively.
  • a digital audio signal and a second digital audio signal are obtained from the analog audio signal to be processed by a first preprocessing circuit and a second preprocessing circuit corresponding to the first reference energy characteristic parameter and the second reference energy characteristic parameter, respectively .
  • Step S603 Determine frequency domain fusion data according to the first target frequency domain data and the second target frequency domain data.
  • the energy of the analog audio signal to be processed is selected from the reference energy characteristic parameters of multiple preset processing circuits (indicated by L1, L2,..., LI, where I is the number of preset processing circuits)
  • the feature information represented by Lc
  • Lc is adjacent to the first reference energy feature parameter ( represented by L i′ , where 1 ⁇ i′ ⁇ I-1) and the second reference energy feature parameter (represented by L i′+1 ), That is, Lc is between L i′ and L i′+1 , where the first reference energy characteristic parameter L i′ corresponds to the first preprocessing circuit, and the first digital audio signal output by the first preprocessing circuit undergoes frequency domain conversion
  • the obtained frequency domain data is the first target frequency domain data
  • the second reference energy characteristic parameter L i′+1 corresponds to the second preprocessing circuit
  • the second digital audio signal output by the second preprocessing circuit passes through the frequency domain.
  • the converted frequency domain data is the second target frequency domain data, and the frequency domain fusion data can be obtained by performing superposition operation
  • the frequency domain data obtained by the frequency domain conversion of the digital audio signal output by the preprocessing circuit is the first target frequency domain data, which can also be L i′+1 and L i′+2 (in this case, i′ ⁇ I- 1)
  • the frequency domain data obtained by frequency domain conversion of the digital audio signal output by the corresponding preprocessing circuit is the second target frequency domain data.
  • the first target frequency domain data and the second target frequency domain data may also include more For multiple channels of frequency domain data, no examples are given here.
  • the determining one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information includes:
  • Step S701 When the energy feature information is less than the smallest third reference energy feature parameter among the multiple reference energy feature parameters, determine from the multiple channels of frequency domain data according to the third reference energy feature parameter The third target frequency domain data;
  • the third target frequency domain data is obtained by performing the frequency domain conversion on a third digital audio signal in the multi-channel digital audio signal, and the third digital audio signal is corresponding to a third reference energy characteristic parameter Obtained by the third preprocessing circuit on the analog audio signal to be processed;
  • Step S702 Acquire the frequency domain fusion data according to the third target frequency domain data.
  • the smallest third reference energy characteristic parameter L1 is greater than the energy characteristic information of the analog audio signal to be processed Lc, that is, Lc is less than L1, where the third reference energy characteristic parameter L1 corresponds to the third digital audio signal output by the third preprocessing circuit, and the frequency domain data obtained by frequency domain conversion is the third target frequency domain data, and then Acquire the frequency domain fusion data according to the third target frequency domain data.
  • the digital audio signal output by the preprocessing circuit corresponding to L1 and L2 can be obtained by frequency domain conversion.
  • the frequency domain data is the third target frequency domain data.
  • the third target frequency domain data may also include more channels of frequency domain data, and no examples are given here.
  • the fusing the multiple channels of frequency domain data to obtain frequency domain fusion data further includes:
  • Step S801 When the energy feature information is greater than the largest fourth reference energy feature parameter among the multiple reference energy feature parameters, determine from the multiple channels of frequency domain data according to the fourth reference energy feature parameter The fourth target frequency domain data;
  • the fourth target frequency domain data is obtained by performing the frequency domain conversion on the fourth digital audio signal in the multi-channel digital audio signal, and the fourth digital audio signal is corresponding to the fourth reference energy characteristic parameter Obtained by the fourth preprocessing circuit on the analog audio signal to be processed;
  • Step S802 Acquire the frequency domain fusion data according to the fourth target frequency domain data.
  • the largest fourth reference energy characteristic parameter LI is less than the energy characteristic information of the analog audio signal to be processed Lc, that is, Lc is greater than LI, where the fourth reference energy characteristic parameter LI corresponds to the fourth digital audio signal output by the fourth preprocessing circuit and the frequency domain data obtained by frequency domain conversion is the fourth target frequency domain data, and then The frequency domain fusion data is acquired according to the fourth target frequency domain data.
  • the digital audio signal output by the preprocessing circuit corresponding to LI and LI-1 is converted into the frequency domain.
  • the obtained frequency-domain data is the fourth target frequency-domain data.
  • the fourth target frequency-domain data may also include more channels of frequency-domain data, which will not be illustrated here.
  • the energy feature information Lc of the analog audio signal to be processed may be first determined. Compare with the reference energy characteristic parameters of multiple preset processing circuits, if Lc is less than (or equal to) the third reference energy characteristic parameter L1, perform steps 701-702; if Lc is greater than (or equal to) the fourth reference energy characteristic If the parameter LI is between steps 801-802; if Lc is between the adjacent third reference energy characteristic parameter L1 and the fourth reference energy characteristic parameter LI, then steps 601-603 are performed.
  • the determining frequency domain fusion data according to the one or at least two channels of target frequency domain data includes:
  • the frequency domain fusion data is obtained by performing a spectrum superposition operation on the frequency domain data.
  • the acquiring frequency domain fusion data according to the one or more channels of frequency domain data includes:
  • performing the superposition operation can set weights for the first target frequency domain data and the second target frequency domain data, by superposing the first target frequency domain data and the second target frequency domain data with different weights. , You can get frequency-domain fusion data with different dynamic ranges.
  • each of the plurality of preset processing circuits corresponds to a different reference energy characteristic parameter
  • the plurality of reference energy characteristic parameters are determined according to the analog gain of the amplifying circuit included in the plurality of preprocessing circuits , wherein the weights corresponding to the first target frequency domain data and the second target frequency domain data are based on the first preprocessing circuit and the first preprocessing circuit corresponding to the first target frequency domain data among the plurality of preset processing circuits
  • the reference energy characteristic parameter of the second preprocessing circuit corresponding to the second target frequency domain data is determined.
  • the weights corresponding to the first target frequency domain data and the second target frequency domain data are determined by the reference energy characteristic parameters of the corresponding pre-processing circuit. More specifically, the weights corresponding to the first target frequency domain data can be determined.
  • the magnitude relationship between the first reference energy feature parameter Li and the second reference energy feature parameter Li+1 corresponding to the second target frequency domain data and the energy feature information Lc of the analog audio signal to be processed is determined, wherein the reference energy feature parameter is about If the energy characteristic information Lc is close to the analog audio signal, the greater the weight, for example, Lc is closer to Li, it means that the analog audio signal is closer to the digital audio signal of the preprocessing circuit corresponding to Li, and the first target frequency corresponding to Li needs to be increased.
  • the weight of the domain data In this embodiment, the weight a1 of the first target frequency domain data and the weight a2 of the second target frequency domain data can be determined by the following formula:
  • the determining frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data includes:
  • Step S901 Perform compression processing on the one or at least two channels of target frequency domain data according to the compression coefficient corresponding to each channel of the one or at least two channels of target frequency domain data;
  • Step S902 Acquire frequency domain fusion data according to one or more channels of frequency domain data after the compression processing.
  • the target frequency domain data needs to be compressed to avoid the fusion frequency domain fusion data from exceeding the digital quantization range.
  • the step of compressing the target frequency domain data can be performed simultaneously with the above-mentioned superposition operation, or can be completed before the superposition operation.
  • the compression processing is linear compression processing.
  • the compression coefficient corresponding to each channel of frequency domain data of the one or more channels of frequency domain data is determined according to the analog gain of the amplifier included in the preprocessing circuit corresponding to each channel of frequency domain data.
  • the compression coefficient corresponding to any channel of frequency domain data can be the product of the channel equalization parameter and the scaling factor of the corresponding preprocessing circuit.
  • a certain preprocessing circuit is used as a reference preprocessing circuit, and the The channel equalization parameter is the ratio of the analog gain of the preprocessing circuit to the analog gain of the reference preprocessing circuit, and the scaling factor is obtained according to the size of the digital audio signal output by the preprocessing circuit.
  • the compression coefficient corresponding to any channel of frequency domain data can be obtained by the following formula:
  • G i′ is the analog gain of the preprocessing circuit
  • G ref is the reference preprocessing
  • is the scaling factor, used to scale the frequency domain data.
  • steps S901-902 can be executed only when the frequency domain fusion data exceeds the digital quantization range, or it can be judged before step S901 whether the frequency domain fusion data has the possibility of exceeding the digital quantization range, if there is a possibility that the frequency domain fusion data exceeds the digital quantization range. Steps S901-902 are executed only when the range is possible.
  • the time domain audio signal is the current frame time domain audio signal
  • the step S404 in the foregoing embodiment described in step S404 obtaining the output audio signal according to the time domain audio signal includes:
  • the output audio signal is determined according to the time domain fusion audio signal of the current frame.
  • the above-mentioned audio signal processing procedures are performed in units of one frame signal.
  • there may be a certain overlap between adjacent frame signals that is, the tail of the previous frame signal and the next
  • the header of the frame signal has an overlap amount, thereby establishing the correlation between adjacent frames. Therefore, after converting the frequency domain fusion data of the current frame into the time domain audio signal of the current frame in S404, the overlapping part of the time domain audio signal of the current frame and the time domain audio signal of the previous frame can be overlapped and superimposed.
  • the non-overlapping portion of the time domain audio signal and the previous frame of time domain audio signal is not superimposed, so as to obtain the current frame time domain fused audio signal, and the output audio signal can be determined according to the current frame time domain fused audio signal.
  • FIG. 11 is a schematic block diagram of a terminal device 600 according to an embodiment of this specification.
  • the terminal device 600 includes a processor 601 and a memory 602, and also includes an audio sensor 603 and a speaker 604.
  • the audio sensor 603 is used to collect environmental sounds of the terminal device 600, and the speaker 604 is used to play audio information.
  • the processor 601 and the memory 602 are connected by a bus 605, and the bus 605 is, for example, an I2C (Inter-integrated Circuit) bus.
  • I2C Inter-integrated Circuit
  • the processor 601 may be a micro-controller unit (MCU), a central processing unit (Central Processing Unit, CPU), a digital signal processor (Digital Signal Processor, DSP), or the like.
  • MCU micro-controller unit
  • CPU Central Processing Unit
  • DSP Digital Signal Processor
  • the memory 602 may be a Flash chip, a read-only memory (ROM, Read-Only Memory) disk, an optical disk, a U disk, or a mobile hard disk.
  • the processor 601 is used to run a computer program stored in the memory 602, and implement the aforementioned control method for terminal equipment when the computer program is executed.
  • the processor 601 is configured to run a computer program stored in the memory 602, and implement the following steps when the computer program is executed:
  • the embodiments of this specification also provide a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer program includes program instructions, and the processor executes the program instructions to implement the foregoing implementation
  • the example provides the steps of the control method for terminal equipment.
  • the computer-readable storage medium may be the internal storage unit of the terminal device described in any of the foregoing embodiments, such as the hard disk or memory of the terminal device.
  • the computer-readable storage medium may also be an external storage device of the terminal device, such as a plug-in hard disk equipped on the terminal device, a smart memory card (Smart Media Card, SMC), and Secure Digital (SD). ) Card, Flash Card, etc.
  • FIG. 12 is a schematic block diagram of a movable platform 700 according to an embodiment of the present specification.
  • the mobile platform 700 includes a processor 701 and a memory 702, and also includes an audio sensor 703 and a speaker 704.
  • the audio sensor 703 is used to collect environmental sounds of the movable platform 700, and the speaker 704 is used to play audio information.
  • the processor 701 and the memory 702 are connected by a bus 705, and the bus 705 is, for example, an I2C (Inter-integrated Circuit) bus.
  • I2C Inter-integrated Circuit
  • the processor 701 may be a micro-controller unit (MCU), a central processing unit (Central Processing Unit, CPU), a digital signal processor (Digital Signal Processor, DSP), or the like.
  • MCU micro-controller unit
  • CPU Central Processing Unit
  • DSP Digital Signal Processor
  • the memory 702 may be a Flash chip, a read-only memory (ROM, Read-Only Memory) disk, an optical disk, a U disk, or a mobile hard disk.
  • the processor 701 is configured to run a computer program stored in the memory 702, and implement the aforementioned control method for a movable platform when the computer program is executed.
  • the movable platform may be a movable robot, and the movable robot may include:
  • the robot body 110 includes a chassis main body 111 and a pan/tilt main body 112 provided on the chassis main body 111.
  • the pan/tilt main body 112 is used to carry the camera 101;
  • the power device 120 is arranged on the chassis body 111 and is used to provide moving power to the robot body 110;
  • An audio sensor and a speaker are arranged on the robot body 110, the audio sensor is used to collect environmental sounds, and the speaker is used to play audio;
  • the communication device is provided on the robot body 110 and is used to communicate with the terminal device.
  • the processor 701 is configured to run a computer program stored in the memory 702, and implement the following steps when the computer program is executed:
  • the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
  • the embodiments of this specification also provide a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer program includes program instructions, the computer program is executed by a processor to cause the processing
  • the device implements the steps of the control method for the movable platform provided in the above embodiment.
  • the computer-readable storage medium may be the movable platform described in any of the foregoing embodiments, such as an internal storage unit of a movable robot, for example, a hard disk or memory of the movable robot.
  • the computer-readable storage medium may also be an external storage device of the movable platform, for example, a plug-in hard disk equipped on the movable platform, a smart memory card (Smart Media Card, SMC), and Secure Digital (Secure Digital). , SD) card, flash card (Flash Card), etc.
  • the mobile platform, terminal device and its control method, and storage medium provided in the above-mentioned embodiments of this specification collect audio data of its surrounding environment through the terminal device and send the audio data to the corresponding mobile platform, so that users can A place far away from the movable platform makes a sound through the movable platform, for example, shouting to a person near the movable platform or other movable platforms.
  • collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform.
  • the sound scene of the environment where the platform is located can facilitate the user to interact with the surrounding environment more conveniently and intuitively, facilitate the user's control of the movable platform, and meet the user's purpose of transmitting voice information.

Abstract

A mobile platform, a terminal device and a control method therefor, and a storage medium. The method comprises: a terminal device acquires a terminal sound file according to environmental sound and sends the terminal sound file to a mobile platform (S110); the mobile platform decodes the terminal sound file and then performs playback (S120); the mobile platform generates a platform sound file according to environmental sound (S130); and after receiving the platform sound file, the terminal device plays back platform audio information obtained by decoding the platform sound file (S140).

Description

可移动平台、终端设备及其控制方法、存储介质Movable platform, terminal equipment and its control method and storage medium 技术领域Technical field
本说明书涉及可移动平台领域,尤其涉及一种可移动平台、终端设备及其控制方法、存储介质。This specification relates to the field of movable platforms, and in particular to a movable platform, a terminal device, a control method thereof, and a storage medium.
背景技术Background technique
随着科技的发展,可移动平台逐渐具有越来越多的应用。可移动机器人、无人机等可移动平台通常可以与手机、遥控器等终端设备进行通信,实现指令传输、图像传输等功能。但这些功能只限于实现终端设备和可移动平台之间的交互方式,比较单一。With the development of technology, mobile platforms gradually have more and more applications. Movable platforms such as mobile robots and unmanned aerial vehicles can usually communicate with terminal devices such as mobile phones and remote controls to realize functions such as command transmission and image transmission. But these functions are only limited to realize the interactive mode between the terminal equipment and the mobile platform, which is relatively simple.
发明内容Summary of the invention
基于此,本说明书提供了一种可移动平台、终端设备及其控制方法、存储介质,可以实现终端设备和可移动平台之间的语音传输的交互方式,例如实现语音对讲等过程。Based on this, this specification provides a mobile platform, a terminal device and its control method, and a storage medium, which can realize the interactive mode of voice transmission between the terminal device and the mobile platform, such as realizing processes such as voice intercom.
第一方面,本说明书提供了一种控制方法,应用于一终端设备和一可移动平台构成的系统,所述终端设备和所述可移动平台均设有音频传感器和扬声器;In the first aspect, this specification provides a control method applied to a system composed of a terminal device and a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers;
所述方法包括:The method includes:
所述终端设备根据所述终端设备的环境声音获取终端声音文件,并将所述终端声音文件向所述可移动平台发送;The terminal device obtains a terminal sound file according to the environmental sound of the terminal device, and sends the terminal sound file to the mobile platform;
所述可移动平台接收所述终端设备发送的终端声音文件,并解码所述终端声音文件后进行播放;The mobile platform receives the terminal sound file sent by the terminal device, decodes the terminal sound file, and plays it;
所述可移动平台根据所述可移动平台的环境声音,生成平台声音文件;The movable platform generates a platform sound file according to the environmental sound of the movable platform;
所述终端设备接收所述可移动平台发送的平台声音文件后,解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。After receiving the platform sound file sent by the mobile platform, the terminal device decodes the platform sound file to obtain platform audio information, and plays the platform audio information.
第二方面,本说明书提供了一种控制方法,用于终端设备,所述终端设备用于与一可移动平台进行通信,所述终端设备和所述可移动平台均设有音频传感器和扬声器;In the second aspect, this specification provides a control method for a terminal device, the terminal device is used to communicate with a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers;
所述方法包括:The method includes:
根据所述终端设备的环境声音获取终端声音文件;Acquiring a terminal sound file according to the environmental sound of the terminal device;
将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音 文件后播放;Sending the terminal sound file to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it;
获取所述可移动平台发送的平台声音文件,所述平台声音文件由所述可移动平台采集所述可移动平台的环境声音后生成;Acquiring a platform sound file sent by the mobile platform, the platform sound file being generated by the mobile platform after collecting the environmental sound of the mobile platform;
解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。Decoding the platform sound file to obtain platform audio information, and playing the platform audio information.
第三方面,本说明书提供了一种控制方法,应用于可移动平台,所述可移动平台用于与一终端设备进行通信,所述终端设备和所述可移动平台均设有音频传感器和扬声器;In the third aspect, this specification provides a control method, which is applied to a movable platform, the movable platform is used to communicate with a terminal device, and both the terminal device and the movable platform are provided with audio sensors and speakers ;
所述方法包括:The method includes:
获取所述终端设备发送的终端声音文件,并解码所述终端声音文件后生成终端音频信息,播放所述终端音频信息;其中,所述终端声音文件由所述终端设备采集所述终端设备的环境声音后生成;Acquire the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
根据所述可移动平台的环境声音,生成平台声音文件;Generate a platform sound file according to the environmental sound of the movable platform;
将所述平台声音文件向所述终端设备发送,以使所述终端设备解码所述平台声音文件后播放。Send the platform sound file to the terminal device, so that the terminal device decodes the platform sound file and plays it.
第四方面,本说明书提供了一种终端设备,包括音频传感器、扬声器、存储器和处理器;In the fourth aspect, this specification provides a terminal device, including an audio sensor, a speaker, a memory, and a processor;
所述音频传感器用于采集所述终端设备的环境声音,所述扬声器用于播放音频信息;The audio sensor is used to collect environmental sounds of the terminal device, and the speaker is used to play audio information;
所述存储器用于存储计算机程序;The memory is used to store a computer program;
所述处理器,用于执行所述计算机程序并在执行所述计算机程序时,实现如下步骤:The processor is configured to execute the computer program and, when executing the computer program, implement the following steps:
根据所述终端设备的环境声音获取终端声音文件;Acquiring a terminal sound file according to the environmental sound of the terminal device;
将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放;Sending the terminal sound file to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it;
获取所述可移动平台发送的平台声音文件,所述平台声音文件由所述可移动平台采集所述可移动平台的环境声音后生成;Acquiring a platform sound file sent by the mobile platform, the platform sound file being generated by the mobile platform after collecting the environmental sound of the mobile platform;
解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。Decoding the platform sound file to obtain platform audio information, and playing the platform audio information.
第五方面,本说明书提供了一种可移动平台,包括音频传感器、扬声器、存储器和处理器;In the fifth aspect, this specification provides a movable platform, including an audio sensor, a speaker, a memory, and a processor;
所述音频传感器用于采集所述可移动平台的环境声音,所述扬声器用于播放音频信息;The audio sensor is used to collect environmental sounds of the movable platform, and the speaker is used to play audio information;
所述存储器用于存储计算机程序;The memory is used to store a computer program;
所述处理器,用于执行所述计算机程序并在执行所述计算机程序时,实现如下步骤:The processor is configured to execute the computer program and, when executing the computer program, implement the following steps:
获取所述终端设备发送的终端声音文件,并解码所述终端声音文件后生成终端音频信 息,播放所述终端音频信息;其中,所述终端声音文件由所述终端设备采集所述终端设备的环境声音后生成;Acquire the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
根据所述可移动平台的环境声音,生成平台声音文件;Generate a platform sound file according to the environmental sound of the movable platform;
将所述平台声音文件向所述终端设备发送,以使所述终端设备解码所述平台声音文件后播放。Send the platform sound file to the terminal device, so that the terminal device decodes the platform sound file and plays it.
第六方面,本说明书提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时使所述处理器实现上述的控制方法。In a sixth aspect, this specification provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the processor implements the above-mentioned control method.
本说明书实施例提供了一种可移动平台、终端设备及其控制方法、存储介质,通过终端设备采集其周围环境的音频数据,并将该音频数据发送给对应的可移动平台,可以使得用户可以在距离可移动平台较远的地方通过可移动平台发出声音,例如对该可移动平台附近的人或者其他可移动平台喊话。另外,通过可移动平台采集其周围环境的音频数据,并将该音频数据发送给该可移动平台的终端设备,可使得该终端设备的用户即使不在可移动平台的附近,也可以收听到可移动平台所处环境的声音场景,可以方便用户更加便捷和直观与周围环境进行互动,有利于用户对可移动平台的控制,满足用户传递语音信息的目的。The embodiments of this specification provide a movable platform, a terminal device and a control method thereof, and a storage medium. The terminal device collects audio data of its surrounding environment and sends the audio data to the corresponding movable platform, so that the user can A sound is made through the movable platform at a place far away from the movable platform, for example, a person near the movable platform or other movable platforms are called. In addition, collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform. The sound scene of the environment where the platform is located can facilitate the user to interact with the surrounding environment more conveniently and intuitively, facilitate the user's control of the movable platform, and meet the user's purpose of transmitting voice information.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本说明书的公开内容。It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and cannot limit the disclosure of this specification.
附图说明Description of the drawings
为了更清楚地说明本说明书实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本说明书的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of this specification more clearly, the following will briefly introduce the drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of this specification. Ordinary technicians can obtain other drawings based on these drawings without creative work.
图1是本说明书一实施例提供的一种控制方法的流程示意图;FIG. 1 is a schematic flowchart of a control method provided by an embodiment of this specification;
图2是终端设备和可移动平台构成系统的一实施方式的场景示意图;FIG. 2 is a schematic diagram of an embodiment of a system composed of a terminal device and a movable platform;
图3是终端设备和可移动平台构成系统的另一实施方式的场景示意图;FIG. 3 is a schematic diagram of another embodiment of a system composed of a terminal device and a movable platform;
图4是本说明书另一实施例提供的一种控制方法的流程示意图;4 is a schematic flowchart of a control method provided by another embodiment of this specification;
图5为终端设备的平台控制界面的一实施方式的示意图;Fig. 5 is a schematic diagram of an embodiment of a platform control interface of a terminal device;
图6是平台控制界面中对讲按钮的显示方式示意图;Figure 6 is a schematic diagram of the display mode of the intercom button in the platform control interface;
图7是终端设备上显示的环境声音的处理状态的示意图;FIG. 7 is a schematic diagram of the processing state of the environmental sound displayed on the terminal device;
图8为终端设备的平台控制界面的另一实施方式的示意图;FIG. 8 is a schematic diagram of another implementation manner of a platform control interface of a terminal device;
图9是录音记录列表的一实施方式的示意图;FIG. 9 is a schematic diagram of an embodiment of a recording record list;
图10是本说明书另一实施例提供的一种控制方法的流程示意图;FIG. 10 is a schematic flowchart of a control method provided by another embodiment of this specification;
图11是本说明书一实施例提供的一种终端设备的示意性框图;FIG. 11 is a schematic block diagram of a terminal device according to an embodiment of the present specification;
图12是本说明书一实施例提供的一种可移动平台的示意性框图。Fig. 12 is a schematic block diagram of a movable platform provided by an embodiment of the present specification.
具体实施方式Detailed ways
下面将结合本说明书实施例中的附图,对本说明书实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本说明书一部分实施例,而不是全部的实施例。基于本说明书中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本说明书保护的范围。The technical solutions in the embodiments of this specification will be clearly and completely described below in conjunction with the drawings in the embodiments of this specification. Obviously, the described embodiments are part of the embodiments of this specification, not all of the embodiments. Based on the embodiments in this specification, all other embodiments obtained by a person of ordinary skill in the art without creative work shall fall within the protection scope of this specification.
附图中所示的流程图仅是示例说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解、组合或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowchart shown in the drawings is only an example, and does not necessarily include all contents and operations/steps, nor does it have to be executed in the described order. For example, some operations/steps can also be decomposed, combined or partially combined, so the actual execution order may be changed according to actual conditions.
下面结合附图,对本说明书的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Hereinafter, some embodiments of this specification will be described in detail with reference to the accompanying drawings. In the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.
请参阅图1,图1是本说明书一实施例提供的一种控制方法的流程示意图。该控制方法可以应用于一终端设备和一可移动平台构成的系统,用于实现终端设备和可移动平台之间的语音对讲等过程。具体的,所述终端设备和所述可移动平台均设有音频传感器和扬声器。Please refer to FIG. 1, which is a schematic flowchart of a control method provided by an embodiment of this specification. The control method can be applied to a system composed of a terminal device and a movable platform, and is used to implement processes such as voice intercom between the terminal device and the movable platform. Specifically, both the terminal device and the movable platform are provided with audio sensors and speakers.
其中终端设备例如可以包括遥控器、手机、平板电脑、笔记本电脑、台式电脑、个人数字助理、穿戴式设备,例如虚拟现实(Virtual Reality,VR)眼镜、FPV(First Person View,第一人称主视角)眼镜等中的至少一项;可移动平台例如可以为可移动机器人、机器车、无人机等,此处以可移动机器人为例进行示意性说明。Among them, terminal devices may include, for example, remote controls, mobile phones, tablet computers, notebook computers, desktop computers, personal digital assistants, and wearable devices, such as virtual reality (Virtual Reality, VR) glasses, FPV (First Person View, first-person view) At least one item of glasses, etc.; the movable platform may be, for example, a movable robot, a robotic vehicle, an unmanned aerial vehicle, etc., and a movable robot is used as an example for schematic illustration.
在一些实施方式中,如图2所示,可移动机器人11和终端设备13可以进行通信,通信方式可以是有线通信,或者是无线通信。本实施例以无线通信为例。In some embodiments, as shown in FIG. 2, the mobile robot 11 and the terminal device 13 may communicate, and the communication method may be wired communication or wireless communication. This embodiment takes wireless communication as an example.
示例性的,可移动平台和终端设备可以直接通信连接,也可以通过路由器、服务器、基站等通信连接以互相传递数据。Exemplarily, the mobile platform and the terminal device may be directly connected in communication, or may be connected to each other through communication connections such as routers, servers, base stations, etc.
在一些实施方式中,如图2所示,可移动机器人包括:In some embodiments, as shown in Figure 2, the mobile robot includes:
机器人本体110,包括底盘主体111和设于底盘主体111上的云台主体112,云台主体112用于搭载摄像装置101;The robot body 110 includes a chassis main body 111 and a pan/tilt main body 112 provided on the chassis main body 111. The pan/tilt main body 112 is used to carry the camera 101;
动力装置120,设于底盘主体111上,用于对机器人本体110提供移动动力;The power device 120 is arranged on the chassis body 111 and is used to provide moving power to the robot body 110;
音频传感器和扬声器,设于机器人本体110上,音频传感器用于采集环境声音,扬声 器用于播放音频;The audio sensor and the speaker are arranged on the robot body 110, the audio sensor is used to collect environmental sound, and the speaker is used to play audio;
通信装置,设于机器人本体110上,用于与终端设备进行通信。The communication device is provided on the robot body 110 and is used to communicate with the terminal device.
在一些实施方式中,可移动机器人上设置有发射装置,该发射装置可用于发射弹丸,弹丸的大小和形状不作具体限定。In some embodiments, a launching device is provided on the mobile robot, and the launching device can be used to launch projectiles, and the size and shape of the projectiles are not specifically limited.
可选的,如图3所示,多个可移动机器人,如可移动机器人11和可移动机器人12通过各自的发射装置发射弹丸或光束进行对战。另外,每个可移动机器人还可以对应有一个终端设备,或者,多个可移动机器人对应一个终端设备,例如可移动机器人11对应有终端设备13,可移动机器人12对应有终端设备14。Optionally, as shown in FIG. 3, multiple mobile robots, such as the mobile robot 11 and the mobile robot 12, launch projectiles or light beams through their respective launching devices to compete. In addition, each mobile robot may also correspond to one terminal device, or multiple mobile robots correspond to one terminal device. For example, the mobile robot 11 corresponds to the terminal device 13 and the mobile robot 12 corresponds to the terminal device 14.
如图1所示,应用于一终端设备和一可移动平台构成的系统的控制方法可以包括步骤S110至步骤S140。As shown in FIG. 1, the control method applied to a system composed of a terminal device and a movable platform may include step S110 to step S140.
S110、所述终端设备根据所述终端设备的环境声音获取终端声音文件,并将所述终端声音文件向所述可移动平台发送。S110: The terminal device obtains a terminal sound file according to the environmental sound of the terminal device, and sends the terminal sound file to the movable platform.
S120、所述可移动平台接收所述终端设备发送的终端声音文件,并解码所述终端声音文件后进行播放。S120. The mobile platform receives the terminal sound file sent by the terminal device, decodes the terminal sound file, and plays it.
S130、所述可移动平台根据所述可移动平台的环境声音,生成平台声音文件。S130. The movable platform generates a platform sound file according to the environmental sound of the movable platform.
S140、所述终端设备接收所述可移动平台发送的平台声音文件后,解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。S140: After receiving the platform sound file sent by the movable platform, the terminal device decodes the platform sound file to obtain platform audio information, and plays the platform audio information.
在一种可能的方式中,如图2和图3所示,可移动机器人可设置有音频传感器,该音频传感器可用于采集可移动机器人周围环境中的音频数据,例如,该音频传感器可以是麦克风,可移动机器人通过麦克风采集到周围环境中的音频数据后,可将该音频数据发送给与该可移动机器人通信的终端设备。In a possible manner, as shown in Figures 2 and 3, the mobile robot can be provided with an audio sensor, which can be used to collect audio data in the surrounding environment of the mobile robot. For example, the audio sensor can be a microphone. After the mobile robot collects audio data in the surrounding environment through a microphone, the audio data can be sent to the terminal device that communicates with the mobile robot.
例如,如图3所示,可移动机器人11与终端设备13通信连接,可移动机器人11通过麦克风采集到周围环境中的音频数据后,将该音频数据发送给终端设备13。可选的,可移动机器人11通过麦克风采集到的周围环境中的音频数据可以来源于其他可移动机器人,例如,可移动机器人12。或者,可移动机器人11上的麦克风采集到的周围环境中的音频数据可以来源于其他可移动机器人的终端设备的用户,例如,可移动机器人12与终端设备14通信连接,可移动机器人11上的麦克风采集到的音频数据是终端设备14或控制终端设备14的用户的音频数据。也就是说,可移动机器人11可以将其采集到的可移动机器人12发出的音频数据和/或可移动机器人12对应的终端设备14或终端设备14的用户的音频数据发送给终端设备13。For example, as shown in FIG. 3, the mobile robot 11 is in communication connection with the terminal device 13, and the mobile robot 11 collects audio data in the surrounding environment through a microphone, and then sends the audio data to the terminal device 13. Optionally, the audio data in the surrounding environment collected by the mobile robot 11 through a microphone may be derived from other mobile robots, for example, the mobile robot 12. Alternatively, the audio data in the surrounding environment collected by the microphone on the mobile robot 11 may come from users of other mobile robot terminal devices. For example, the mobile robot 12 is in communication connection with the terminal device 14, and the mobile robot 11 The audio data collected by the microphone is the audio data of the terminal device 14 or the user who controls the terminal device 14. In other words, the mobile robot 11 may send the collected audio data emitted by the mobile robot 12 and/or the audio data of the terminal device 14 corresponding to the mobile robot 12 or the user of the terminal device 14 to the terminal device 13.
同理,可移动机器人12也可以将其采集到的可移动机器人11发出的音频数据和/或可移动机器人11对应的终端设备13或终端设备13的用户的音频数据发送给终端设备14。Similarly, the mobile robot 12 may also send the collected audio data from the mobile robot 11 and/or the terminal device 13 corresponding to the mobile robot 11 or the user's audio data of the terminal device 13 to the terminal device 14.
在另一种可能的方式中,如图1所示,终端设备可设置有音频传感器,该音频传感器可用于采集该终端设备周围环境中的音频数据,例如,该终端设备的用户的音频数据。进一步,该终端设备将该用户的音频数据发送给与该终端设备通信的可移动机器人。In another possible manner, as shown in FIG. 1, the terminal device may be provided with an audio sensor, and the audio sensor may be used to collect audio data in the surrounding environment of the terminal device, for example, audio data of a user of the terminal device. Further, the terminal device sends the user's audio data to the mobile robot that communicates with the terminal device.
例如,可移动机器人11与终端设备13通信连接,终端设备13上设置有麦克风,该麦克风可用于采集该终端设备13的用户的音频数据,进一步,终端设备13将该用户的音频数据发送给可移动机器人11。同理,终端设备14也可以将该终端设备14的用户的音频数据发送给可移动机器人12。For example, the mobile robot 11 is in communication connection with the terminal device 13, and the terminal device 13 is provided with a microphone. The microphone can be used to collect the audio data of the user of the terminal device 13. Further, the terminal device 13 sends the user's audio data to the user Mobile robot 11. Similarly, the terminal device 14 may also send the audio data of the user of the terminal device 14 to the mobile robot 12.
在又一种可能的方式中,如图2所示,终端设备和可移动机器人均设置有音频传感器。例如,可移动机器人11与终端设备13通信连接,可移动机器人11与终端设备13分别设置有麦克风,可移动机器人11通过可移动机器人11上的麦克风实时采集可移动机器人11周围的音频数据,并将该音频数据发送给终端设备13,同时,终端设备13通过终端设备13上的麦克风实时采集该终端设备13的用户的音频数据,并将该音频数据发送给可移动机器人11。In another possible manner, as shown in Fig. 2, both the terminal device and the mobile robot are provided with audio sensors. For example, the mobile robot 11 is in communication connection with the terminal device 13, and the mobile robot 11 and the terminal device 13 are respectively provided with microphones, and the mobile robot 11 collects the audio data around the mobile robot 11 in real time through the microphone on the mobile robot 11, and The audio data is sent to the terminal device 13, and at the same time, the terminal device 13 collects the audio data of the user of the terminal device 13 in real time through the microphone on the terminal device 13 and sends the audio data to the mobile robot 11.
再例如,如图3所示,可移动机器人11与终端设备13通信连接,可移动机器人12与终端设备14通信连接,可移动机器人11、可移动机器人12、终端设备13和终端设备14上分别设置有麦克风,终端设备13通过终端设备13上的麦克风采集该终端设备13的用户的音频数据,并将该音频数据发送给可移动机器人11,该可移动机器人11上还可设置有扬声器,该扬声器可用于播放该终端设备13的用户的音频数据。同时,可移动机器人12上的麦克风采集该终端设备13的用户的音频数据,并移动至终端设备14的位置,以将该终端设备13的用户的音频数据播放给终端设备14的用户。同理,可移动机器人12也可以接收到终端设备14的用户的音频数据,并通过可移动机器人12上的扬声器播放该终端设备14的用户的音频数据,同时,可移动机器人11采集该终端设备14的用户的音频数据,并发送给终端设备13。For another example, as shown in FIG. 3, the mobile robot 11 is in communication connection with the terminal device 13, and the mobile robot 12 is in communication connection with the terminal device 14. The mobile robot 11, the mobile robot 12, the terminal device 13 and the terminal device 14 are respectively A microphone is provided, and the terminal device 13 collects the audio data of the user of the terminal device 13 through the microphone on the terminal device 13, and sends the audio data to the mobile robot 11. The mobile robot 11 may also be provided with a speaker. The speaker can be used to play audio data of the user of the terminal device 13. At the same time, the microphone on the mobile robot 12 collects the audio data of the user of the terminal device 13 and moves to the position of the terminal device 14 to play the audio data of the user of the terminal device 13 to the user of the terminal device 14. In the same way, the mobile robot 12 can also receive the audio data of the user of the terminal device 14, and play the audio data of the user of the terminal device 14 through the speaker on the mobile robot 12. At the same time, the mobile robot 11 collects the terminal device 14 14 of the user’s audio data, and sent to the terminal device 13.
本说明书上述实施方式提供的控制方法,通过终端设备采集其周围环境的音频数据,并将该音频数据发送给对应的可移动平台,可以使得用户可以在距离可移动平台较远的地方通过可移动平台发出声音,例如对该可移动平台附近的人或者其他可移动平台喊话。另外,通过可移动平台采集其周围环境的音频数据,并将该音频数据发送给该可移动平台的终端设备,可使得该终端设备的用户即使不在可移动平台的附近,也可以收听到可移动平台所处环境的 声音场景,例如终端设备的用户即使不在可移动平台的对战现场,也可以感受到该对战现场的氛围,而且可使得该可移动平台的终端设备的用户根据可移动平台传回的音频准确地控制该可移动平台。可以方便用户更加便捷和直观与周围环境进行互动,有利于用户对可移动平台的控制,满足用户传递语音信息的目的。在多个可移动机器人的组网比赛中,增加视听效果的实时性,提高了玩家的语音互动趣味。The control method provided by the above-mentioned embodiments of this specification collects the audio data of its surrounding environment through the terminal device and sends the audio data to the corresponding movable platform, so that the user can pass through the movable platform far away from the movable platform. The platform emits sounds, for example, shouting to people near the movable platform or other movable platforms. In addition, collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform. The sound scene of the environment where the platform is located. For example, even if the user of the terminal device is not on the mobile platform, they can still feel the atmosphere of the battle scene, and the user of the terminal device of the mobile platform can send it back according to the mobile platform. The audio accurately controls the movable platform. It is convenient for users to interact with the surrounding environment more conveniently and intuitively, which is conducive to the user's control of the movable platform and meets the user's purpose of transmitting voice information. In the networked competition of multiple mobile robots, the real-time audio-visual effects are increased, and the voice interaction fun of the players is improved.
请结合前述实施例参阅图4,如图4所示为本说明书一实施例提供的一种控制方法的流程示意图。该控制方法可以应用于终端设备,所述终端设备用于与一可移动平台进行通信,所述终端设备和所述可移动平台均设有音频传感器和扬声器。Please refer to FIG. 4 in conjunction with the foregoing embodiment. FIG. 4 is a schematic flowchart of a control method provided by an embodiment of this specification. The control method can be applied to a terminal device that is used to communicate with a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers.
如图4所示,本说明书实施例的控制方法包括步骤S210至步骤S240。As shown in FIG. 4, the control method of the embodiment of this specification includes step S210 to step S240.
S210、根据所述终端设备的环境声音获取终端声音文件。S210: Acquire a terminal sound file according to the environmental sound of the terminal device.
在一些实施方式中,终端设备通过音频传感器采集所处环境中的声音,将采集的声音数据编码得到终端声音文件。通过编码降低发送的数据量,提高声音文件发送的实时性。In some embodiments, the terminal device collects sound in the environment through an audio sensor, and encodes the collected sound data to obtain a terminal sound file. Reduce the amount of data sent by encoding, and improve the real-time performance of sound file sending.
示例性的,终端设备对当前采集的环境声音进行实时编码,在录制结束后生成对应编码格式的终端声音文件。从而提高了终端声音文件向可移动平台发送的实时性。Exemplarily, the terminal device encodes the currently collected environmental sound in real time, and generates a terminal sound file in the corresponding encoding format after the recording ends. This improves the real-time performance of the terminal sound file sending to the mobile platform.
终端设备将当前采集的声音数据实时编码为OPUS的数据,在录制结束后生成OPUS格式的终端声音文件。The terminal device encodes the currently collected sound data into OPUS data in real time, and generates a terminal sound file in OPUS format after the recording is finished.
在一些实施方式中,所述控制方法还包括:显示平台控制界面,所述平台控制界面包括对讲按钮。In some embodiments, the control method further includes: displaying a platform control interface, and the platform control interface includes an intercom button.
在其他一些实施方式中,对讲按钮也可以为设置在终端设备,如遥控器、VR眼镜或FPV眼镜上的实体按键。In some other embodiments, the intercom button may also be a physical button set on a terminal device, such as a remote control, VR glasses, or FPV glasses.
在一些实施方式中,可移动平台搭载的摄像装置可以将采集的图像传输给终端设备,终端设备可以在如图5所示的平台控制界面显示可移动平台拍摄的图像,以便用户了解可移动平台所处的环境。In some embodiments, the camera device mounted on the movable platform can transmit the collected images to the terminal device, and the terminal device can display the image taken by the movable platform on the platform control interface as shown in FIG. 5, so that the user can understand the movable platform. The environment.
如图5所示,在终端设备平台控制界面的右上方显示名称为喊话器的对讲按钮,通过对讲按钮可以控制采集终端设备的环境声音。As shown in Figure 5, an intercom button named megaphone is displayed on the upper right of the terminal equipment platform control interface, and the environmental sound of the terminal device can be controlled and collected through the intercom button.
示例性的,可以根据用户对所述对讲按钮的设置操作,使能或关闭用户对所述对讲按钮的对讲控制操作,并调整所述对讲按钮的显示方式。Exemplarily, according to the setting operation of the intercom button by the user, the intercom control operation of the intercom button by the user can be enabled or disabled, and the display mode of the intercom button can be adjusted.
例如,如图6所示,显示方式A是对讲按钮默认的显示方式,在该显示方式时,用户可以通过长按对讲按钮,触发终端设备采集环境声音,在采集环境声音时,对讲按钮可以以显示方式C进行显示。当用户点击显示方式A的对讲按钮时,对讲按钮可以以显示方式B进行 显示以提示用户;在用户点击对讲按钮松开后,调整所述对讲按钮的显示方式为显示方式D,并关闭用户对所述对讲按钮的对讲控制操作;在该显示方式D时,用户即使长按对讲按钮,也不会触发终端设备采集环境声音。用户还可以通过点击显示方式C的对讲按钮,触发终端设备以显示方式A显示对讲按钮,实现使能用户对所述对讲按钮的对讲控制操作,如允许通过长按对讲按钮触发终端设备采集环境声音。For example, as shown in Figure 6, display mode A is the default display mode of the intercom button. In this display mode, the user can press and hold the intercom button to trigger the terminal device to collect ambient sound. The button can be displayed in display mode C. When the user clicks the intercom button in display mode A, the intercom button can be displayed in display mode B to prompt the user; after the user clicks the intercom button and releases it, adjust the display mode of the intercom button to display mode D, And close the user's intercom control operation on the intercom button; in this display mode D, even if the user presses the intercom button for a long time, the terminal device will not be triggered to collect environmental sounds. The user can also click on the intercom button of display mode C to trigger the terminal device to display the intercom button in display mode A, so as to enable the user to control the intercom operation of the intercom button, such as allowing triggering by long pressing the intercom button The terminal equipment collects environmental sounds.
可以理解的,所述根据所述终端设备的环境声音获取终端声音文件,包括:根据用户对所述对讲按钮的对讲控制操作,获取所述终端设备的环境声音,编码得到所述终端声音文件。It is understandable that the obtaining the terminal sound file according to the environmental sound of the terminal device includes: obtaining the environmental sound of the terminal device according to the intercom control operation of the intercom button by the user, and encoding the terminal sound file.
具体的,在使能用户对所述对讲按钮的对讲控制操作时,如果检测到用户按下对讲按钮的时长超过预设的阈值,如0.5秒,则开始获取所述终端设备的环境声音,编码得到所述终端声音文件。Specifically, when the user's intercom control operation on the intercom button is enabled, if it is detected that the duration of the user's pressing of the intercom button exceeds a preset threshold, such as 0.5 seconds, then start to obtain the environment of the terminal device The sound is encoded to obtain the terminal sound file.
示例性的,可以在所述对讲按钮被按下后至松开前的时间段获取所述终端设备的环境声音,编码得到所述终端声音文件。Exemplarily, the environmental sound of the terminal device may be acquired during the time period after the intercom button is pressed to before the intercom button is released, and the terminal sound file may be obtained by encoding.
可以理解的,终端设备可以采集对讲按钮被按下时的环境声音,用户松开对讲按钮可以使终端设备结束采集环境声音。It is understandable that the terminal device can collect the environmental sound when the intercom button is pressed, and the user releases the intercom button to make the terminal device end the collection of the environmental sound.
示例性的,若所述对讲按钮被按下持续的时间达到预设时长,例如为60秒,可以停止获取所述终端设备的环境声音,编码已经获取的声音得到所述终端声音文件。可以限制终端声音文件的数据量,防止向可移动平台传输时占用过多时间而影响其他操作,例如控制可移动机器人的运动。Exemplarily, if the duration of the intercom button being pressed reaches a preset duration, for example, 60 seconds, the acquisition of the ambient sound of the terminal device may be stopped, and the acquired sound may be encoded to obtain the terminal sound file. It can limit the data volume of the terminal sound file, and prevent the transmission to the mobile platform from occupying too much time and affecting other operations, such as controlling the movement of the mobile robot.
例如,若用户按下对讲按钮的时间达到了例如30秒,则终端设备结束采集环境声音。For example, if the time for the user to press the intercom button reaches 30 seconds, for example, the terminal device ends collecting environmental sounds.
在一些实施方式中,用户可以通过操作对讲按钮取消环境声音的采集、编码和发送。例如,在用户在长按对讲按钮录音时,若终端设备检测到用户对对讲按钮的拖曳操作,例如在对讲按钮按下时手指触碰的位置向偏离对讲按钮的方向滑动,则结束采集环境声音,还可以清除已经编码的环境声音。In some embodiments, the user can cancel the collection, encoding, and transmission of ambient sound by operating the intercom button. For example, when the user long presses the intercom button to record, if the terminal device detects the user's drag operation of the intercom button, for example, when the intercom button is pressed, the finger touches the position and slides away from the intercom button, then After collecting environmental sounds, you can also clear the encoded environmental sounds.
示例性的,如图5所示,在获取所述终端设备的环境声音时,终端设备可以在所述平台控制界面显示所述环境声音的声音频谱图。以便用户直观的了解当前的录音状态。Exemplarily, as shown in FIG. 5, when acquiring the environmental sound of the terminal device, the terminal device may display the sound spectrogram of the environmental sound on the platform control interface. So that the user can intuitively understand the current recording status.
具体的,可以在检测到用户按下对讲按钮的时长超过预设的阈值,如0.5秒时,在平台控制界面显示所述环境声音的声音频谱图,在未采集环境声音时可以不显示声音频谱图的区域,从而可以在平台控制界面显示更多信息。Specifically, when it is detected that the duration of the user pressing the intercom button exceeds a preset threshold, such as 0.5 seconds, the sound spectrogram of the environmental sound may be displayed on the platform control interface, and the sound may not be displayed when the environmental sound is not collected. The area of the spectrogram, so that more information can be displayed on the platform control interface.
S220、将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放。S220. Send the terminal sound file to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it.
具体的,如图2所示,用户可以通过终端设备13控制可移动机器人11移动至其他人或动物的面前,则该可移动机器人11在接收和解码终端设备13发送的终端声音文件后可以向可移动机器人11面向的其他人或动物播放,实现向其他人或动物传送声音的功能。Specifically, as shown in FIG. 2, the user can control the mobile robot 11 to move in front of other people or animals through the terminal device 13, and the mobile robot 11 can send the terminal sound file sent by the terminal device 13 after receiving and decoding the terminal sound file. The mobile robot 11 plays to other people or animals, and realizes the function of transmitting sounds to other people or animals.
具体的,如图3所示,可移动机器人11位于可移动机器人12的声音传输范围内,在可移动机器人11在接收和解码终端设备13发送的终端声音文件后可以向可移动机器人12播放,实现向其他可移动机器人传送声音的功能。示例性的,可移动机器人12可以采集和发送可移动机器人11播放的声音,然后将采集到的声音发送给与可移动机器人12通信连接的终端设备14,由终端设备14播放给终端设备14的用户。Specifically, as shown in FIG. 3, the mobile robot 11 is located within the sound transmission range of the mobile robot 12. After the mobile robot 11 receives and decodes the terminal sound file sent by the terminal device 13, the mobile robot 11 can play it to the mobile robot 12. Realize the function of transmitting sound to other mobile robots. Exemplarily, the mobile robot 12 may collect and send the sound played by the mobile robot 11, and then send the collected sound to the terminal device 14 communicatively connected with the mobile robot 12, and the terminal device 14 broadcasts the sound to the terminal device 14 user.
在一些实施方式中,所述控制方法还包括:在所述平台控制界面显示所述环境声音的处理状态,所述处理状态包括无声音录入、录制中、传输中、传输完成中的至少一项。可以提示用户环境声音的处理状态。In some embodiments, the control method further includes: displaying the processing status of the environmental sound on the platform control interface, the processing status including at least one of silent recording, recording, transmission, and transmission completed . It can prompt the user of the processing status of the environment sound.
示例性的,如图5和图7所示,可以在平台控制界面的某一区域显示处理状态的图像或动画。Exemplarily, as shown in FIG. 5 and FIG. 7, an image or animation of the processing state may be displayed in a certain area of the platform control interface.
如图5所示,终端设备在平台控制界面显示环境声音的声音频谱图,提示用户所述环境声音的处理状态为录制中。As shown in Fig. 5, the terminal device displays the sound spectrogram of the environmental sound on the platform control interface, prompting the user that the processing state of the environmental sound is recording.
如图7所示,由上至下依次为未检测到环境声音的处理状态、正在采集环境声音的处理状态、正在将环境声音的终端声音文件向可移动平台发送的处理状态、以及终端声音文件发送完毕的处理状态。As shown in Figure 7, from top to bottom, the processing status of the environmental sound is not detected, the processing status of the environmental sound being collected, the processing status of the terminal sound file of the environmental sound being sent to the mobile platform, and the terminal sound file The processing status of the sending.
在一些实施方式中,如图5所示,所述平台控制界面还包括平台控制按钮。平台控制按钮例如可以包括发射按钮、拍照按钮、录像按钮、行人跟随按钮、自定义技能按钮等。In some embodiments, as shown in FIG. 5, the platform control interface further includes a platform control button. The platform control buttons may include, for example, a launch button, a camera button, a video button, a pedestrian follow button, a custom skill button, and the like.
所述控制方法还可以包括:根据用户对所述平台控制按钮的按钮触发操作,生成并向所述可移动平台发送对应的平台控制指令,以使所述可移动平台根据所述平台控制指令执行预设任务。The control method may further include: generating and sending a corresponding platform control instruction to the movable platform according to a button trigger operation of the platform control button by the user, so that the movable platform executes according to the platform control instruction Preset tasks.
示例性的,可移动机器人上设置有发射装置,该发射装置可用于发射弹丸。若终端设备检测到用户对发射按钮的按钮触发操作,则向可移动平台发送发射指令,可移动机器人根据发射指令可以发射弹丸。Exemplarily, a launching device is provided on the mobile robot, and the launching device can be used to launch projectiles. If the terminal device detects that the user triggers an operation on the launch button, it sends a launch instruction to the movable platform, and the movable robot can launch the projectile according to the launch instruction.
示例性的,用户可以通过操作拍照按钮、录像按钮,使终端设备向可移动平台搭载按钮、录像指令,以使可移动平台通过搭载的摄像装置实现拍照、录像等任务。Exemplarily, the user can operate the camera button and the video button to enable the terminal device to load the mobile platform with buttons and video instructions, so that the mobile platform can perform tasks such as photography and video recording through the mounted camera device.
示例性的,当终端设备检测到用户对行人跟随按钮的按钮触发操作,则向可移动平台发送行人跟随指令,可移动平台根据行人跟随指令对拍摄图像中的行人目标进行跟随拍摄,并 将拍摄的图像发送给终端设备。Exemplarily, when the terminal device detects that the user triggers the operation of the pedestrian follow button, it sends a pedestrian follow instruction to the movable platform, and the movable platform follows the pedestrian target in the captured image according to the pedestrian follow instruction, and then shoots The image is sent to the terminal device.
示例性的,用户可以定义自定义技能按钮对应的可移动平台的任务,例如漂移。当终端设备检测到用户对该自定义技能按钮的按钮触发操作,则向可移动平台发送漂移指令,可移动平台根据漂移指令实现漂移任务。Exemplarily, the user can define tasks of the movable platform corresponding to the custom skill button, such as drift. When the terminal device detects that the user triggers an operation on the button of the custom skill button, it sends a drift instruction to the movable platform, and the movable platform implements the drift task according to the drift instruction.
示例性的,自定义技能按钮还可以定义为眩晕技能,对应的可移动平台的任务包括向某个可移动平台释放该技能并击中该可移动平台,可以控制被击中的可移动平台在原地旋转,并持续时间1.5秒。Exemplarily, the custom skill button can also be defined as a stun skill. The tasks of the corresponding movable platform include releasing the skill to a certain movable platform and hitting the movable platform, and can control the hitting movable platform in the original The ground rotates and lasts for 1.5 seconds.
示例性的,自定义技能按钮还可以定义为致盲技能,对应的可移动平台的任务包括在预设的时间阈值内,将被击中的可移动平台对应的遥控终端的显示界面调整为与致盲技能对应的动画效果。动画效果比如花屏、黑屏或雪花屏遮挡图传画面,使用户无法正常观看图传画面。在实际应用中,该预设的时间阈值具体可以为1.5秒。Exemplarily, the custom skill button can also be defined as a blinding skill, the task of the corresponding movable platform is included within a preset time threshold, and the display interface of the remote control terminal corresponding to the movable platform being hit is adjusted to The animation effect corresponding to the blinding skill. Animation effects, such as blurred, black or snowflake screens, block the image transmission screen, making it impossible for users to view the image transmission screen normally. In practical applications, the preset time threshold may specifically be 1.5 seconds.
示例性的,自定义技能按钮还可以定义为电磁干扰技能,该技能可通过红外发射器发射,被击中的可移动平台的图传传输受到干扰2.5秒,还可以表现为FPV界面显示为花屏效果。Exemplarily, the custom skill button can also be defined as an electromagnetic interference skill, which can be launched by an infrared transmitter, and the image transmission of the movable platform that is hit is interfered for 2.5 seconds, and it can also be expressed as the FPV interface displayed as a flower screen effect.
示例性的,自定义技能按钮还可以定义为极速技能,对应的可移动平台的任务包括可获得更快的移动速度,并持续3秒。Exemplarily, the custom skill button can also be defined as a speed skill, and the tasks of the corresponding movable platform include obtaining a faster moving speed and lasting for 3 seconds.
示例性的,自定义技能按钮还可以定义为无敌技能,对应的可移动平台的任务包括可自动解除对手释放的技能效果,且获得3秒的护盾,使得对方无法对其造成伤害。Exemplarily, the custom skill button can also be defined as an invincible skill. The tasks of the corresponding movable platform include automatically canceling the skill effect released by the opponent, and obtaining a shield for 3 seconds so that the opponent cannot cause damage to it.
示例性的,平台控制按钮可以包括调焦按钮,如图5中界面右下方的按钮,表示当前可移动平台的摄像装置的焦距为4倍。用户可以通过对调焦按钮的操作,切换可移动平台的摄像装置的焦距例如为1倍、2倍等。Exemplarily, the platform control button may include a focusing button, such as the button at the bottom right of the interface in FIG. 5, which indicates that the focal length of the camera device of the current movable platform is 4 times. The user can switch the focal length of the camera device of the movable platform to, for example, 1x, 2x, etc. by operating the focus button.
示例性的,如图5所示,平台控制按钮可以包括声音回传按钮,当终端设备检测到用户对该声音回传按钮的按钮触发操作,则向可移动平台发送声音回传指令,可移动平台根据声音回传指令获取可移动平台周边的声音并将采集的声音回传给终端设备,终端设备例如可以播放可移动平台周边的声音。Exemplarily, as shown in FIG. 5, the platform control button may include a sound return button. When the terminal device detects that the user triggers an operation on the sound return button, it sends a sound return instruction to the movable platform, which can move The platform obtains the sound around the movable platform according to the sound return instruction and transmits the collected sound back to the terminal device. The terminal device can, for example, play the sound around the movable platform.
在一些实施方式中,所述控制方法还可以包括:获取用户的控制语音,根据所述控制语音生成并向所述可移动平台发送对应的平台控制指令,以使所述可移动平台根据所述平台控制指令执行预设任务。In some embodiments, the control method may further include: acquiring a user's control voice, generating and sending a corresponding platform control instruction to the movable platform according to the control voice, so that the movable platform can be controlled according to the The platform control commands execute preset tasks.
示例性的,终端设备存储有控制语音和平台控制指令的映射关系数据,从而用户可以通过语音控制终端设备向可移动平台发送对应的平台控制指令,以使可移动平台执行预设任务。例如,若终端设备检测到用户的“发射”控制指令,则向可移动平台发送发射指令,可移动 平台根据发射指令可以发射弹丸。Exemplarily, the terminal device stores the mapping relationship data between the control voice and the platform control instruction, so that the user can send the corresponding platform control instruction to the movable platform through the voice control terminal device, so that the movable platform can perform the preset task. For example, if the terminal device detects the user's "launch" control instruction, it sends a launch instruction to the movable platform, and the movable platform can launch projectiles according to the launch instruction.
示例性的,所述获取用户的控制语音,可以包括:获取所述终端设备的环境声音,在所述环境声音中检测所述控制语音。Exemplarily, the obtaining the control voice of the user may include: obtaining the environmental sound of the terminal device, and detecting the control voice in the environmental sound.
具体的,终端设备持续监听所处环境的声音,并检测环境声音中是否存在控制语音。用户可以更快速的通过语音输入控制指令。Specifically, the terminal device continuously monitors the sound of the environment in which it is located, and detects whether there is a control voice in the environment sound. Users can quickly input control commands by voice.
示例性的,所述获取用户的控制语音,可以包括:在用户触发语音控制功能时,获取用户发出的控制语音。Exemplarily, the obtaining the control voice of the user may include: obtaining the control voice uttered by the user when the user triggers the voice control function.
示例性的,终端设备的显示界面还显示语音控制按键,用户可以在按下语音控制按键时发出控制语音。可以防止在用户不需要语音控制时错误检测控制语音。Exemplarily, the display interface of the terminal device also displays a voice control button, and the user can make a control voice when pressing the voice control button. It can prevent the wrong detection of the control voice when the user does not need the voice control.
S230、获取所述可移动平台发送的平台声音文件,所述平台声音文件由所述可移动平台采集所述可移动平台的环境声音后生成。S230. Obtain a platform sound file sent by the movable platform, where the platform sound file is generated by the movable platform after collecting the environmental sound of the movable platform.
S240、解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。S240. Decode the platform sound file to obtain platform audio information, and play the platform audio information.
在一些实施方式中,可移动平台可以自主或者根据终端设备的控制采集所述可移动平台的环境声音,生成平台声音文件后向终端设备发送,以由终端设备播放可移动平台的环境声音,便于用户了解可移动平台所处的环境。例如,可使得该终端设备的用户即使不在可移动机器人的对战现场,也可以感受到该对战现场的氛围;还可以使得该可移动机器人的终端设备的用户根据该音频数据准确地控制该可移动机器人。In some embodiments, the mobile platform can collect the environmental sound of the mobile platform independently or according to the control of the terminal device, generate the platform sound file and send it to the terminal device, so that the terminal device can play the environmental sound of the mobile platform, which is convenient for The user understands the environment in which the mobile platform is located. For example, the user of the terminal device can feel the atmosphere of the battle scene even if the user of the terminal device is not on the battle scene of the mobile robot; it can also make the user of the terminal device of the mobile robot accurately control the mobile robot according to the audio data. robot.
示例性的,如图5所示,终端设备上设有声音回传按钮,当终端设备检测到用户对该声音回传按钮的按钮触发操作,则向可移动平台发送声音回传指令,可移动平台根据声音回传指令采集所述可移动平台的环境声音,生成平台声音文件后向终端设备发送。从而用户可以通过终端设备控制收听或不收听可移动平台的环境声音。Exemplarily, as shown in Figure 5, the terminal device is provided with a sound return button. When the terminal device detects that the user triggers an operation on the sound return button, it sends a sound return instruction to the movable platform, which can move The platform collects the environmental sound of the movable platform according to the sound return instruction, generates the platform sound file and sends it to the terminal device. Therefore, the user can control to listen to or not listen to the ambient sound of the mobile platform through the terminal device.
在一些实施方式中,所述平台声音文件是所述可移动平台根据所述可移动平台附近的人的声音生成的。In some embodiments, the platform sound file is generated by the movable platform according to the voice of a person near the movable platform.
示例性的,如图2所示,用户可以通过终端设备13控制可移动机器人11移动至其他人或动物的面前,则该可移动机器人11可以采集可移动机器人11面向的其他人或动物发出的声音,生成平台声音文件后可以发送给终端设备13,由终端设备播放给可移动机器人11面向的其他人或动物。Exemplarily, as shown in FIG. 2, the user can control the mobile robot 11 to move in front of other people or animals through the terminal device 13, then the mobile robot 11 can collect the messages sent by the other people or animals that the mobile robot 11 faces. The sound can be sent to the terminal device 13 after the platform sound file is generated, and the terminal device will be played to other people or animals that the mobile robot 11 faces.
在一些实施方式中,所述平台声音文件是所述可移动平台根据来源于至少另一可移动平台的声音生成的。In some embodiments, the platform sound file is generated by the movable platform according to a sound originating from at least another movable platform.
示例性的,另一可移动平台可以自主或者根据对应终端设备的控制发出声音例如咆哮的 声音,则所述可移动平台可以根据所述另一可移动平台发出的声音生成平台声音文件。Exemplarily, another movable platform can make a sound such as a roaring sound autonomously or according to the control of a corresponding terminal device, and the movable platform can generate a platform sound file according to the sound made by the another movable platform.
在一些实施方式中,所述平台声音文件是所述可移动平台根据来源于至少另一可移动平台的终端设备的用户的声音生成的。In some embodiments, the platform sound file is generated by the mobile platform according to a user's voice from a terminal device of at least another mobile platform.
具体的,如图3所示,可移动机器人11位于可移动机器人12的声音传输范围内,在可移动机器人11在接收和解码终端设备13发送的终端声音文件后可以向可移动机器人12播放;可移动机器人12可以采集和发送可移动机器人11播放的声音,然后将采集到的声音发送给与可移动机器人12通信连接的终端设备14,由终端设备14播放给终端设备14的用户。Specifically, as shown in FIG. 3, the mobile robot 11 is located within the sound transmission range of the mobile robot 12, and the mobile robot 11 can play to the mobile robot 12 after receiving and decoding the terminal sound file sent by the terminal device 13; The mobile robot 12 can collect and send the sound played by the mobile robot 11, and then send the collected sound to the terminal device 14 communicatively connected with the mobile robot 12, and the terminal device 14 will play the sound to the user of the terminal device 14.
在一些实施方式中,所述控制方法还可以包括:终端设备根据用户的对象设置操作,确定所述终端声音文件的播放对象。In some embodiments, the control method may further include: the terminal device determines the playback target of the terminal sound file according to the user's object setting operation.
示例性的,所述将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放,包括:将所述终端声音文件和所述播放对象的信息向所述可移动平台发送,以使所述可移动平台在识别到所述播放对象时播放所述终端声音文件。从而用户可以控制可移动平台对指定的对象播放声音。Exemplarily, the sending the terminal sound file to the mobile platform so that the mobile platform decodes the terminal sound file and then plays it includes: sending the terminal sound file and the playback target The information is sent to the movable platform, so that the movable platform plays the terminal sound file when the playback object is recognized. In this way, the user can control the movable platform to play sound to the specified object.
示例性的,用户可以在终端设备指定终端声音文件的播放对象,播放对象例如包括设有指定标记,如二维码图案或战队图案的其他可移动平台、具有指定脸部特征的人或动物等。Exemplarily, the user can specify the playback object of the terminal sound file on the terminal device. The playback object includes, for example, other movable platforms with designated marks, such as a QR code pattern or a team pattern, people or animals with designated facial features, etc. .
例如,在步骤S220将所述终端声音文件向所述可移动平台发送时,将指定标记等也发送给可移动平台。可移动平台可以检测拍摄的图像中是否存在播放对象,如果存在播放对象则向播放对象播放对应的终端声音文件。For example, when the terminal sound file is sent to the movable platform in step S220, the designated mark and the like are also sent to the movable platform. The movable platform can detect whether there is a playback object in the captured image, and if there is a playback object, it will play the corresponding terminal sound file to the playback object.
示例性的,终端设备可以从所述可移动平台获取所述可移动平台拍摄的图像,显示所述图像,然后根据用户对所述图像中播放对象的选中操作,确定所述播放对象。Exemplarily, the terminal device may obtain an image taken by the movable platform from the movable platform, display the image, and then determine the playback object according to a user's selection operation of the playback object in the image.
例如,可移动平台将拍摄的图像发送给终端设备进行显示,用户可以在图像中进行选中操作,例如通过点击或者框选选中图像中的一定区域,则终端设备和可移动平台可以确定播放对象的一些信息,可移动平台可以根据选中的区域检测摄像装置视野内是否存在播放对象。For example, the mobile platform sends the captured image to the terminal device for display, and the user can perform a selection operation in the image. For example, by clicking or box selecting a certain area in the image, the terminal device and the mobile platform can determine the playback object With some information, the movable platform can detect whether there is a playback object in the field of view of the camera device according to the selected area.
示例性的,终端设备可以根据用户对所述终端设备本地图像的选中操作,显示用户选中的本地图像,然后根据用户对所述本地图像中播放对象的确定操作,确定所述播放对象。Exemplarily, the terminal device may display the local image selected by the user according to the user's selection operation on the local image of the terminal device, and then determine the playback object according to the user's determining operation on the playback object in the local image.
例如,终端设备本地存储有自己战队的战队图案或者包含战队图案的图像,则可以选中该战队图案使得终端设备确定所述播放对象需要具备的图案;将该战队图案发送给可移动平台,则可移动平台可以根据战队图案检测摄像装置视野内是否存在播放对象。For example, if the terminal device locally stores the team pattern of its own team or an image containing the team pattern, the team pattern can be selected so that the terminal device determines the pattern that the playback object needs to have; and the team pattern is sent to the mobile platform. The mobile platform can detect whether there is a playback object in the field of view of the camera device based on the team pattern.
在一些实施方式中,所述控制方法还可以包括:显示录音记录列表,根据用户对所述录音记录列表中录音记录的播放控制操作,获取所述录音记录对应的终端声音文件。然后可以 将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放。从而用户可以更快捷的向可移动平台发送终端声音文件。In some embodiments, the control method may further include: displaying a recording record list, and acquiring a terminal sound file corresponding to the recording record according to a user's playback control operation of the recording record in the recording record list. Then, the terminal sound file can be sent to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it. In this way, users can send terminal sound files to the mobile platform more quickly.
如图8所示,终端设备可以在平台控制界面上显示录音记录列表。例如用户可以通过点击平台控制界面上的相应按钮控制终端设备显示录音记录列表。或者终端设备也可以在其他界面上显示录音记录列表,本实施方式对此不做限定。示例性的,录音记录列表右上角显示对应的关闭按钮,用户点击该关闭按钮时,录音记录列表关闭。As shown in Figure 8, the terminal device can display a list of recording records on the platform control interface. For example, the user can control the terminal device to display the recording record list by clicking the corresponding button on the platform control interface. Or the terminal device may also display the recording record list on other interfaces, which is not limited in this embodiment. Exemplarily, the corresponding close button is displayed in the upper right corner of the recording record list. When the user clicks the close button, the recording record list is closed.
具体的,录音记录列表中可以包括一条或多条录音记录,各录音记录对应相应的终端声音文件。例如,终端设备存储有录音记录列表中各录音记录对应相应的终端声音文件。Specifically, the recording record list may include one or more recording records, and each recording record corresponds to a corresponding terminal sound file. For example, the terminal device stores a corresponding terminal sound file corresponding to each recording record in the recording record list.
示例性的,如果录音记录列表中的录音记录较多,则可以显示其中几条,用户可以通过上下滚动查看或者选中其余的录音记录。Exemplarily, if there are many recording records in the recording record list, several of them can be displayed, and the user can scroll up and down to view or select the remaining recording records.
如图8所示,在各录音记录的相应位置可以显示录音时长,如10”等,还可以在录音时长左侧显示播放按钮。如果终端设备检测到用户对某一条录音记录对应的播放按钮的播放控制操作,则获取所述录音记录对应的终端声音文件。As shown in Figure 8, the recording duration can be displayed at the corresponding position of each recording record, such as 10", etc., and the playback button can also be displayed on the left side of the recording duration. If the terminal device detects that the user has selected the playback button corresponding to a certain recording record The playback control operation is to obtain the terminal sound file corresponding to the recording record.
示例性的,如图9所示,录音记录列表中各录音记录的图标的长度可以根据录音时长进行调整,例如录音时长越长的录音记录,其图标也越长。Exemplarily, as shown in FIG. 9, the length of the icon of each recording record in the recording record list can be adjusted according to the recording duration, for example, the longer the recording duration, the longer the icon.
示例性的,还可以在所述录音记录列表显示所述录音记录的播放状态和/或播放进度。如图9所示,可以以进度条的方式显示第一条录音记录的播放状态和/或播放进度。Exemplarily, the playback status and/or playback progress of the recording record may also be displayed in the recording record list. As shown in Figure 9, the playback status and/or playback progress of the first recording can be displayed in the form of a progress bar.
示例性的,用户可以在录音记录列表增加新的录音记录。具体的,终端设备可以根据用户的新增录音操作,获取和处理新录入的声音以得到新的终端声音文件,并在所述录音记录列表更新对应的录音记录。Exemplarily, the user can add a new recording record to the recording record list. Specifically, the terminal device may obtain and process the newly recorded sound according to the user's newly added recording operation to obtain a new terminal sound file, and update the corresponding recording record in the recording record list.
例如,如图8所示,用户可以按下“按住录音”按钮,触发终端设备获取和处理新录入的声音以得到新的终端声音文件,并在所述录音记录列表更新对应的录音记录。For example, as shown in FIG. 8, the user can press the "press and hold recording" button to trigger the terminal device to acquire and process the newly recorded sound to obtain a new terminal sound file, and update the corresponding recording record in the recording record list.
示例性的,所述控制方法还包括:根据用户对所述录音记录列表中录音记录的循环播放操作,获取所述录音记录对应的终端声音文件。Exemplarily, the control method further includes: acquiring a terminal sound file corresponding to the recording record according to a user's loop playback operation of the recording record in the recording record list.
如图8和图9所示,在各录音记录的右侧可以显示循环按钮。如果终端设备检测到用户对某一条录音记录对应的循环按钮的播放控制操作,则获取所述录音记录对应的终端声音文件。然后可以将所述终端声音文件和循环指令向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后循环播放。从而用户可以通过循环播放操作,控制可移动平台重复播放指定的录音。As shown in Figure 8 and Figure 9, the loop button can be displayed on the right side of each recording. If the terminal device detects the user's playback control operation on the loop button corresponding to a certain recording record, it acquires the terminal sound file corresponding to the recording record. Then, the terminal sound file and the loop instruction may be sent to the movable platform, so that the movable platform decodes the terminal sound file and plays it in a loop. Therefore, the user can control the movable platform to repeatedly play the specified recording through the loop playback operation.
在另一些实施方式中,所述控制方法还可以包括:显示录音记录列表;根据用户对所述 录音记录列表中录音记录的播放控制操作,向所述可移动平台发送所述录音记录的信息,以使所述可移动平台播放所述录音记录对应的终端音频信息。In some other implementation manners, the control method may further include: displaying a list of recording records; sending the information of the recording records to the mobile platform according to a user's playback control operation on the recording records in the recording record list, So that the mobile platform can play the terminal audio information corresponding to the recording record.
示例性的,可移动平台可以预先存储录音记录列表中各录音记录对应的终端声音文件,或者终端声音文件解码得到的终端音频信息。从而可移动平台可以根据用户在终端设备选中的录音记录的信息确定要播放的终端声音文件或者终端音频信息。Exemplarily, the mobile platform may pre-store a terminal sound file corresponding to each recording record in the recording record list, or terminal audio information obtained by decoding the terminal sound file. Therefore, the mobile platform can determine the terminal sound file or terminal audio information to be played according to the information recorded in the recording selected by the user on the terminal device.
示例性的,所述控制方法还可以包括:根据用户对所述录音记录列表中录音记录的循环播放操作,向所述可移动平台发送所述录音记录的信息和循环指令,以使所述可移动平台循环播放所述录音记录对应的终端音频信息。从而用户可以通过循环播放操作,控制可移动平台重复播放指定的录音。Exemplarily, the control method may further include: according to a user's loop playback operation of the recording records in the recording record list, sending the recording record information and loop instructions to the movable platform, so that the The mobile platform cyclically plays the terminal audio information corresponding to the recording record. Therefore, the user can control the movable platform to repeatedly play the specified recording through the loop playback operation.
示例性的,所述控制方法还可以包括:根据用户的新增录音操作,获取和处理新录入的声音以得到新的终端声音文件,并在所述录音记录列表更新对应的录音记录;将所述终端声音文件发送给所述可移动平台,以使所述可移动平台存储解码所述终端声音文件得到的终端音频信息。Exemplarily, the control method may further include: acquiring and processing the newly recorded sound to obtain a new terminal sound file according to the new recording operation of the user, and updating the corresponding recording record in the recording record list; The terminal sound file is sent to the mobile platform, so that the mobile platform stores terminal audio information obtained by decoding the terminal sound file.
例如,终端设备可以在得到所述新的终端声音文件后即时将所述终端声音文件发送给所述可移动平台。For example, the terminal device may immediately send the terminal sound file to the mobile platform after obtaining the new terminal sound file.
例如,终端设备可以根据用户对所述新的终端声音文件对应的录音记录的播放控制操作,将所述终端声音文件发送给所述可移动平台。例如,在用户第一次对新的录音记录执行播放控制操作或者循环播放操作时,终端设备将所述新的录音记录对应的终端声音文件发送给所述可移动平台。For example, the terminal device may send the terminal sound file to the mobile platform according to the user's playback control operation of the recording record corresponding to the new terminal sound file. For example, when the user performs a playback control operation or a loop playback operation on a new recording record for the first time, the terminal device sends the terminal sound file corresponding to the new recording record to the mobile platform.
从而用户可以编辑录音记录列表中的录音记录,还可以将新增的录音记录对应的终端声音文件发送给可移动平台以使可移动平台存储新增录音记录对应的终端声音文件或者终端音频信息。Therefore, the user can edit the recording records in the recording record list, and can also send the terminal sound file corresponding to the newly added recording record to the mobile platform so that the mobile platform can store the terminal sound file or terminal audio information corresponding to the newly added recording record.
示例性的,还可以在所述录音记录列表显示所述录音记录的播放状态和/或播放进度。如图9所示,可以以进度条的方式显示第一条录音记录的播放状态和/或播放进度。Exemplarily, the playback status and/or playback progress of the recording record may also be displayed in the recording record list. As shown in Figure 9, the playback status and/or playback progress of the first recording can be displayed in the form of a progress bar.
请结合前述实施例参阅图10,如图10所示为本说明书一实施例提供的一种控制方法的流程示意图。该控制方法可以应用于可移动平台,所述可移动平台用于与一终端设备进行通信,所述终端设备和所述可移动平台均设有音频传感器和扬声器。Please refer to FIG. 10 in conjunction with the foregoing embodiment. FIG. 10 is a schematic flowchart of a control method provided by an embodiment of this specification. The control method can be applied to a movable platform, the movable platform is used to communicate with a terminal device, and both the terminal device and the movable platform are provided with audio sensors and speakers.
如图10所示,本说明书实施例的控制方法包括步骤S310至步骤S330。As shown in FIG. 10, the control method of the embodiment of this specification includes step S310 to step S330.
S310、获取所述终端设备发送的终端声音文件,并解码所述终端声音文件后生成终端音频信息,播放所述终端音频信息。S310. Obtain a terminal sound file sent by the terminal device, and decode the terminal sound file to generate terminal audio information, and play the terminal audio information.
示例性的,所述终端声音文件由所述终端设备采集所述终端设备的环境声音后生成。例如,终端设备通过音频传感器采集所处环境中的声音,将采集的声音数据编码得到终端声音文件,然后将终端声音文件发送给可移动平台。Exemplarily, the terminal sound file is generated by the terminal device after collecting the environmental sound of the terminal device. For example, the terminal device collects the sound in the environment through the audio sensor, encodes the collected sound data to obtain the terminal sound file, and then sends the terminal sound file to the mobile platform.
示例性的,终端设备可以显示录音记录列表,然后根据用户对所述录音记录列表中录音记录的播放控制操作,获取所述录音记录对应的终端声音文件,以将终端声音文件发送给可移动平台。Exemplarily, the terminal device may display a list of recording records, and then obtain the terminal sound file corresponding to the recording record according to the user's playback control operation on the recording record in the recording record list, so as to send the terminal sound file to the mobile platform .
在一些实施方式中,所述控制方法还包括:若获取所述终端设备发送的录音记录的信息,确定所述录音记录对应的终端音频信息,并播放所述录音记录对应的终端音频信息。In some embodiments, the control method further includes: if acquiring the information of the recording record sent by the terminal device, determining the terminal audio information corresponding to the recording record, and playing the terminal audio information corresponding to the recording record.
示例性的,可移动平台可以预先存储终端设备发送的终端声音文件或者解码所述终端声音文件生成的终端音频信息。在接收到终端设备根据用户对所述录音记录列表中录音记录的控制操作发送的播放指令或者循环指令之后,播放所述录音记录对应的终端音频信息。Exemplarily, the mobile platform may pre-store the terminal sound file sent by the terminal device or the terminal audio information generated by decoding the terminal sound file. After receiving the playback instruction or the loop instruction sent by the terminal device according to the user's control operation of the recording record in the recording record list, the terminal audio information corresponding to the recording record is played.
在一些实施方式中,所述可移动平台搭载有摄像装置,所述控制方法还包括:若获取所述终端设备发送的播放对象的信息,根据所述信息在所述摄像装置拍摄的图像中识别所述播放对象;若识别到所述播放对象,向所述播放对象播放对应的终端音频信息。In some embodiments, the movable platform is equipped with a camera device, and the control method further includes: if the information of the playback object sent by the terminal device is acquired, identifying in the image taken by the camera device according to the information The play object; if the play object is recognized, the corresponding terminal audio information is played to the play object.
示例性的,可移动平台将所述摄像装置拍摄的图像发送给所述终端设备,以使所述终端设备根据用户对所述图像中播放对象的选中操作确定所述播放对象。Exemplarily, the movable platform sends the image taken by the camera device to the terminal device, so that the terminal device determines the playback object according to the user's selection operation of the playback object in the image.
例如,可移动平台将拍摄的图像发送给终端设备进行显示,用户可以在图像中进行选中操作,例如通过点击或者框选选中图像中的一定区域,则终端设备和可移动平台可以确定播放对象的一些信息,可移动平台可以根据选中的区域检测摄像装置视野内是否存在播放对象。For example, the mobile platform sends the captured image to the terminal device for display, and the user can perform a selection operation in the image. For example, by clicking or box selecting a certain area in the image, the terminal device and the mobile platform can determine the playback object With some information, the movable platform can detect whether there is a playback object in the field of view of the camera device according to the selected area.
示例性的,步骤S310中的所述播放所述终端音频信息,包括:若获取所述终端音频信息对应的循环指令,循环播放所述终端音频信息。Exemplarily, the playing the terminal audio information in step S310 includes: if a loop instruction corresponding to the terminal audio information is obtained, playing the terminal audio information in a loop.
例如,如果终端设备检测到用户对某一条录音记录对应的循环按钮的播放控制操作,则获取所述录音记录对应的终端声音文件。然后可以将所述终端声音文件和循环指令向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后循环播放。从而用户可以通过循环播放操作,控制可移动平台重复播放指定的录音。For example, if the terminal device detects that the user performs a playback control operation on a loop button corresponding to a certain recording record, it acquires the terminal sound file corresponding to the recording record. Then, the terminal sound file and the loop instruction may be sent to the movable platform, so that the movable platform decodes the terminal sound file and plays it in a loop. Therefore, the user can control the movable platform to repeatedly play the specified recording through the loop playback operation.
例如,终端设备根据用户对所述录音记录列表中录音记录的循环播放操作,向所述可移动平台发送所述录音记录的信息和循环指令,以使所述可移动平台循环播放所述录音记录对应的终端音频信息。For example, the terminal device sends the information of the recording record and the loop instruction to the mobile platform according to the user's circular playback operation of the recording record in the recording record list, so that the mobile platform can play the recording record in a loop Corresponding terminal audio information.
在一些实施方式中,所述控制方法还包括:向所述终端设备发送所述终端音频信息的播放状态和/或播放进度。In some implementation manners, the control method further includes: sending the playback status and/or playback progress of the terminal audio information to the terminal device.
示例性的,终端设备可以在界面显示终端音频信息的播放状态和/或播放进度,例如在录音记录列表显示相应录音记录的播放状态和/或播放进度。Exemplarily, the terminal device may display the playback status and/or playback progress of the terminal audio information on the interface, for example, display the playback status and/or playback progress of the corresponding recording record in the recording record list.
在一些实施方式中,所述控制方法还包括:在播放所述终端音频信息时,根据所述终端音频信息调整所述可移动平台的显示装置的显示参数。In some embodiments, the control method further includes: when playing the terminal audio information, adjusting the display parameters of the display device of the movable platform according to the terminal audio information.
示例性,可移动平台,例如可移动机器人包括显示装置,显示装置例如可以包括环形的灯条。For example, the movable platform, such as the movable robot, includes a display device, and the display device may include, for example, a ring-shaped light bar.
示例性的,可移动平台可以根据所述终端音频信息的声强调整所述显示装置的显示亮度;和/或根据所述终端音频信息的声音频率调整所述显示装置的闪烁频率。例如,终端音频信息在某一时刻的声强变强时,显示装置的显示亮度调亮;终端音频信息在某一时间段的声音频率越高,则显示装置的闪烁频率也越高,可以通过显示装置,如灯条的灯效直观的提示用户可移动平台在播放终端音频信息。Exemplarily, the mobile platform may adjust the display brightness of the display device according to the sound intensity of the terminal audio information; and/or adjust the flicker frequency of the display device according to the sound frequency of the terminal audio information. For example, when the sound intensity of the terminal audio information becomes stronger at a certain moment, the display brightness of the display device is brightened; the higher the sound frequency of the terminal audio information in a certain period of time, the higher the flicker frequency of the display device. A display device, such as a light bar, intuitively reminds the user that the mobile platform is playing terminal audio information.
S320、根据所述可移动平台的环境声音,生成平台声音文件。S320: Generate a platform sound file according to the environmental sound of the movable platform.
S330、将所述平台声音文件向所述终端设备发送,以使所述终端设备解码所述平台声音文件后播放。S330. Send the platform sound file to the terminal device, so that the terminal device decodes the platform sound file and plays it.
在一些实施方式中,可移动平台可以自主或者根据终端设备的控制采集所述可移动平台的环境声音,生成平台声音文件后向终端设备发送,以由终端设备播放可移动平台的环境声音,便于用户了解可移动平台所处的环境。例如,可使得该终端设备的用户即使不在可移动机器人的对战现场,也可以感受到该对战现场的氛围;还可以使得该可移动机器人的终端设备的用户根据该音频数据准确地控制该可移动机器人。In some embodiments, the mobile platform can collect the environmental sound of the mobile platform independently or according to the control of the terminal device, generate the platform sound file and send it to the terminal device, so that the terminal device can play the environmental sound of the mobile platform, which is convenient for The user understands the environment in which the mobile platform is located. For example, the user of the terminal device can feel the atmosphere of the battle scene even if the user of the terminal device is not on the battle scene of the mobile robot; it can also make the user of the terminal device of the mobile robot accurately control the mobile robot according to the audio data. robot.
示例性的,如图5所示,终端设备上设有声音回传按钮,当终端设备检测到用户对该声音回传按钮的按钮触发操作,则向可移动平台发送声音回传指令,可移动平台根据声音回传指令采集所述可移动平台的环境声音,生成平台声音文件后向终端设备发送。从而用户可以通过终端设备控制收听或不收听可移动平台的环境声音。Exemplarily, as shown in Figure 5, the terminal device is provided with a sound return button. When the terminal device detects that the user triggers an operation on the sound return button, it sends a sound return instruction to the movable platform, which can move The platform collects the environmental sound of the movable platform according to the sound return instruction, generates the platform sound file and sends it to the terminal device. Therefore, the user can control to listen to or not listen to the ambient sound of the mobile platform through the terminal device.
在一些实施方式中,可移动平台在未播放所述终端音频信息时,获取所述可移动平台的环境声音。例如,可移动平台在播放所述终端音频信息时,可以先不获取所述可移动平台的环境声音,避免将播放的终端音频信息再回传到终端设备进行播放造成干扰。In some embodiments, the movable platform acquires the environmental sound of the movable platform when the terminal audio information is not played. For example, when the mobile platform is playing the terminal audio information, it may not first acquire the environmental sound of the mobile platform, so as to avoid the interference of transmitting the played terminal audio information back to the terminal device for playback.
在一些实施方式中,可移动平台获取所述可移动平台的环境声音,并将播放的所述终端音频信息从所述环境声音中滤除,避免将播放的终端音频信息再回传到终端设备进行播放造成干扰。In some embodiments, the mobile platform obtains the environmental sound of the mobile platform, and filters the played terminal audio information from the environmental sound, so as to avoid transmitting the played terminal audio information back to the terminal device. Interference caused by playback.
在一些实施方式中,所述平台声音文件是所述可移动平台根据所述可移动平台附近的人 的声音生成的。In some embodiments, the platform sound file is generated by the movable platform according to the voice of a person near the movable platform.
示例性的,如图2所示,用户可以通过终端设备13控制可移动机器人11移动至其他人或动物的面前,则该可移动机器人11可以采集可移动机器人11面向的其他人或动物发出的声音,生成平台声音文件后可以发送给终端设备13,由终端设备播放给可移动机器人11面向的其他人或动物。Exemplarily, as shown in FIG. 2, the user can control the mobile robot 11 to move in front of other people or animals through the terminal device 13, then the mobile robot 11 can collect the messages sent by the other people or animals that the mobile robot 11 faces. The sound can be sent to the terminal device 13 after the platform sound file is generated, and the terminal device will be played to other people or animals that the mobile robot 11 faces.
在一些实施方式中,可移动平台可以根据至少另一可移动平台的声音和/或至少另一可移动平台的终端设备的用户的声音,生成平台声音文件。In some embodiments, the movable platform may generate a platform sound file based on the sound of at least another movable platform and/or the sound of a user of at least another terminal device of the movable platform.
示例性的,另一可移动平台可以自主或者根据对应终端设备的控制发出声音例如咆哮的声音,则所述可移动平台可以根据所述另一可移动平台发出的声音生成平台声音文件。Exemplarily, another movable platform can make a sound such as a roaring sound autonomously or according to the control of a corresponding terminal device, and the movable platform can generate a platform sound file according to the sound made by the another movable platform.
示例性的,如图3所示,可移动机器人11位于可移动机器人12的声音传输范围内,在可移动机器人11在接收和解码终端设备13发送的终端声音文件后可以向可移动机器人12播放;可移动机器人12可以采集和发送可移动机器人11播放的声音,然后将采集到的声音发送给与可移动机器人12通信连接的终端设备14,由终端设备14播放给终端设备14的用户。Exemplarily, as shown in FIG. 3, the mobile robot 11 is located within the sound transmission range of the mobile robot 12, and the mobile robot 11 can play to the mobile robot 12 after receiving and decoding the terminal sound file sent by the terminal device 13. The mobile robot 12 can collect and send the sound played by the mobile robot 11, and then send the collected sound to the terminal device 14 communicatively connected with the mobile robot 12, which is played by the terminal device 14 to the user of the terminal device 14.
在一些实施方式中,所述控制方法还包括:若获取所述终端设备发送的平台控制指令,根据所述平台控制指令执行预设任务;其中,所述平台控制指令是所述终端设备根据用户对所述终端设备的平台控制按钮的按钮触发操作发送的,或者是所述终端设备根据用户的控制语音发送的。In some embodiments, the control method further includes: if the platform control instruction sent by the terminal device is acquired, execute a preset task according to the platform control instruction; wherein, the platform control instruction is the terminal device according to the user It is sent by the button trigger operation of the platform control button of the terminal device, or sent by the terminal device according to the control voice of the user.
示例性的,可移动机器人上设置有发射装置,该发射装置可用于发射弹丸。若终端设备检测到用户对发射按钮的按钮触发操作,则向可移动平台发送发射指令,可移动机器人根据发射指令可以发射弹丸。Exemplarily, a launching device is provided on the mobile robot, and the launching device can be used to launch projectiles. If the terminal device detects that the user triggers an operation on the launch button, it sends a launch instruction to the movable platform, and the movable robot can launch the projectile according to the launch instruction.
示例性的,终端设备存储有控制语音和平台控制指令的映射关系数据,从而用户可以通过语音控制终端设备向可移动平台发送对应的平台控制指令,以使可移动平台执行预设任务。例如,若终端设备检测到用户的“发射”控制指令,则向可移动平台发送发射指令,可移动平台根据发射指令可以发射弹丸。Exemplarily, the terminal device stores the mapping relationship data between the control voice and the platform control instruction, so that the user can send the corresponding platform control instruction to the movable platform through the voice control terminal device, so that the movable platform can perform the preset task. For example, if the terminal device detects the user's "launch" control instruction, it sends a launch instruction to the movable platform, and the movable platform can launch the projectile according to the launch instruction.
本说明书上述实施例提供的控制方法,通过终端设备采集其周围环境的音频数据,并将该音频数据发送给对应的可移动平台,可以使得用户可以在距离可移动平台较远的地方通过可移动平台发出声音,例如对该可移动平台附近的人或者其他可移动平台喊话。另外,通过可移动平台采集其周围环境的音频数据,并将该音频数据发送给该可移动平台的终端设备,可使得该终端设备的用户即使不在可移动平台的附近,也可以收听到可移动平台所处环境的 声音场景,可以方便用户更加便捷和直观与周围环境进行互动,有利于用户对可移动平台的控制,满足用户传递语音信息的目的。The control method provided by the above-mentioned embodiments of this specification collects audio data of its surrounding environment through a terminal device and sends the audio data to the corresponding movable platform, so that the user can pass through the movable platform far away from the movable platform. The platform emits sounds, for example, shouting to people near the movable platform or other movable platforms. In addition, collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform. The sound scene of the environment where the platform is located can facilitate the user to interact with the surrounding environment more conveniently and intuitively, facilitate the user's control of the movable platform, and meet the user's purpose of transmitting voice information.
以上实施例获取环境声音可由拾音设备采集声音信号实现,其中一采集方法如下所示:Acquiring environmental sound in the above embodiments can be realized by collecting sound signals by a sound pickup device, and one of the collecting methods is as follows:
获取拾音设备采集声音信号产生的第一数字信号;获取标准发生源产生的声音信号对应的第二数字信号;根据所述第一数字信号和所述第二数字信号之间的转换关系,确定所述拾音设备的信号校准参数。Acquire the first digital signal generated by the sound signal collected by the sound pickup device; acquire the second digital signal corresponding to the sound signal generated by the standard source; determine according to the conversion relationship between the first digital signal and the second digital signal The signal calibration parameters of the pickup device.
本发明实施例中,拾音设备采集的声音信号为标准发生源产生的声音信号。标准发声源在产生声音信号的过程中,可以通过其配置的声音播放设备播放标准发声源产生的声音信号,以通过空间中的传输介质将标准发声源产生的声音信号传输至拾音设备处。相应地,拾音设备采集标准发声源产生的声音信号,并将采集到的声音信号转换成电信号。进一步地,拾音设备将转换成的电信号传输给拾音设备的信号处理装置,该信号处理装置可以包括模数转换设备,模数转换设备将拾音设备传输的电信号转换成第一数字信号。在一实施例中,为降低声音信号传递过程中的能量损耗,可以将标准发声源和拾音设备放置在同一密闭腔体内;还可以将标准发声源和拾音设备之间的距离限制在一定距离范围内。该一定距离范围例如是[30cm,1.5m]。在另一实施例中,为避免外界声音信号的干扰,可以使用隔音材料构建形成该密闭腔体。外界声音信号是指除标准发声源之外的其他发声源产生的声音信号。In the embodiment of the present invention, the sound signal collected by the sound pickup device is a sound signal generated by a standard source. In the process of generating sound signals, the standard sound source can play the sound signal generated by the standard sound source through its configured sound playback device, so as to transmit the sound signal generated by the standard sound source to the pickup device through the transmission medium in the space. Correspondingly, the sound pickup device collects the sound signal generated by the standard sound source, and converts the collected sound signal into an electrical signal. Further, the sound pickup device transmits the converted electric signal to the signal processing device of the sound pickup device. The signal processing device may include an analog-to-digital conversion device. The analog-to-digital conversion device converts the electric signal transmitted by the sound pickup device into a first digital signal. signal. In one embodiment, in order to reduce the energy loss in the sound signal transmission process, the standard sound source and the sound pickup device can be placed in the same closed cavity; the distance between the standard sound source and the sound pickup device can also be limited to a certain Within the distance. The certain distance range is, for example, [30cm, 1.5m]. In another embodiment, in order to avoid the interference of external sound signals, sound insulation materials can be used to construct the sealed cavity. The external sound signal refers to the sound signal produced by other sound sources other than the standard sound source.
本发明实施例中,第二数字信号可以是预先存储在存储介质中,拾音设备的信号处理装置直接从其存储介质中获取第二数字信号。第二数字信号也可以是预先存储在其他智能终端或者服务器中,拾音设备的信号处理装置在与其他智能终端或者服务器建立通信连接之后,从其他智能终端或者服务器中获取第二数字信号。In the embodiment of the present invention, the second digital signal may be pre-stored in a storage medium, and the signal processing device of the sound pickup device directly obtains the second digital signal from the storage medium. The second digital signal may also be pre-stored in other smart terminals or servers, and the signal processing device of the sound pickup device obtains the second digital signal from other smart terminals or servers after establishing a communication connection with other smart terminals or servers.
在一实施例中,标准发声源产生的声音信号包括一组或者多组声音信号,每组声音信号所对应的声强值为一个或者多个。当标准发声源产生的一组声音信号所对应的声强值为多个时,该多个声强值可以是呈线性分布。In an embodiment, the sound signal generated by the standard sound source includes one or more groups of sound signals, and each group of sound signals corresponds to one or more sound intensity values. When there are multiple sound intensity values corresponding to a group of sound signals generated by the standard sound source, the multiple sound intensity values may be linearly distributed.
本发明实施例中,拾音设备的信号处理装置首先获取第一数字信号对应的第一采样点集合,并获取第二数字信号对应的第二采样点集合;然后根据第一采样点集合和第二采样点集合,确定第一数字信号和第二数字信号之间的转换关系。In the embodiment of the present invention, the signal processing device of the sound pickup device first obtains the first sampling point set corresponding to the first digital signal, and obtains the second sampling point set corresponding to the second digital signal; and then according to the first sampling point set and the first sampling point set A set of two sampling points to determine the conversion relationship between the first digital signal and the second digital signal.
在一实施例中,拾音设备的信号处理装置根据第一采样点集合和第二采样点集合,确定第一数字信号和第二数字信号之间的转换关系的具体方式为:首先根据第一采样点集合中的采样点确定第一拟合曲线;并根据第二采样点集合中的采样点确定第二拟合曲线。具体地, 拾音设备的信号处理装置利用多项式拟合的方法对第一采样点集合中的采样点进行拟合,获取第一拟合函数和第一拟合曲线。其中,第一拟合函数为第一拟合曲线对应的函数表达式;第一拟合曲线为对第一采样点集合中的采样点进行拟合时拟合优度最好的曲线。同理,可以利用多项式拟合的方法对第二采样点集合中的采样点进行拟合,获取第二拟合函数和第二拟合曲线。其中,第二拟合函数为第二拟合曲线对应的函数表达式;第二拟合曲线为对第二采样点集合中的采样点进行拟合时拟合优度最好的曲线。需要说明的是,拟合方法并不限于多项式拟合,本领域技术人员可根据实际需求设定拟合方法。In an embodiment, the signal processing device of the sound pickup device determines the conversion relationship between the first digital signal and the second digital signal according to the first sampling point set and the second sampling point set: first according to the first set of sampling points and the second set of sampling points. The sampling points in the sampling point set determine the first fitting curve; and the second fitting curve is determined according to the sampling points in the second sampling point set. Specifically, the signal processing device of the sound pickup device uses a polynomial fitting method to fit the sampling points in the first sampling point set to obtain the first fitting function and the first fitting curve. Wherein, the first fitting function is a function expression corresponding to the first fitting curve; the first fitting curve is a curve with the best goodness of fit when fitting sampling points in the first sampling point set. In the same way, the polynomial fitting method can be used to fit the sampling points in the second sampling point set to obtain the second fitting function and the second fitting curve. Wherein, the second fitting function is a function expression corresponding to the second fitting curve; the second fitting curve is a curve with the best goodness of fit when fitting sampling points in the second sampling point set. It should be noted that the fitting method is not limited to polynomial fitting, and those skilled in the art can set the fitting method according to actual needs.
进一步地,拾音设备的信号处理装置获取第一拟合曲线和第二拟合曲线之间的第一目标转换关系,并将第一目标转换关系确定为第一数字信号和第二数字信号之间的转换关系。其中,第一目标转换关系使得第一拟合曲线趋近或者重合于第二拟合曲线。第一目标转换关系可以是根据第一拟合曲线对应的第一拟合函数和第二拟合曲线对应的第二拟合函数之间的函数转换关系确定出的。进一步地,可以根据第一拟合函数和第二拟合函数之间的函数转换关系,确定拾音设备的信号校准参数。Further, the signal processing device of the sound pickup device obtains the first target conversion relationship between the first fitting curve and the second fitting curve, and determines the first target conversion relationship as the one between the first digital signal and the second digital signal The conversion relationship between. Wherein, the first target conversion relationship makes the first fitting curve approach or coincide with the second fitting curve. The first target conversion relationship may be determined according to the function conversion relationship between the first fitting function corresponding to the first fitting curve and the second fitting function corresponding to the second fitting curve. Further, the signal calibration parameters of the sound pickup device can be determined according to the function conversion relationship between the first fitting function and the second fitting function.
本发明实施例中,拾音设备的信号处理装置在根据第一数字信号和第二数字信号之间的转换关系确定出拾音设备的信号校准参数之后,保存该信号校准参数,以在拾音设备后续采集到声音信号之后,通过利用信号校准参数对拾音设备采集到的声音信号对应的数字信号进行校准,从而对拾音设备采集到的声音信号进行校准。采用上述方式可以提高拾音设备的校准精度,从而使得不同的拾音设备对于相同的声音信号具有相同或相近的输出水平。In the embodiment of the present invention, the signal processing device of the sound pickup device determines the signal calibration parameter of the sound pickup device according to the conversion relationship between the first digital signal and the second digital signal, and then saves the signal calibration parameter for the sound pickup device. After the device subsequently collects the sound signal, the digital signal corresponding to the sound signal collected by the sound pickup device is calibrated by using the signal calibration parameters to calibrate the sound signal collected by the sound pickup device. By adopting the above method, the calibration accuracy of the sound pickup device can be improved, so that different sound pickup devices have the same or similar output levels for the same sound signal.
以上实施例获取环境声音后可进行以下处理,具体可以包括:The following processing may be performed after the environmental sound is acquired in the above embodiment, which may specifically include:
步骤S401、利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器,且各预处理电路的所述放大器的模拟增益各不相同;Step S401: Use multiple pre-processing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, where each of the multiple pre-processing circuits includes an amplifier and an analog-to-digital converter, and each pre-processing circuit includes an amplifier and an analog-digital converter. The analog gains of the amplifiers of the processing circuit are different from each other;
具体地,将多个预处理电路并联,每一预处理电路包括放大器和模数转换器,需要说明的是,所述多个预处理电路至少包括两个预处理电路;其中放大器可用于对模拟音频信号进行功率放大,放大倍数通常用增益表示,本实施例中各预处理电路的放大器的模拟增益各不相同,进一步的,模拟增益大小相邻的两个预处理电路的动态范围至少存在部分重叠;而模数转换器可用于将模拟音频信号转换成数字音频信号,以便于后续的信号处理。Specifically, multiple pre-processing circuits are connected in parallel, and each pre-processing circuit includes an amplifier and an analog-to-digital converter. It should be noted that the multiple pre-processing circuits include at least two pre-processing circuits; wherein the amplifier can be used for analog The audio signal is power amplified, and the amplification factor is usually expressed by gain. In this embodiment, the analog gains of the amplifiers of the preprocessing circuits are different. Further, the dynamic range of the two preprocessing circuits with adjacent analog gains is at least partially Overlap; and analog-to-digital converters can be used to convert analog audio signals into digital audio signals to facilitate subsequent signal processing.
假设待处理的模拟音频信号x可由麦克风采集,然后分别输入到并联的多个预处理电路中,分别经过不同增益的功率放大,并转换成数字音频信号x1、…xi…、xI,也即每一预 处理电路的输入均为同一待处理的模拟音频信号,而每一预处理电路的输出的数字音频信号则各不相同,从而得到多路数字音频信号。Assuming that the analog audio signal x to be processed can be collected by a microphone, and then input into multiple pre-processing circuits in parallel, respectively, through power amplification with different gains, and converted into digital audio signals x1,...xi...,xI, that is, each The input of a pre-processing circuit is the same analog audio signal to be processed, and the digital audio signal output by each pre-processing circuit is different, thereby obtaining multiple digital audio signals.
本实施例中可以将待处理的模拟音频信号以预定时长截取为不同片段作为一帧信号,或者在进行模数转换后以预定数量的采样数据作为一帧信号,后续的音频信号处理流程中均可以以一帧信号为单位进行处理。为了保证信号的连续性,相邻帧信号之间可具有一定的重叠,即前一帧信号的尾部与后一帧信号的头部具有重叠量,从而建立相邻帧之间的相关性。In this embodiment, the analog audio signal to be processed can be intercepted into different segments with a predetermined duration as a frame signal, or a predetermined number of sampled data can be used as a frame signal after analog-to-digital conversion. The subsequent audio signal processing procedures are all It can be processed in units of one frame of signal. In order to ensure signal continuity, there may be a certain overlap between adjacent frame signals, that is, the tail of the previous frame signal and the head of the next frame signal have an overlap amount, thereby establishing the correlation between adjacent frames.
步骤S402、对所述多路数字音频信号进行频域转换以获取多路频域数据。Step S402: Perform frequency domain conversion on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data.
在本实施例中,对多路数字音频信号中每一路数字音频信号分别进行频域转换,从而得到每一路数字音频信号对应的频域数据。本实施例中将多路数字音频信号进行频域转换以获取多路频域数据,进而可实现多路频域数据在频域上的融合。其中频域转换方法可采用傅里叶变换(如离散傅里叶变换)、拉普拉斯变换、Z变换等等,具体的频域转换过程此处不再赘述。In this embodiment, frequency domain conversion is performed on each channel of digital audio signal in the multiple channels of digital audio signal, so as to obtain frequency domain data corresponding to each channel of digital audio signal. In this embodiment, the multiple channels of digital audio signals are subjected to frequency domain conversion to obtain multiple channels of frequency domain data, and the fusion of multiple channels of frequency domain data in the frequency domain can be realized. The frequency domain conversion method can adopt Fourier transform (such as discrete Fourier transform), Laplace transform, Z transform, etc. The specific frequency domain conversion process will not be repeated here.
步骤S403、根据所述多路频域数据中的一路或至少两路目标频域数据确定频域融合数据。Step S403: Determine frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data.
在本实施例中,由于各预处理电路对同一待处理的模拟音频信号以不同增益进行放大,因此每一路的音频信号具有不同的最大值和最小值,增益较大一路的音频信号的最大值和最小值均相对较大,而增益较小一路的音频信号的最大值和最小值均相对较小,而动态范围是无失真情况下音频信号的最大值和最小值的比值,本实施例中根据多路频域数据中的一路或至少两路目标频域数据确定频域融合数据,则可通过融合实现在融合后最终得到的音频信号的最大值和最小值的调节,例如较大声音可由增益相对较大的预处理电路提供,较小声音可由增益相对较小的预处理电路提供,从而可以提高录音系统的动态系统,具有较高的灵敏度、同时能够降低底噪。在本实施例中,声音的大小可由音频信号的能量特征信息进行衡量,例如模拟音频信号或数字音频信号的声压级,或者模拟音频信号或数字音频信号的幅值等。In this embodiment, since each pre-processing circuit amplifies the same analog audio signal to be processed with different gains, each channel of audio signal has a different maximum and minimum value, and the maximum value of the audio signal with a larger gain. And the minimum value are relatively large, and the maximum and minimum values of the audio signal with a smaller gain are relatively small, and the dynamic range is the ratio of the maximum and minimum values of the audio signal without distortion. In this embodiment Determine the frequency domain fusion data according to one or at least two channels of the target frequency domain data in the multi-channel frequency domain data, and the maximum and minimum values of the audio signal finally obtained after the fusion can be adjusted through the fusion. For example, the louder sound can be adjusted by A pre-processing circuit with a relatively large gain is provided, and a smaller sound can be provided by a pre-processing circuit with a relatively small gain, so that the dynamic system of the recording system can be improved, with higher sensitivity and lower noise at the same time. In this embodiment, the size of the sound can be measured by the energy feature information of the audio signal, such as the sound pressure level of the analog audio signal or digital audio signal, or the amplitude of the analog audio signal or digital audio signal, and so on.
步骤S404、将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。Step S404: Convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
在本实施例中,在得到频域融合数据后,即可将频域融合数据转换为时域音频信号,转换方法可采用傅里叶变换(如离散傅里叶变换)的逆变换、拉普拉斯变换的逆变换、Z变换的逆变换等等,此处不再赘述。在完成转换后,可根据时域音频信号获取输出音频信号,其中根据时域音频信号获取输出音频信号过程中,可对时域音频信号进行压缩、降噪等操作。此外,若本实施例中以一帧信号为单位进行上述的音频信号处理流程,则根据时域音频信号 获取输出音频信号过程中还需要将各帧信号进行拼接,建立相邻帧之间的相关性,具体的,可将当前帧时域音频信号与前一帧前帧时域音频信号进行叠加处理。In this embodiment, after the frequency domain fusion data is obtained, the frequency domain fusion data can be converted into a time domain audio signal. The conversion method can adopt the inverse transform of the Fourier transform (such as the discrete Fourier transform), the Lap The inverse transform of the Lass transform, the inverse transform of the Z transform, etc., will not be repeated here. After the conversion is completed, the output audio signal can be obtained according to the time domain audio signal. In the process of obtaining the output audio signal according to the time domain audio signal, operations such as compression and noise reduction can be performed on the time domain audio signal. In addition, if the audio signal processing procedure described above is performed in the unit of a frame signal in this embodiment, the process of obtaining the output audio signal according to the time domain audio signal also needs to be spliced between the frame signals to establish the correlation between adjacent frames. Specifically, the time domain audio signal of the current frame and the previous frame time domain audio signal of the previous frame can be superimposed.
本实施例的音频信号处理方法,通过利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器,且各预处理电路的所述放大器的模拟增益各不相同;对所述多路数字音频信号进行频域转换以获取多路频域数据;根据所述多路频域数据中的一路或至少两路目标频域数据确定频域融合数据;将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。本实施例的方法可实现有效的提高录音系统的动态范围,具有较高的灵敏度,同时能够降低底噪,满足高信噪比的要求。The audio signal processing method of this embodiment uses multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, wherein each of the multiple preprocessing circuits includes an amplifier and an analog audio signal. Digital converter, and the analog gains of the amplifiers of each preprocessing circuit are different; frequency domain conversion is performed on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data; according to the multiple channels of frequency domain data One or at least two channels of target frequency domain data determine frequency domain fusion data; convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal. The method of this embodiment can effectively improve the dynamic range of the recording system, has high sensitivity, and can reduce the noise floor and meet the requirements of high signal-to-noise ratio.
在上述任一实施例的基础上,所述音频信号处理方法还包括:On the basis of any of the foregoing embodiments, the audio signal processing method further includes:
获取所述待处理的模拟音频信号的能量特征信息。Obtain the energy feature information of the analog audio signal to be processed.
在本实施例中,待处理的模拟音频信号的能量特征信息可以为模拟音频信号的声压级或者模拟音频信号的幅值等,具体可以为模拟音频信号的瞬时幅度的最大值、最小值或中间值,也可以为模拟音频信号短时(预设时长)平均幅度的最大值、最小值或中间值。In this embodiment, the energy characteristic information of the analog audio signal to be processed may be the sound pressure level of the analog audio signal or the amplitude of the analog audio signal, etc., and specifically may be the maximum, minimum, or instantaneous amplitude of the analog audio signal. The intermediate value can also be the maximum, minimum, or intermediate value of the average amplitude of the analog audio signal in a short time (preset duration).
进一步的,步骤S403所述的根据所述多路频域数据中的一路或至少两路频域数据确定频域融合数据,包括:Further, in step S403, determining frequency domain fusion data according to one or at least two channels of frequency domain data among the multiple channels of frequency domain data includes:
步骤S501、根据所述能量特征信息从所述多路频域数据中确定一路或至少两路目标频域数据;Step S501: Determine one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information;
步骤S502、根据所述一路或至少两路目标频域数据确定频域融合数据。Step S502: Determine frequency domain fusion data according to the one or at least two channels of target frequency domain data.
在本实施例中,在进行融合时可根据待处理的模拟音频信号的能量特征信息从多路频域数据中确定待融合的一路或至少两路目标频域数据,例如可根据待处理的模拟音频信号的能量特征信息的大小确定目标频域数据的数量,如能量特征信息越大、则目标频域数据的数量越多;此外也可根据能量特征信息与预设处理电路的参考能量特征参数进行比较、再确定待融合的一路或至少两路目标频域数据,其中多个预设处理电路的每一个都对应一个各不相同的参考能量特征参数,参考能量特征参数是由预处理电路包括的放大电路的模拟增益决定的,参考能量特征参数与能量特征信息属于相同的参数,也即为预处理电路输出的数字音频信号的声压级或者幅值等,具体可以为在不失真情况下的数字音频信号的瞬时幅度的最大值、最小值或中间值,也可以为数字音频信号短时(预设时长)平均幅度的最大值、最小值或中间值,当预处理电路包括的放大电路的模拟增益越大,则对应的参考能量特征参数越大。In this embodiment, during fusion, one or at least two channels of target frequency domain data to be fused can be determined from multiple channels of frequency domain data according to the energy characteristic information of the analog audio signal to be processed, for example, according to the simulation to be processed The size of the energy feature information of the audio signal determines the number of target frequency domain data. For example, the greater the energy feature information, the greater the number of target frequency domain data; in addition, the reference energy feature parameters of the processing circuit can also be preset according to the energy feature information Compare and determine one or at least two channels of target frequency domain data to be fused. Each of the multiple preset processing circuits corresponds to a different reference energy characteristic parameter. The reference energy characteristic parameter is included by the preprocessing circuit. Determined by the analog gain of the amplifier circuit, the reference energy characteristic parameter and the energy characteristic information belong to the same parameter, that is, the sound pressure level or amplitude of the digital audio signal output by the preprocessing circuit, which can be specified without distortion The maximum, minimum or intermediate value of the instantaneous amplitude of the digital audio signal, or the maximum, minimum or intermediate value of the average amplitude of the digital audio signal in a short time (preset duration), when the preprocessing circuit includes the amplifier circuit The larger the simulation gain of, the larger the corresponding reference energy characteristic parameter.
更具体的,在一种可选实施例中,所述根据所述能量特征信息从所述多路频域数据中确定一路或至少两路目标频域数据,包括:More specifically, in an optional embodiment, the determining one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information includes:
根据所述能量特征信息和多个参考能量特征参数从所述多路频域数据中确定第一目标频域数据和第二目标频域数据,其中,多个参考能量特征参数是根据所述多个预处理电路包括的放大电路的模拟增益确定的。The first target frequency domain data and the second target frequency domain data are determined from the multiple channels of frequency domain data according to the energy feature information and multiple reference energy feature parameters, where the multiple reference energy feature parameters are based on the multiple frequency domain data. The analog gain of the amplifying circuit included in each preprocessing circuit is determined.
在本实施例是从多路频域数据中确定至少两路目标频域数据的情况,其中第一目标频域数据可以仅仅为一路目标频域数据,当然也可为不止一路目标频域数据;同样的,第二目标频域数据也可以仅仅为一路目标频域数据,也可为不止一路目标频域数据。其中,所述根据所述能量特征信息从所述多路频域数据中确定第一目标频域数据和第二目标频域数据,具体可包括:In this embodiment, at least two channels of target frequency domain data are determined from multiple channels of frequency domain data, where the first target frequency domain data may be only one channel of target frequency domain data, and of course, it may also be more than one channel of target frequency domain data; Similarly, the second target frequency domain data may also be only one channel of target frequency domain data or more than one channel of target frequency domain data. Wherein, the determining the first target frequency domain data and the second target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information may specifically include:
步骤S601、从多个参考能量特征参数中确定与所述能量特征信息相邻的第一参考能量特征参数和第二参考能量特征参数;Step S601: Determine a first reference energy characteristic parameter and a second reference energy characteristic parameter adjacent to the energy characteristic information from a plurality of reference energy characteristic parameters;
步骤S602、根据第一参考能量特征参数和第二参考能量特征参数从所述多路频域数据中确定第一目标频域数据和第二目标频域数据;Step S602: Determine first target frequency domain data and second target frequency domain data from the multiple channels of frequency domain data according to the first reference energy characteristic parameter and the second reference energy characteristic parameter;
其中,第一目标频域数据和第二目标频域数据分别是对所述多路数字音频信号中的第一数字音频信号和第二数字音频信号进行所述频域转换得到的,所述第一数字音频信号和第二数字音频信号分别是由第一参考能量特征参数和第二参考能量特征参数对应的第一预处理电路和第二预处理电路对所述待处理的模拟音频信号得到的。Wherein, the first target frequency domain data and the second target frequency domain data are obtained by performing the frequency domain conversion on the first digital audio signal and the second digital audio signal in the multi-channel digital audio signal, respectively. A digital audio signal and a second digital audio signal are obtained from the analog audio signal to be processed by a first preprocessing circuit and a second preprocessing circuit corresponding to the first reference energy characteristic parameter and the second reference energy characteristic parameter, respectively .
步骤S603、根据所述第一目标频域数据和所述第二目标频域数据确定频域融合数据。Step S603: Determine frequency domain fusion data according to the first target frequency domain data and the second target frequency domain data.
在本实施例中,从多个预设处理电路的参考能量特征参数(以L1、L2、…、LI表示,其中I为预设处理电路的个数)选择与待处理的模拟音频信号的能量特征信息(以Lc表示)相邻的第一参考能量特征参数(以L i′表示,其中1≤i′≤I-1)和第二参考能量特征参数(以L i′+1表示),也即Lc处于L i′和L i′+1之间,其中第一参考能量特征参数L i′对应第一预处理电路,该第一预处理电路输出的第一数字音频信号经频域转换得到的频域数据为所述第一目标频域数据,而第二参考能量特征参数L i′+1对应第二预处理电路,该第二预处理电路输出的第二数字音频信号经频域转换得到的频域数据为所述第二目标频域数据,进而可以根据第一目标频域数据和第二目标频域数据进行叠加运算即可获取频域融合数据。 In this embodiment, the energy of the analog audio signal to be processed is selected from the reference energy characteristic parameters of multiple preset processing circuits (indicated by L1, L2,..., LI, where I is the number of preset processing circuits) The feature information (represented by Lc) is adjacent to the first reference energy feature parameter ( represented by L i′ , where 1≤i′≤I-1) and the second reference energy feature parameter (represented by L i′+1 ), That is, Lc is between L i′ and L i′+1 , where the first reference energy characteristic parameter L i′ corresponds to the first preprocessing circuit, and the first digital audio signal output by the first preprocessing circuit undergoes frequency domain conversion The obtained frequency domain data is the first target frequency domain data, and the second reference energy characteristic parameter L i′+1 corresponds to the second preprocessing circuit, and the second digital audio signal output by the second preprocessing circuit passes through the frequency domain. The converted frequency domain data is the second target frequency domain data, and the frequency domain fusion data can be obtained by performing superposition operation according to the first target frequency domain data and the second target frequency domain data.
当然,本实施例中可在确定第一参考能量特征参数Li和第二参考能量特征参数L i′+1后,以L i′和L i′-1(此时需要i′>1)对应的预处理电路输出的数字音频信号经频域转换得到的频域数据为所述第一目标频域数据,同样可以L i′+1和L i′+2(此时需要i′<I-1)对应的预处理 电路输出的数字音频信号经频域转换得到的频域数据为所述第二目标频域数据,当然第一目标频域数据和第二目标频域数据也可分别包括更多路的频域数据,此处不再举例。 Of course, in this embodiment, after determining the first reference energy characteristic parameter Li and the second reference energy characteristic parameter L i′+1 , L i′ and L i′-1 (in this case, i′>1 is required) correspond to The frequency domain data obtained by the frequency domain conversion of the digital audio signal output by the preprocessing circuit is the first target frequency domain data, which can also be L i′+1 and L i′+2 (in this case, i′<I- 1) The frequency domain data obtained by frequency domain conversion of the digital audio signal output by the corresponding preprocessing circuit is the second target frequency domain data. Of course, the first target frequency domain data and the second target frequency domain data may also include more For multiple channels of frequency domain data, no examples are given here.
在另一个可选实施例中,所述根据所述能量特征信息从所述多路频域数据中确定一路或至少两路目标频域数据,包括:In another optional embodiment, the determining one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information includes:
步骤S701、当所述所述能量特征信息小于所述多个参考能量特征参数中最小的第三参考能量特征参数时,根据所述第三参考能量特征参数从所述多路频域数据中确定第三目标频域数据;Step S701: When the energy feature information is less than the smallest third reference energy feature parameter among the multiple reference energy feature parameters, determine from the multiple channels of frequency domain data according to the third reference energy feature parameter The third target frequency domain data;
其中,第三目标频域数据是对所述多路数字音频信号中的第三数字音频信号进行所述频域转换得到的,所述第三数字音频信号是由第三参考能量特征参数对应的第三预处理电路对所述待处理的模拟音频信号得到的;Wherein, the third target frequency domain data is obtained by performing the frequency domain conversion on a third digital audio signal in the multi-channel digital audio signal, and the third digital audio signal is corresponding to a third reference energy characteristic parameter Obtained by the third preprocessing circuit on the analog audio signal to be processed;
步骤S702、根据第三目标频域数据获取所述频域融合数据。Step S702: Acquire the frequency domain fusion data according to the third target frequency domain data.
在本实施例中,若多个预设处理电路的参考能量特征参数(L1、L2、…、LI)中,最小的第三参考能量特征参数L1大于与待处理的模拟音频信号的能量特征信息Lc,也即Lc小于L1,其中第三参考能量特征参数L1对应第三预处理电路输出的第三数字音频信号经频域转换得到的频域数据为所述第三目标频域数据,进而可以根据第三目标频域数据获取所述频域融合数据。In this embodiment, if among the reference energy characteristic parameters (L1, L2,..., LI) of the plurality of preset processing circuits, the smallest third reference energy characteristic parameter L1 is greater than the energy characteristic information of the analog audio signal to be processed Lc, that is, Lc is less than L1, where the third reference energy characteristic parameter L1 corresponds to the third digital audio signal output by the third preprocessing circuit, and the frequency domain data obtained by frequency domain conversion is the third target frequency domain data, and then Acquire the frequency domain fusion data according to the third target frequency domain data.
当然,本实施例中可在确定待处理的模拟音频信号的能量特征信息Lc小于第三参考能量特征参数L1后,以L1及L2对应的预处理电路输出的数字音频信号经频域转换得到的频域数据为所述第三目标频域数据,当然第三目标频域数据也可分别包括更多路的频域数据,此处不再举例。Of course, in this embodiment, after determining that the energy feature information Lc of the analog audio signal to be processed is smaller than the third reference energy feature parameter L1, the digital audio signal output by the preprocessing circuit corresponding to L1 and L2 can be obtained by frequency domain conversion. The frequency domain data is the third target frequency domain data. Of course, the third target frequency domain data may also include more channels of frequency domain data, and no examples are given here.
在另一个可选实施例中,所述对所述多路频域数据进行融合以获取频域融合数据,还包括:In another optional embodiment, the fusing the multiple channels of frequency domain data to obtain frequency domain fusion data further includes:
步骤S801、当所述所述能量特征信息大于所述多个参考能量特征参数中最大的第四参考能量特征参数时,根据所述第四参考能量特征参数从所述多路频域数据中确定第四目标频域数据;Step S801: When the energy feature information is greater than the largest fourth reference energy feature parameter among the multiple reference energy feature parameters, determine from the multiple channels of frequency domain data according to the fourth reference energy feature parameter The fourth target frequency domain data;
其中,第四目标频域数据是对所述多路数字音频信号中的第四数字音频信号进行所述频域转换得到的,所述第四数字音频信号是由第四参考能量特征参数对应的第四预处理电路对所述待处理的模拟音频信号得到的;Wherein, the fourth target frequency domain data is obtained by performing the frequency domain conversion on the fourth digital audio signal in the multi-channel digital audio signal, and the fourth digital audio signal is corresponding to the fourth reference energy characteristic parameter Obtained by the fourth preprocessing circuit on the analog audio signal to be processed;
步骤S802、根据第四目标频域数据获取所述频域融合数据。Step S802: Acquire the frequency domain fusion data according to the fourth target frequency domain data.
在本实施例中,若多个预设处理电路的参考能量特征参数(L1、L2、…、LI)中,最大 的第四参考能量特征参数LI小于与待处理的模拟音频信号的能量特征信息Lc,也即Lc大于LI,其中第四参考能量特征参数LI对应第四预处理电路输出的第四数字音频信号经频域转换得到的频域数据为所述第四目标频域数据,进而可以根据第四目标频域数据获取所述频域融合数据。In this embodiment, if among the reference energy characteristic parameters (L1, L2,..., LI) of the plurality of preset processing circuits, the largest fourth reference energy characteristic parameter LI is less than the energy characteristic information of the analog audio signal to be processed Lc, that is, Lc is greater than LI, where the fourth reference energy characteristic parameter LI corresponds to the fourth digital audio signal output by the fourth preprocessing circuit and the frequency domain data obtained by frequency domain conversion is the fourth target frequency domain data, and then The frequency domain fusion data is acquired according to the fourth target frequency domain data.
当然,本实施例中可在确定待处理的模拟音频信号的能量特征信息Lc大于第四参考能量特征参数LI后,以LI及LI-1对应的预处理电路输出的数字音频信号经频域转换得到的频域数据为所述第四目标频域数据,当然第四目标频域数据也可分别包括更多路的频域数据,此处不再举例。Of course, in this embodiment, after it is determined that the energy feature information Lc of the analog audio signal to be processed is greater than the fourth reference energy feature parameter LI, the digital audio signal output by the preprocessing circuit corresponding to LI and LI-1 is converted into the frequency domain. The obtained frequency-domain data is the fourth target frequency-domain data. Of course, the fourth target frequency-domain data may also include more channels of frequency-domain data, which will not be illustrated here.
在上述实施例的基础上,在根据所述能量特征信息从所述多路频域数据中确定一路或至少两路目标频域数据时,可首先将待处理的模拟音频信号的能量特征信息Lc与多个预设处理电路的参考能量特征参数进行比对,若Lc小于(或等于)第三参考能量特征参数L1,则执行步骤701-702;若Lc大于(或等于)第四参考能量特征参数LI,则执行步骤801-802;若Lc处于相邻的第三参考能量特征参数L1和第四参考能量特征参数LI之间,则执行步骤601-603。On the basis of the foregoing embodiment, when one or at least two channels of target frequency domain data are determined from the multiple channels of frequency domain data according to the energy feature information, the energy feature information Lc of the analog audio signal to be processed may be first determined. Compare with the reference energy characteristic parameters of multiple preset processing circuits, if Lc is less than (or equal to) the third reference energy characteristic parameter L1, perform steps 701-702; if Lc is greater than (or equal to) the fourth reference energy characteristic If the parameter LI is between steps 801-802; if Lc is between the adjacent third reference energy characteristic parameter L1 and the fourth reference energy characteristic parameter LI, then steps 601-603 are performed.
在上述任一实施例的基础上,所述根据所述一路或至少两路目标频域数据确定频域融合数据,包括:On the basis of any of the foregoing embodiments, the determining frequency domain fusion data according to the one or at least two channels of target frequency domain data includes:
对所述至少两路频域数据进行叠加运算以获取频域融合数据。Perform a superposition operation on the at least two channels of frequency domain data to obtain frequency domain fusion data.
在本实施例中,对于至少两路目标频域数据进行融合时,采用对频域数据进行频谱叠加运算的方式,获取频域融合数据。In this embodiment, when at least two channels of target frequency domain data are fused, the frequency domain fusion data is obtained by performing a spectrum superposition operation on the frequency domain data.
进一步的,当所述一路或者多路频域数据包括第一目标频域数据和第二目标频域数据时,则所述根据所述一路或者多路频域数据获取频域融合数据,包括:Further, when the one or more channels of frequency domain data include the first target frequency domain data and the second target frequency domain data, the acquiring frequency domain fusion data according to the one or more channels of frequency domain data includes:
根据第一目标频域数据和第二目标频域数据对应的权重对所述第一路和第二路路频域数据进行叠加运算以获取频域融合数据。Perform a superposition operation on the first channel and the second channel of frequency domain data according to the weights corresponding to the first target frequency domain data and the second target frequency domain data to obtain frequency domain fusion data.
在本实施例中,进行叠加运算是可对第一目标频域数据和第二目标频域数据设置权重,通过将第一目标频域数据和第二目标频域数据以不同的权重进行叠加运算,则可得到不同动态范围的频域融合数据。In this embodiment, performing the superposition operation can set weights for the first target frequency domain data and the second target frequency domain data, by superposing the first target frequency domain data and the second target frequency domain data with different weights. , You can get frequency-domain fusion data with different dynamic ranges.
进一步的,所述多个预设处理电路中的每一个都对应一个各不相同的参考能量特征参数,多个参考能量特征参数是根据所述多个预处理电路包括的放大电路的模拟增益确定的,其中,所述第一目标频域数据和第二目标频域数据对应的权重是根据所述多个预设处理电路中与所述第一目标频域数据对应的第一预处理电路和与所述第二目标频域数据对应的第二预处 理电路的参考能量特征参数确定。Further, each of the plurality of preset processing circuits corresponds to a different reference energy characteristic parameter, and the plurality of reference energy characteristic parameters are determined according to the analog gain of the amplifying circuit included in the plurality of preprocessing circuits , Wherein the weights corresponding to the first target frequency domain data and the second target frequency domain data are based on the first preprocessing circuit and the first preprocessing circuit corresponding to the first target frequency domain data among the plurality of preset processing circuits The reference energy characteristic parameter of the second preprocessing circuit corresponding to the second target frequency domain data is determined.
在本实施例中,第一目标频域数据和第二目标频域数据对应的权重由对应的预处理电路的参考能量特征参数来确定,更具体的,可根据第一目标频域数据对应的第一参考能量特征参数Li以及第二目标频域数据对应的第二参考能量特征参数Li+1与待处理的模拟音频信号的能量特征信息Lc之间的大小关系确定,其中参考能量特征参数约接近模拟音频信号的能量特征信息Lc的,权重越大,例如,Lc更接近Li,则说明模拟音频信号更接近Li对应的预处理电路的数字音频信号,则需要增加Li对应的第一目标频域数据的权重。本实施例中可通过如下公式确定第一目标频域数据的权重a1和第二目标频域数据的权重a2:In this embodiment, the weights corresponding to the first target frequency domain data and the second target frequency domain data are determined by the reference energy characteristic parameters of the corresponding pre-processing circuit. More specifically, the weights corresponding to the first target frequency domain data can be determined. The magnitude relationship between the first reference energy feature parameter Li and the second reference energy feature parameter Li+1 corresponding to the second target frequency domain data and the energy feature information Lc of the analog audio signal to be processed is determined, wherein the reference energy feature parameter is about If the energy characteristic information Lc is close to the analog audio signal, the greater the weight, for example, Lc is closer to Li, it means that the analog audio signal is closer to the digital audio signal of the preprocessing circuit corresponding to Li, and the first target frequency corresponding to Li needs to be increased. The weight of the domain data. In this embodiment, the weight a1 of the first target frequency domain data and the weight a2 of the second target frequency domain data can be determined by the following formula:
Figure PCTCN2019121766-appb-000001
Figure PCTCN2019121766-appb-000001
Figure PCTCN2019121766-appb-000002
Figure PCTCN2019121766-appb-000002
在上述任一实施例的基础上,所述根据所述多路频域数据中的一路或至少两路目标频域数据确定频域融合数据,包括:On the basis of any of the foregoing embodiments, the determining frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data includes:
步骤S901、根据所述一路或至少两路目标频域数据中每一路对应的压缩系数对所述一路或至少两路目标频域数据进行压缩处理;Step S901: Perform compression processing on the one or at least two channels of target frequency domain data according to the compression coefficient corresponding to each channel of the one or at least two channels of target frequency domain data;
步骤S902、根据所述压缩处理之后的一路或者多路频域数据获取频域融合数据。Step S902: Acquire frequency domain fusion data according to one or more channels of frequency domain data after the compression processing.
在本实施例中,由于录音系统的输出是存在数字量化范围的,也即对于音频信号的最大幅值和最小幅值具有限制,音频信号的最大幅值不能大于最大阈值,音频信号的最小幅值不能小于最小阈值,因此在获取频域融合数据时需要对目标频域数据进行压缩处理,以避免融合后的频域融合数据超过数字量化范围。本实施例中,对目标频域数据进行压缩的步骤可与上述的叠加运算同时进行,也可在叠加运算前完成。In this embodiment, since the output of the recording system has a digital quantization range, that is, there are restrictions on the maximum amplitude and minimum amplitude of the audio signal, the maximum amplitude of the audio signal cannot be greater than the maximum threshold, and the minimum amplitude of the audio signal The value cannot be less than the minimum threshold. Therefore, when acquiring the frequency domain fusion data, the target frequency domain data needs to be compressed to avoid the fusion frequency domain fusion data from exceeding the digital quantization range. In this embodiment, the step of compressing the target frequency domain data can be performed simultaneously with the above-mentioned superposition operation, or can be completed before the superposition operation.
进一步的,所述压缩处理为线性压缩处理。Further, the compression processing is linear compression processing.
在上述实施例的基础上,所述一路或者多路频域数据每一路频域数据对应的压缩系数是根据与所述每一路频域数据对应的预处理电路包括的放大器的模拟增益确定的。On the basis of the foregoing embodiment, the compression coefficient corresponding to each channel of frequency domain data of the one or more channels of frequency domain data is determined according to the analog gain of the amplifier included in the preprocessing circuit corresponding to each channel of frequency domain data.
在本实施例中,任一路频域数据对应的压缩系数可以为对应的预处理电路的通道均衡参数和缩放系数的乘积,其中某一预处理电路作为参考预处理电路,任一路预处理电路的通道均衡参数为该预处理电路模拟增益与参考预处理电路模拟增益的比值,缩放系数则根据该预处理电路输出的数字音频信号的大小获取。本实施例中可通过如下公式获取任一路频域数据对应的压缩系数:In this embodiment, the compression coefficient corresponding to any channel of frequency domain data can be the product of the channel equalization parameter and the scaling factor of the corresponding preprocessing circuit. A certain preprocessing circuit is used as a reference preprocessing circuit, and the The channel equalization parameter is the ratio of the analog gain of the preprocessing circuit to the analog gain of the reference preprocessing circuit, and the scaling factor is obtained according to the size of the digital audio signal output by the preprocessing circuit. In this embodiment, the compression coefficient corresponding to any channel of frequency domain data can be obtained by the following formula:
Figure PCTCN2019121766-appb-000003
Figure PCTCN2019121766-appb-000003
其中,
Figure PCTCN2019121766-appb-000004
为通道均衡参数,用于将第i′路预处理电路的频域数据与参考预处理电路的频域数据进行幅值均衡,G i′为预处理电路的模拟增益,G ref为参考预处理电路的模拟增益,α为缩放系数,用于对频域数据进行幅值缩放,一般而言,对于小信号,α≥1,使得信号保持或放大;对于大信号,α<1,使得信号缩小,从而达到压缩动态范围的目的。
among them,
Figure PCTCN2019121766-appb-000004
Is the channel equalization parameter, used to perform amplitude equalization between the frequency domain data of the i′th preprocessing circuit and the frequency domain data of the reference preprocessing circuit, G i′ is the analog gain of the preprocessing circuit, and G ref is the reference preprocessing The analog gain of the circuit, α is the scaling factor, used to scale the frequency domain data. Generally speaking, for small signals, α≥1, so that the signal is maintained or amplified; for large signals, α<1, so that the signal is reduced , So as to achieve the purpose of compressing the dynamic range.
需要说明的是,步骤S901-902可以仅在频域融合数据超出数字量化范围时执行,也即可在步骤S901前判断频域融合数据是否存在超出数字量化范围的可能性,若存在超出数字量化范围的可能性时,才执行步骤S901-902。It should be noted that steps S901-902 can be executed only when the frequency domain fusion data exceeds the digital quantization range, or it can be judged before step S901 whether the frequency domain fusion data has the possibility of exceeding the digital quantization range, if there is a possibility that the frequency domain fusion data exceeds the digital quantization range. Steps S901-902 are executed only when the range is possible.
在上述任一实施例的基础上,所述时域音频信号为当前帧时域音频信号,其中,上述实施例中步骤S404所述的根据所述时域音频信号获取输出音频信号,包括:On the basis of any of the foregoing embodiments, the time domain audio signal is the current frame time domain audio signal, wherein the step S404 in the foregoing embodiment described in step S404 obtaining the output audio signal according to the time domain audio signal includes:
将所述当前帧时域音频信号与当前帧时域音频信号之前获取的历史帧时域音频信号进行叠加处理以获取当前帧时域融合音频信号;Superimposing the current frame time domain audio signal with the historical frame time domain audio signal obtained before the current frame time domain audio signal to obtain the current frame time domain fusion audio signal;
根据所述当前帧时域融合音频信号确定所述输出音频信号。The output audio signal is determined according to the time domain fusion audio signal of the current frame.
在本实施例中,以一帧信号为单位进行上述的各音频信号处理流程,为了保证信号的连续性,相邻帧信号之间可具有一定的重叠,即前一帧信号的尾部与后一帧信号的头部具有重叠量,从而建立相邻帧之间的相关性。因此在通过S404中将当前帧频域融合数据转换为当前帧时域音频信号后,可将当前帧时域音频信号与前一帧时域音频信号中重叠部分进行重叠叠加运算,而当前帧时域音频信号与前一帧时域音频信号中未重叠部分则不进行叠加运算,从而得到当前帧时域融合音频信号,进而可根据所述当前帧时域融合音频信号确定所述输出音频信号。In this embodiment, the above-mentioned audio signal processing procedures are performed in units of one frame signal. In order to ensure the continuity of the signal, there may be a certain overlap between adjacent frame signals, that is, the tail of the previous frame signal and the next The header of the frame signal has an overlap amount, thereby establishing the correlation between adjacent frames. Therefore, after converting the frequency domain fusion data of the current frame into the time domain audio signal of the current frame in S404, the overlapping part of the time domain audio signal of the current frame and the time domain audio signal of the previous frame can be overlapped and superimposed. The non-overlapping portion of the time domain audio signal and the previous frame of time domain audio signal is not superimposed, so as to obtain the current frame time domain fused audio signal, and the output audio signal can be determined according to the current frame time domain fused audio signal.
请结合上述实施例参阅图11,图11是本说明书一实施例提供的终端设备600的示意性框图。该终端设备600包括处理器601和存储器602,还包括音频传感器603、扬声器604。Please refer to FIG. 11 in conjunction with the foregoing embodiment. FIG. 11 is a schematic block diagram of a terminal device 600 according to an embodiment of this specification. The terminal device 600 includes a processor 601 and a memory 602, and also includes an audio sensor 603 and a speaker 604.
其中,音频传感器603用于采集所述终端设备600的环境声音,所述扬声器604用于播放音频信息。The audio sensor 603 is used to collect environmental sounds of the terminal device 600, and the speaker 604 is used to play audio information.
示例性的,处理器601和存储器602通过总线605连接,该总线605比如为I2C(Inter-integrated Circuit)总线。Exemplarily, the processor 601 and the memory 602 are connected by a bus 605, and the bus 605 is, for example, an I2C (Inter-integrated Circuit) bus.
具体地,处理器601可以是微控制单元(Micro-controller Unit,MCU)、中央处理单元(Central Processing Unit,CPU)或数字信号处理器(Digital Signal Processor,DSP)等。Specifically, the processor 601 may be a micro-controller unit (MCU), a central processing unit (Central Processing Unit, CPU), a digital signal processor (Digital Signal Processor, DSP), or the like.
具体地,存储器602可以是Flash芯片、只读存储器(ROM,Read-Only Memory)磁盘、光盘、U盘或移动硬盘等。Specifically, the memory 602 may be a Flash chip, a read-only memory (ROM, Read-Only Memory) disk, an optical disk, a U disk, or a mobile hard disk.
其中,所述处理器601用于运行存储在存储器602中的计算机程序,并在执行所述计算 机程序时实现前述的用于终端设备的控制方法。Wherein, the processor 601 is used to run a computer program stored in the memory 602, and implement the aforementioned control method for terminal equipment when the computer program is executed.
示例性的,所述处理器601用于运行存储在存储器602中的计算机程序,并在执行所述计算机程序时实现如下步骤:Exemplarily, the processor 601 is configured to run a computer program stored in the memory 602, and implement the following steps when the computer program is executed:
根据所述终端设备的环境声音获取终端声音文件;Acquiring a terminal sound file according to the environmental sound of the terminal device;
将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放;Sending the terminal sound file to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it;
获取所述可移动平台发送的平台声音文件,所述平台声音文件由所述可移动平台采集所述可移动平台的环境声音后生成;Acquiring a platform sound file sent by the mobile platform, the platform sound file being generated by the mobile platform after collecting the environmental sound of the mobile platform;
解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。Decoding the platform sound file to obtain platform audio information, and playing the platform audio information.
本说明书实施例提供的终端设备的具体原理和实现方式均与前述实施例的用于终端设备的控制方法类似,此处不再赘述。The specific principles and implementation manners of the terminal device provided in the embodiment of this specification are similar to the control method for the terminal device in the foregoing embodiment, and will not be repeated here.
本说明书的实施例中还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序中包括程序指令,所述处理器执行所述程序指令,实现上述实施例提供的用于终端设备的控制方法的步骤。The embodiments of this specification also provide a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer program includes program instructions, and the processor executes the program instructions to implement the foregoing implementation The example provides the steps of the control method for terminal equipment.
其中,所述计算机可读存储介质可以是前述任一实施例所述的终端设备的内部存储单元,例如所述终端设备的硬盘或内存。所述计算机可读存储介质也可以是所述终端设备的外部存储设备,例如所述终端设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。The computer-readable storage medium may be the internal storage unit of the terminal device described in any of the foregoing embodiments, such as the hard disk or memory of the terminal device. The computer-readable storage medium may also be an external storage device of the terminal device, such as a plug-in hard disk equipped on the terminal device, a smart memory card (Smart Media Card, SMC), and Secure Digital (SD). ) Card, Flash Card, etc.
请参阅图12,图12是本说明书一实施例提供的可移动平台700的示意性框图。该可移动平台700包括处理器701和存储器702,还包括音频传感器703、扬声器704。Please refer to FIG. 12, which is a schematic block diagram of a movable platform 700 according to an embodiment of the present specification. The mobile platform 700 includes a processor 701 and a memory 702, and also includes an audio sensor 703 and a speaker 704.
其中,音频传感器703用于采集所述可移动平台700的环境声音,所述扬声器704用于播放音频信息。The audio sensor 703 is used to collect environmental sounds of the movable platform 700, and the speaker 704 is used to play audio information.
示例性的,处理器701和存储器702通过总线705连接,该总线705比如为I2C(Inter-integrated Circuit)总线。Exemplarily, the processor 701 and the memory 702 are connected by a bus 705, and the bus 705 is, for example, an I2C (Inter-integrated Circuit) bus.
具体地,处理器701可以是微控制单元(Micro-controller Unit,MCU)、中央处理单元(Central Processing Unit,CPU)或数字信号处理器(Digital Signal Processor,DSP)等。Specifically, the processor 701 may be a micro-controller unit (MCU), a central processing unit (Central Processing Unit, CPU), a digital signal processor (Digital Signal Processor, DSP), or the like.
具体地,存储器702可以是Flash芯片、只读存储器(ROM,Read-Only Memory)磁盘、光盘、U盘或移动硬盘等。Specifically, the memory 702 may be a Flash chip, a read-only memory (ROM, Read-Only Memory) disk, an optical disk, a U disk, or a mobile hard disk.
其中,所述处理器701用于运行存储在存储器702中的计算机程序,并在执行所述计算机程序时实现前述的用于可移动平台的控制方法。Wherein, the processor 701 is configured to run a computer program stored in the memory 702, and implement the aforementioned control method for a movable platform when the computer program is executed.
在一些实施方式中,如图2所示,可移动平台可以为可移动机器人,可移动机器人可以包括:In some embodiments, as shown in FIG. 2, the movable platform may be a movable robot, and the movable robot may include:
机器人本体110,包括底盘主体111和设于底盘主体111上的云台主体112,云台主体112用于搭载摄像装置101;The robot body 110 includes a chassis main body 111 and a pan/tilt main body 112 provided on the chassis main body 111. The pan/tilt main body 112 is used to carry the camera 101;
动力装置120,设于底盘主体111上,用于对机器人本体110提供移动动力;The power device 120 is arranged on the chassis body 111 and is used to provide moving power to the robot body 110;
音频传感器和扬声器,设于机器人本体110上,音频传感器用于采集环境声音,扬声器用于播放音频;An audio sensor and a speaker are arranged on the robot body 110, the audio sensor is used to collect environmental sounds, and the speaker is used to play audio;
通信装置,设于机器人本体110上,用于与终端设备进行通信。The communication device is provided on the robot body 110 and is used to communicate with the terminal device.
示例性的,所述处理器701用于运行存储在存储器702中的计算机程序,并在执行所述计算机程序时实现如下步骤:Exemplarily, the processor 701 is configured to run a computer program stored in the memory 702, and implement the following steps when the computer program is executed:
获取所述终端设备发送的终端声音文件,并解码所述终端声音文件后生成终端音频信息,播放所述终端音频信息;其中,所述终端声音文件由所述终端设备采集所述终端设备的环境声音后生成;Acquire the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
根据所述可移动平台的环境声音,生成平台声音文件;Generate a platform sound file according to the environmental sound of the movable platform;
将所述平台声音文件向所述终端设备发送,以使所述终端设备解码所述平台声音文件后播放。Send the platform sound file to the terminal device, so that the terminal device decodes the platform sound file and plays it.
本说明书实施例提供的可移动平台的具体原理和实现方式均与前述实施例的用于可移动平台的控制方法类似,此处不再赘述。The specific principles and implementation manners of the movable platform provided in the embodiment of this specification are similar to the control method for the movable platform of the foregoing embodiment, and will not be repeated here.
本说明书的实施例中还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序中包括程序指令,所述计算机程序被处理器执行时使所述处理器实现上述实施例提供的用于可移动平台的控制方法的步骤。The embodiments of this specification also provide a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer program includes program instructions, the computer program is executed by a processor to cause the processing The device implements the steps of the control method for the movable platform provided in the above embodiment.
其中,所述计算机可读存储介质可以是前述任一实施例所述的可移动平台,如可移动机器人的内部存储单元,例如所述可移动机器人的硬盘或内存。所述计算机可读存储介质也可以是所述可移动平台的外部存储设备,例如所述可移动平台上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。Wherein, the computer-readable storage medium may be the movable platform described in any of the foregoing embodiments, such as an internal storage unit of a movable robot, for example, a hard disk or memory of the movable robot. The computer-readable storage medium may also be an external storage device of the movable platform, for example, a plug-in hard disk equipped on the movable platform, a smart memory card (Smart Media Card, SMC), and Secure Digital (Secure Digital). , SD) card, flash card (Flash Card), etc.
本说明书上述实施例提供的可移动平台、终端设备及其控制方法、存储介质,通过终端设备采集其周围环境的音频数据,并将该音频数据发送给对应的可移动平台,可以使得用户可以在距离可移动平台较远的地方通过可移动平台发出声音,例如对该可移动平台附近的人或者其他可移动平台喊话。另外,通过可移动平台采集其周围环境的音频数据,并将该音频数据发送给该可移动平台的终端设备,可使得该终端设备的用户即使不在可移动平台的附近, 也可以收听到可移动平台所处环境的声音场景,可以方便用户更加便捷和直观与周围环境进行互动,有利于用户对可移动平台的控制,满足用户传递语音信息的目的。The mobile platform, terminal device and its control method, and storage medium provided in the above-mentioned embodiments of this specification collect audio data of its surrounding environment through the terminal device and send the audio data to the corresponding mobile platform, so that users can A place far away from the movable platform makes a sound through the movable platform, for example, shouting to a person near the movable platform or other movable platforms. In addition, collecting the audio data of the surrounding environment through the mobile platform and sending the audio data to the terminal device of the mobile platform can enable the user of the terminal device to listen to the mobile platform even if it is not in the vicinity of the mobile platform. The sound scene of the environment where the platform is located can facilitate the user to interact with the surrounding environment more conveniently and intuitively, facilitate the user's control of the movable platform, and meet the user's purpose of transmitting voice information.
应当理解,在此本说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本说明书。It should be understood that the terms used in this specification are only for the purpose of describing specific embodiments and are not intended to limit the specification.
还应当理解,在本说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should also be understood that the term "and/or" used in this specification and the appended claims refers to any combination of one or more of the associated listed items and all possible combinations, and includes these combinations.
以上所述,仅为本说明书的具体实施方式,但本说明书的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本说明书揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本说明书的保护范围之内。因此,本说明书的保护范围应以权利要求的保护范围为准。The above are only specific implementations of this specification, but the scope of protection of this specification is not limited to this. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in this specification. Modifications or replacements, these modifications or replacements shall be covered within the protection scope of this manual. Therefore, the protection scope of this specification should be subject to the protection scope of the claims.

Claims (42)

  1. 一种控制方法,其特征在于,应用于一终端设备和一可移动平台构成的系统,所述终端设备和所述可移动平台均设有音频传感器和扬声器;A control method, characterized in that it is applied to a system composed of a terminal device and a movable platform, and both the terminal device and the movable platform are provided with audio sensors and speakers;
    所述方法包括:The method includes:
    所述终端设备根据所述终端设备的环境声音获取终端声音文件,并将所述终端声音文件向所述可移动平台发送;The terminal device obtains a terminal sound file according to the environmental sound of the terminal device, and sends the terminal sound file to the mobile platform;
    所述可移动平台接收所述终端设备发送的终端声音文件,并解码所述终端声音文件后进行播放;The mobile platform receives the terminal sound file sent by the terminal device, decodes the terminal sound file, and plays it;
    所述可移动平台根据所述可移动平台的环境声音,生成平台声音文件;The movable platform generates a platform sound file according to the environmental sound of the movable platform;
    所述终端设备接收所述可移动平台发送的平台声音文件后,解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。After receiving the platform sound file sent by the mobile platform, the terminal device decodes the platform sound file to obtain platform audio information, and plays the platform audio information.
  2. 一种控制方法,其特征在于,用于终端设备,所述终端设备用于与一可移动平台进行通信,所述终端设备和所述可移动平台均设有音频传感器和扬声器;A control method, characterized in that it is used in a terminal device, the terminal device is used to communicate with a movable platform, and both the terminal device and the movable platform are provided with an audio sensor and a speaker;
    所述方法包括:The method includes:
    根据所述终端设备的环境声音获取终端声音文件;Acquiring a terminal sound file according to the environmental sound of the terminal device;
    将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放;Sending the terminal sound file to the mobile platform, so that the mobile platform decodes the terminal sound file and plays it;
    获取所述可移动平台发送的平台声音文件,所述平台声音文件由所述可移动平台采集所述可移动平台的环境声音后生成;Acquiring a platform sound file sent by the mobile platform, the platform sound file being generated by the mobile platform after collecting the environmental sound of the mobile platform;
    解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。Decoding the platform sound file to obtain platform audio information, and playing the platform audio information.
  3. 根据权利要求2所述的方法,其特征在于,还包括:The method according to claim 2, further comprising:
    显示平台控制界面,所述平台控制界面包括对讲按钮。A platform control interface is displayed, and the platform control interface includes an intercom button.
  4. 根据权利要求3所述的方法,其特征在于,所述根据所述终端设备的环境声音获取终端声音文件,包括:The method according to claim 3, wherein the obtaining a terminal sound file according to the environmental sound of the terminal device comprises:
    根据用户对所述对讲按钮的对讲控制操作,获取所述终端设备的环境声音,编码得到所述终端声音文件。According to the user's intercom control operation on the intercom button, the environmental sound of the terminal device is acquired, and the terminal sound file is obtained by encoding.
  5. 根据权利要求4所述的方法,其特征在于,所述方法还包括:The method according to claim 4, wherein the method further comprises:
    在所述平台控制界面显示所述环境声音的处理状态,所述处理状态包括无声音录入、 录制中、传输中、传输完成中的至少一项。The processing status of the environmental sound is displayed on the platform control interface, and the processing status includes at least one of silent recording, recording, transmission, and transmission completed.
  6. 根据权利要求4或5所述的方法,其特征在于,所述方法还包括:The method according to claim 4 or 5, wherein the method further comprises:
    获取所述终端设备的环境声音时,在所述平台控制界面显示所述环境声音的声音频谱图。When the environmental sound of the terminal device is acquired, the sound spectrogram of the environmental sound is displayed on the platform control interface.
  7. 根据权利要求4所述的方法,其特征在于,所述根据用户对所述对讲按钮的对讲控制操作,获取所述终端设备的环境声音,编码得到所述终端声音文件,包括:The method according to claim 4, wherein the obtaining the environmental sound of the terminal device according to the intercom control operation of the intercom button by the user, and encoding to obtain the terminal sound file comprises:
    在所述对讲按钮被按下后至松开前的时间段获取所述终端设备的环境声音,编码得到所述终端声音文件。The environmental sound of the terminal device is acquired in the time period after the intercom button is pressed to before the intercom button is released, and the terminal sound file is obtained by encoding.
  8. 根据权利要求7所述的方法,其特征在于,所述根据用户对所述对讲按钮的对讲控制操作,获取所述终端设备的环境声音,还包括:8. The method according to claim 7, wherein said acquiring the environmental sound of the terminal device according to the user's intercom control operation of the intercom button, further comprising:
    若所述对讲按钮被按下持续的时间达到预设时长,停止获取所述终端设备的环境声音,编码已经获取的声音得到所述终端声音文件。If the intercom button is pressed for a preset duration, stop acquiring the ambient sound of the terminal device, and encode the acquired sound to obtain the terminal sound file.
  9. 根据权利要求4所述的方法,其特征在于,还包括:The method according to claim 4, further comprising:
    根据用户对所述对讲按钮的设置操作,使能或关闭用户对所述对讲按钮的对讲控制操作,并调整所述对讲按钮的显示方式。According to the setting operation of the intercom button by the user, the intercom control operation of the intercom button by the user is enabled or disabled, and the display mode of the intercom button is adjusted.
  10. 根据权利要求3所述的方法,其特征在于,所述平台控制界面还包括平台控制按钮,所述方法还包括:The method according to claim 3, wherein the platform control interface further comprises a platform control button, and the method further comprises:
    根据用户对所述平台控制按钮的按钮触发操作,生成并向所述可移动平台发送对应的平台控制指令,以使所述可移动平台根据所述平台控制指令执行预设任务。According to a button trigger operation of the platform control button by the user, a corresponding platform control instruction is generated and sent to the movable platform, so that the movable platform executes a preset task according to the platform control instruction.
  11. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 2-5 and 7-10, wherein the method further comprises:
    显示录音记录列表,根据用户对所述录音记录列表中录音记录的播放控制操作,获取所述录音记录对应的终端声音文件。Display the recording record list, and obtain the terminal sound file corresponding to the recording record according to the user's playback control operation on the recording record in the recording record list.
  12. 根据权利要求11所述的方法,其特征在于,所述方法还包括:The method according to claim 11, wherein the method further comprises:
    根据用户的新增录音操作,获取和处理新录入的声音以得到新的终端声音文件,并在所述录音记录列表更新对应的录音记录。According to the new recording operation of the user, the newly recorded sound is acquired and processed to obtain a new terminal sound file, and the corresponding recording record is updated in the recording record list.
  13. 根据权利要求11所述的方法,其特征在于,所述方法还包括:The method according to claim 11, wherein the method further comprises:
    根据用户对所述录音记录列表中录音记录的循环播放操作,获取所述录音记录对应的终端声音文件;Obtaining a terminal sound file corresponding to the recording record according to the user's loop playback operation of the recording record in the recording record list;
    所述将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放,包括:The sending the terminal sound file to the mobile platform so that the mobile platform decodes the terminal sound file and then plays it includes:
    将所述终端声音文件和循环指令向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后循环播放。The terminal sound file and the loop instruction are sent to the movable platform, so that the movable platform decodes the terminal sound file and plays it in a loop.
  14. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 2-5 and 7-10, wherein the method further comprises:
    显示录音记录列表;Display a list of recording records;
    根据用户对所述录音记录列表中录音记录的播放控制操作,向所述可移动平台发送所述录音记录的信息,以使所述可移动平台播放所述录音记录对应的终端音频信息。According to the user's playback control operation of the recording records in the recording record list, the information of the recording records is sent to the mobile platform, so that the mobile platform can play the terminal audio information corresponding to the recording records.
  15. 根据权利要求14所述的方法,其特征在于,所述方法还包括:The method according to claim 14, wherein the method further comprises:
    根据用户的新增录音操作,获取和处理新录入的声音以得到新的终端声音文件,并在所述录音记录列表更新对应的录音记录;According to the user's new recording operation, acquiring and processing the newly recorded sound to obtain a new terminal sound file, and updating the corresponding recording record in the recording record list;
    将所述终端声音文件发送给所述可移动平台,以使所述可移动平台存储解码所述终端声音文件得到的终端音频信息。The terminal sound file is sent to the mobile platform, so that the mobile platform stores terminal audio information obtained by decoding the terminal sound file.
  16. 根据权利要求15所述的方法,其特征在于,所述将所述终端声音文件发送给所述可移动平台,包括:The method according to claim 15, wherein the sending the terminal sound file to the mobile platform comprises:
    在得到所述新的终端声音文件后即时将所述终端声音文件发送给所述可移动平台;或者Send the terminal sound file to the mobile platform immediately after obtaining the new terminal sound file; or
    根据用户对所述新的终端声音文件对应的录音记录的播放控制操作,将所述终端声音文件发送给所述可移动平台。Send the terminal sound file to the mobile platform according to the user's playback control operation on the recording record corresponding to the new terminal sound file.
  17. 根据权利要求14所述的方法,其特征在于,所述方法还包括:The method according to claim 14, wherein the method further comprises:
    根据用户对所述录音记录列表中录音记录的循环播放操作,向所述可移动平台发送所述录音记录的信息和循环指令,以使所述可移动平台循环播放所述录音记录对应的终端音频信息。According to the user's circular playback operation of the recording records in the recording record list, send the recording record information and cycle instructions to the mobile platform, so that the mobile platform can play the terminal audio corresponding to the recording record in a loop information.
  18. 根据权利要求11-17中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 11-17, wherein the method further comprises:
    在所述录音记录列表显示所述录音记录的播放状态和/或播放进度。The playback status and/or playback progress of the audio recording are displayed in the audio recording list.
  19. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述平台声音文件是所述可移动平台根据来源于至少另一可移动平台的声音生成的,或者是所述可移动平台根据来源于至少另一可移动平台的终端设备的用户的声音生成的。The method according to any one of claims 2-5 and 7-10, wherein the platform sound file is generated by the movable platform according to a sound from at least another movable platform, or The movable platform is generated based on the voice of a user of a terminal device originating from at least another movable platform.
  20. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 2-5 and 7-10, wherein the method further comprises:
    获取用户的控制语音,根据所述控制语音生成并向所述可移动平台发送对应的平台控制指令,以使所述可移动平台根据所述平台控制指令执行预设任务。Acquire a user's control voice, generate and send a corresponding platform control instruction to the movable platform according to the control voice, so that the movable platform executes a preset task according to the platform control instruction.
  21. 根据权利要求20所述的方法,其特征在于,所述获取用户的控制语音,包括:The method according to claim 20, wherein said obtaining the control voice of the user comprises:
    获取所述终端设备的环境声音,在所述环境声音中检测所述控制语音;和/或Acquire the environmental sound of the terminal device, and detect the control voice in the environmental sound; and/or
    在用户触发语音控制功能时,获取用户发出的控制语音。When the user triggers the voice control function, the control voice sent by the user is obtained.
  22. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 2-5 and 7-10, wherein the method further comprises:
    根据用户的对象设置操作,确定所述终端声音文件的播放对象;Determine the playback object of the terminal sound file according to the user's object setting operation;
    所述将所述终端声音文件向所述可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放,包括:The sending the terminal sound file to the mobile platform so that the mobile platform decodes the terminal sound file and then plays it includes:
    将所述终端声音文件和所述播放对象的信息向所述可移动平台发送,以使所述可移动平台在识别到所述播放对象时播放所述终端声音文件。The terminal sound file and the information of the playback object are sent to the movable platform, so that the movable platform plays the terminal sound file when the playback object is recognized.
  23. 根据权利要求22所述的方法,其特征在于,所述根据用户的对象设置操作,确定所述终端声音文件的播放对象,包括:The method according to claim 22, wherein the determining the playback object of the terminal sound file according to the user's object setting operation comprises:
    从所述可移动平台获取所述可移动平台拍摄的图像,显示所述图像;Acquiring an image taken by the movable platform from the movable platform, and displaying the image;
    根据用户对所述图像中播放对象的选中操作,确定所述播放对象。According to the user's selection operation of the playback object in the image, the playback object is determined.
  24. 根据权利要求22所述的方法,其特征在于,所述根据用户的对象设置操作,确定所述终端声音文件的播放对象,包括:The method according to claim 22, wherein the determining the playback object of the terminal sound file according to the user's object setting operation comprises:
    根据用户对所述终端设备本地图像的选中操作,显示用户选中的本地图像;Displaying the local image selected by the user according to the user's selection operation on the local image of the terminal device;
    根据用户对所述本地图像中播放对象的确定操作,确定所述播放对象。The playback object is determined according to the user's determining operation on the playback object in the local image.
  25. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 2-5 and 7-10, wherein the method further comprises:
    根据用户的操作向所述可移动平台发送声音回传指令,以使所述可移动平台根据所述声音回传指令采集所述可移动平台的环境声音,生成所述平台声音文件后向所述终端设备发送。Send a sound return instruction to the movable platform according to the user's operation, so that the movable platform collects the environmental sound of the movable platform according to the sound return instruction, generates the platform sound file, and sends the sound file to the mobile platform. The terminal device sends.
  26. 根据权利要求4-9中任一项所述的方法,其特征在于,所述获取所述终端设备的环境声音,编码得到所述终端声音文件,包括:The method according to any one of claims 4-9, wherein the obtaining the environmental sound of the terminal device and encoding to obtain the terminal sound file comprises:
    对当前采集的环境声音进行实时编码,在录制结束后生成对应编码格式的终端声音文件。Encode the currently collected environmental sound in real time, and generate a terminal sound file in the corresponding encoding format after the recording ends.
  27. 根据权利要求2-5、7-10中任一项所述的方法,其特征在于,所述可移动平台包括可移动机器人,所述可移动机器人包括:The method according to any one of claims 2-5 and 7-10, wherein the movable platform comprises a movable robot, and the movable robot comprises:
    机器人本体,包括底盘主体和设于所述底盘主体上的云台主体,所述云台主体用于搭载摄像装置;The robot body includes a chassis main body and a pan/tilt main body provided on the chassis main body, and the pan/tilt main body is used to carry a camera device;
    动力装置,设于所述底盘主体上,用于对所述机器人本体提供移动动力;A power device, which is provided on the chassis body, and is used to provide moving power to the robot body;
    音频传感器和扬声器,设于所述机器人本体上,所述音频传感器用于采集环境声音, 所述扬声器用于播放音频;An audio sensor and a speaker are provided on the robot body, the audio sensor is used to collect environmental sounds, and the speaker is used to play audio;
    通信装置,设于所述机器人本体上,用于与所述终端设备进行通信。The communication device is arranged on the robot body and used to communicate with the terminal device.
  28. 一种控制方法,其特征在于,应用于可移动平台,所述可移动平台用于与一终端设备进行通信,所述终端设备和所述可移动平台均设有音频传感器和扬声器;A control method, characterized in that it is applied to a movable platform, the movable platform is used to communicate with a terminal device, and both the terminal device and the movable platform are provided with audio sensors and speakers;
    所述方法包括:The method includes:
    获取所述终端设备发送的终端声音文件,并解码所述终端声音文件后生成终端音频信息,播放所述终端音频信息;其中,所述终端声音文件由所述终端设备采集所述终端设备的环境声音后生成;Acquire the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein the terminal sound file is collected by the terminal device from the environment of the terminal device Generated after the sound;
    根据所述可移动平台的环境声音,生成平台声音文件;Generate a platform sound file according to the environmental sound of the movable platform;
    将所述平台声音文件向所述终端设备发送,以使所述终端设备解码所述平台声音文件后播放。Send the platform sound file to the terminal device, so that the terminal device decodes the platform sound file and plays it.
  29. 根据权利要求28所述的方法,其特征在于,所述根据所述可移动平台的环境声音,生成平台声音文件,包括:The method according to claim 28, wherein the generating a platform sound file according to the environmental sound of the movable platform comprises:
    若接收到所述终端设备发送的声音回传指令,根据所述声音回传指令采集所述可移动平台的环境声音,生成所述平台声音文件。If a sound return instruction sent by the terminal device is received, the environmental sound of the movable platform is collected according to the sound return instruction, and the platform sound file is generated.
  30. 根据权利要求28所述的方法,其特征在于,所述方法还包括:The method according to claim 28, wherein the method further comprises:
    若获取所述终端设备发送的录音记录的信息,确定所述录音记录对应的终端音频信息,并播放所述录音记录对应的终端音频信息。If the information of the recording record sent by the terminal device is acquired, the terminal audio information corresponding to the recording record is determined, and the terminal audio information corresponding to the recording record is played.
  31. 根据权利要求28-30中任一项所述的方法,其特征在于,所述播放所述终端音频信息,包括:The method according to any one of claims 28-30, wherein the playing the terminal audio information comprises:
    若获取所述终端音频信息对应的循环指令,循环播放所述终端音频信息。If the loop instruction corresponding to the terminal audio information is acquired, the terminal audio information is played in a loop.
  32. 根据权利要求28-30中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 28-30, wherein the method further comprises:
    向所述终端设备发送所述终端音频信息的播放状态和/或播放进度。Send the playback status and/or playback progress of the terminal audio information to the terminal device.
  33. 根据权利要求28-30中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 28-30, wherein the method further comprises:
    在播放所述终端音频信息时,根据所述终端音频信息调整所述可移动平台的显示装置的显示参数。When the terminal audio information is played, the display parameters of the display device of the movable platform are adjusted according to the terminal audio information.
  34. 根据权利要求33所述的方法,其特征在于,所述根据所述终端音频信息调整所述可移动平台的显示装置的显示参数,包括:The method according to claim 33, wherein the adjusting the display parameters of the display device of the movable platform according to the terminal audio information comprises:
    根据所述终端音频信息的声强调整所述显示装置的显示亮度;和/或Adjusting the display brightness of the display device according to the sound intensity of the terminal audio information; and/or
    根据所述终端音频信息的声音频率调整所述显示装置的闪烁频率。Adjust the flicker frequency of the display device according to the sound frequency of the terminal audio information.
  35. 根据权利要求28-30中任一项所述的方法,其特征在于,所述根据所述可移动平台的环境声音,生成平台声音文件,包括:The method according to any one of claims 28-30, wherein the generating a platform sound file according to the environmental sound of the movable platform comprises:
    根据至少另一可移动平台的声音和/或至少另一可移动平台的终端设备的用户的声音,生成平台声音文件。A platform sound file is generated based on the sound of at least another movable platform and/or the sound of a user of at least another terminal device of the movable platform.
  36. 根据权利要求35所述的方法,其特征在于,所述方法还包括:The method according to claim 35, wherein the method further comprises:
    在未播放所述终端音频信息时,获取所述可移动平台的环境声音;和/或When the terminal audio information is not played, obtain the environmental sound of the movable platform; and/or
    获取所述可移动平台的环境声音,并将播放的所述终端音频信息从所述环境声音中滤除。Acquire the environmental sound of the movable platform, and filter the played terminal audio information from the environmental sound.
  37. 根据权利要求28-30中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 28-30, wherein the method further comprises:
    若获取所述终端设备发送的平台控制指令,根据所述平台控制指令执行预设任务;If the platform control instruction sent by the terminal device is acquired, execute a preset task according to the platform control instruction;
    其中,所述平台控制指令是所述终端设备根据用户对所述终端设备的平台控制按钮的按钮触发操作发送的,或者是所述终端设备根据用户的控制语音发送的。Wherein, the platform control instruction is sent by the terminal device according to a button trigger operation of the platform control button of the terminal device by the user, or sent by the terminal device according to the control voice of the user.
  38. 根据权利要求28-30中任一项所述的方法,其特征在于,所述可移动平台搭载有摄像装置,所述方法还包括:The method according to any one of claims 28-30, wherein the movable platform is equipped with a camera device, and the method further comprises:
    若获取所述终端设备发送的播放对象的信息,根据所述信息在所述摄像装置拍摄的图像中识别所述播放对象;If acquiring the information of the playback object sent by the terminal device, identify the playback object in the image captured by the camera device according to the information;
    若识别到所述播放对象,向所述播放对象播放对应的终端音频信息。If the playback object is recognized, the corresponding terminal audio information is played to the playback object.
  39. 根据权利要求38所述的方法,其特征在于,所述方法还包括:The method according to claim 38, wherein the method further comprises:
    将所述摄像装置拍摄的图像发送给所述终端设备,以使所述终端设备根据用户对所述图像中播放对象的选中操作确定所述播放对象。The image taken by the camera device is sent to the terminal device, so that the terminal device determines the playback object according to the user's selection operation of the playback object in the image.
  40. 一种终端设备,其特征在于,包括音频传感器、扬声器、存储器和处理器;A terminal device, characterized in that it includes an audio sensor, a speaker, a memory, and a processor;
    所述音频传感器用于采集所述终端设备的环境声音,所述扬声器用于播放音频信息;The audio sensor is used to collect environmental sounds of the terminal device, and the speaker is used to play audio information;
    所述存储器用于存储计算机程序;The memory is used to store a computer program;
    所述处理器,用于执行所述计算机程序并在执行所述计算机程序时,实现如下步骤:The processor is configured to execute the computer program and, when executing the computer program, implement the following steps:
    根据所述终端设备的环境声音获取终端声音文件;Acquiring a terminal sound file according to the environmental sound of the terminal device;
    将所述终端声音文件向可移动平台发送,以使所述可移动平台解码所述终端声音文件后播放;Sending the terminal sound file to a mobile platform, so that the mobile platform decodes the terminal sound file and plays it;
    获取所述可移动平台发送的平台声音文件,所述平台声音文件由所述可移动平台采集所述可移动平台的环境声音后生成;Acquiring a platform sound file sent by the mobile platform, the platform sound file being generated by the mobile platform after collecting the environmental sound of the mobile platform;
    解码所述平台声音文件得到平台音频信息,播放所述平台音频信息。Decoding the platform sound file to obtain platform audio information, and playing the platform audio information.
  41. 一种可移动平台,其特征在于,包括音频传感器、扬声器、存储器和处理器;A movable platform, which is characterized in that it comprises an audio sensor, a speaker, a memory and a processor;
    所述音频传感器用于采集所述可移动平台的环境声音,所述扬声器用于播放音频信息;The audio sensor is used to collect environmental sounds of the movable platform, and the speaker is used to play audio information;
    所述存储器用于存储计算机程序;The memory is used to store a computer program;
    所述处理器,用于执行所述计算机程序并在执行所述计算机程序时,实现如下步骤:The processor is configured to execute the computer program and, when executing the computer program, implement the following steps:
    获取终端设备发送的终端声音文件,并解码所述终端声音文件后生成终端音频信息,播放所述终端音频信息;其中,所述终端声音文件由所述终端设备采集所述终端设备的环境声音后生成;Obtain the terminal sound file sent by the terminal device, decode the terminal sound file to generate terminal audio information, and play the terminal audio information; wherein, the terminal sound file is collected by the terminal device after the terminal device’s environmental sound generate;
    根据所述可移动平台的环境声音,生成平台声音文件;Generate a platform sound file according to the environmental sound of the movable platform;
    将所述平台声音文件向所述终端设备发送,以使所述终端设备解码所述平台声音文件后播放。Send the platform sound file to the terminal device, so that the terminal device decodes the platform sound file and plays it.
  42. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时使所述处理器实现如权利要求1-39中任一项所述的方法。A computer-readable storage medium, characterized in that, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor realizes as described in any one of claims 1-39. The method described.
PCT/CN2019/121766 2019-11-28 2019-11-28 Mobile platform, terminal device and control method therefor, and storage medium WO2021102855A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2019/121766 WO2021102855A1 (en) 2019-11-28 2019-11-28 Mobile platform, terminal device and control method therefor, and storage medium
CN201980040354.XA CN112292867A (en) 2019-11-28 2019-11-28 Movable platform, terminal device, control method thereof and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/121766 WO2021102855A1 (en) 2019-11-28 2019-11-28 Mobile platform, terminal device and control method therefor, and storage medium

Publications (1)

Publication Number Publication Date
WO2021102855A1 true WO2021102855A1 (en) 2021-06-03

Family

ID=74419423

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/121766 WO2021102855A1 (en) 2019-11-28 2019-11-28 Mobile platform, terminal device and control method therefor, and storage medium

Country Status (2)

Country Link
CN (1) CN112292867A (en)
WO (1) WO2021102855A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107065863A (en) * 2017-03-13 2017-08-18 山东大学 A kind of guide to visitors based on face recognition technology explains robot and method
CN107225574A (en) * 2017-07-21 2017-10-03 哈尔滨雷掣科技有限责任公司 A kind of domestic robot system based on mobile terminal
CN108942952A (en) * 2018-04-23 2018-12-07 杨水祥 A kind of medical robot
CN208538475U (en) * 2018-07-13 2019-02-22 深圳市优必选科技有限公司 A kind of intelligent robot
CN110039553A (en) * 2019-04-09 2019-07-23 苏州晨本智能科技有限公司 A kind of robot system of drinking with a guest based on voice training
CN110164542A (en) * 2018-01-18 2019-08-23 汤庆佳 A kind of intelligent human-body monitor system based on virtual reality

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016154868A1 (en) * 2015-03-31 2016-10-06 深圳市大疆创新科技有限公司 Flight system, aircraft, sound apparatus and sound processing method
CN110209957A (en) * 2019-06-06 2019-09-06 北京猎户星空科技有限公司 Explanation method, apparatus, equipment and storage medium based on intelligent robot

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107065863A (en) * 2017-03-13 2017-08-18 山东大学 A kind of guide to visitors based on face recognition technology explains robot and method
CN107225574A (en) * 2017-07-21 2017-10-03 哈尔滨雷掣科技有限责任公司 A kind of domestic robot system based on mobile terminal
CN110164542A (en) * 2018-01-18 2019-08-23 汤庆佳 A kind of intelligent human-body monitor system based on virtual reality
CN108942952A (en) * 2018-04-23 2018-12-07 杨水祥 A kind of medical robot
CN208538475U (en) * 2018-07-13 2019-02-22 深圳市优必选科技有限公司 A kind of intelligent robot
CN110039553A (en) * 2019-04-09 2019-07-23 苏州晨本智能科技有限公司 A kind of robot system of drinking with a guest based on voice training

Also Published As

Publication number Publication date
CN112292867A (en) 2021-01-29

Similar Documents

Publication Publication Date Title
US20210158821A1 (en) Image display apparatus and method of controlling the same
EP3163748B1 (en) Method, device and terminal for adjusting volume
CN109361865B (en) Shooting method and terminal
WO2017113937A1 (en) Mobile terminal and noise reduction method
CN108924412B (en) Shooting method and terminal equipment
EP3660660A1 (en) Processing method for sound effect of recording and mobile terminal
JP2012040655A (en) Method for controlling robot, program, and robot
CN111370018B (en) Audio data processing method, electronic device and medium
CN108513067B (en) Shooting control method and mobile terminal
US10733762B2 (en) Dynamically calibrating a depth sensor
CN109473097B (en) Intelligent voice equipment and control method thereof
CN108156378A (en) Photographic method, mobile terminal and computer readable storage medium
CN113132863A (en) Stereo pickup method, apparatus, terminal device, and computer-readable storage medium
CN111417053B (en) Sound pickup volume control method, sound pickup volume control device and storage medium
CN111917980A (en) Photographing control method and device, storage medium and electronic equipment
CN111741511A (en) Quick matching method and head-mounted electronic equipment
CN111447365B (en) Shooting method and electronic equipment
WO2014189005A1 (en) Musical-performance recording system, musical-performance recording method, and musical instrument
CN108737934A (en) A kind of intelligent sound box and its control method
WO2019147034A1 (en) Electronic device for controlling sound and operation method therefor
CN110881105A (en) Shooting method and electronic equipment
WO2021102855A1 (en) Mobile platform, terminal device and control method therefor, and storage medium
CN108600623B (en) Refocusing display method and terminal device
CN111049972A (en) Audio playing method and terminal equipment
JP6835205B2 (en) Shooting sound pickup device, sound pick-up control system, shooting sound pick-up device control method, and shooting sound pick-up control system control method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19954426

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19954426

Country of ref document: EP

Kind code of ref document: A1