CN114566163B

CN114566163B - Public transport voice processing method, device, system, electronic equipment and medium

Info

Publication number: CN114566163B
Application number: CN202210170319.3A
Authority: CN
Inventors: 赵丁漫; 严军; 邓秋雄; 杨征宇; 拜正斌; 刘杰
Original assignee: Chengdu Zhiyuanhui Information Technology Co Ltd
Current assignee: Chengdu Zhiyuanhui Information Technology Co Ltd
Priority date: 2022-02-23
Filing date: 2022-02-23
Publication date: 2023-05-02
Anticipated expiration: 2042-02-23
Also published as: CN114566163A

Abstract

The invention discloses a public transportation voice processing method, which comprises the following steps: s1: acquiring confirmation information of a voice equipment terminal to be played; s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction; s3: acquiring current voice input in real time according to the voice real-time issuing instruction; s4: performing standardized processing on the current voice input in real time according to a voice algorithm; s5: and sending a voice playing instruction and the voice subjected to standardized processing to a voice equipment terminal to be played. The invention carries out standardized processing on the audio content broadcasted by different staff through the processing of the voice algorithm after pickup, so that the broadcasted audio voice and speech speed are kept consistent, standardized management on content broadcasting is realized for public traffic managers, and passengers feel specialization of public traffic through standardized broadcasting.

Description

Public transport voice processing method, device, system, electronic equipment and medium

Technical Field

The invention relates to the technical field of communication, in particular to a public transportation voice processing method, a device, a system, electronic equipment and a medium.

Background

As shown in fig. 1, the broadcasting mode of the voice broadcasting in the public transportation scene is single, the staff can only broadcast the voice at all stations, the voice evacuation cannot be performed according to the actual situation, the scene is limited, the management is inconvenient for the manager according to the actual situation, in addition, the voice broadcasting system does not process the audio, the content of the voice broadcasting is directly broadcasted after the speaking of the staff is finished, the speed of the voice of each person is different, the spread of the broadcasted content, such as speed, affects the overall image of the public transportation, and therefore the public transportation voice processing method, the device, the system, the electronic equipment and the medium are significant.

Disclosure of Invention

The invention aims to provide a main control system based on public transportation audio self-adaption so as to solve the problems in the background technology.

In order to solve the technical problems, the invention adopts the following scheme:

a public transportation voice processing method, comprising the steps of:

s1: acquiring confirmation information of a voice equipment terminal to be played;

s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;

s3: acquiring current voice input in real time according to the voice real-time issuing instruction;

s4: performing standardized processing on the current voice input in real time according to a voice algorithm;

s5: and sending a voice playing instruction and the voice subjected to standardized processing to a device terminal to be played.

Further, the voice algorithm is as follows:

presetting a reference speech speed V0;

based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;

and determining the double speed M of the current real-time input voice after the standardization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the voice after the standardization processing is the current real-time input voice of the double speed M.

Further, the preset reference speech speed V0 is 240-300 words/min.

Further, the preset reference speech rate may be determined according to a preset voice.

Further, the preset voice includes a duration T and a word number W, and a preset reference voice velocity v0=w/T.

Further, when the S2 is executed, the voice issuing instruction is a preset broadcasting table issuing instruction;

if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and presetting voices in a playing list to a device terminal to be played.

The invention also provides a public transportation voice processing device, which comprises a three-dimensional model, wherein the three-dimensional model comprises a device confirmation module, a voice instruction acquisition module, a pickup module, a standardization module and a issuing module;

the device confirmation module is used for confirming the device terminal of the voice to be played;

the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;

the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;

the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;

and the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the equipment terminal for playing the voice.

The invention also provides a public transportation voice processing system, which comprises the processing device and a plurality of voice equipment terminals, wherein the voice equipment terminals are in communication connection with the three-dimensional model.

The invention also provides an electronic device for public transportation voice processing, which comprises:

one or more processors;

and a storage unit for storing one or more programs, which when executed by the one or more processors, enable the one or more processors to implement the aforementioned public transportation voice processing method.

The present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, enables the aforementioned public transportation voice processing method to be implemented.

The invention has the beneficial effects that:

1. according to the invention, the playing equipment of the region can be selected in the three-dimensional model to accurately broadcast according to the region needing to be evacuated or the region needing to be played, so that the total station broadcasting is not needed, and management by a manager according to actual conditions is facilitated;

2. the invention carries out standardized processing on the audio content broadcasted by different staff through the processing of the voice algorithm after pickup, so that the broadcasted audio voice and speech speed are kept consistent, standardized management on content broadcasting is realized for public traffic managers, and passengers feel specialization of public traffic through standardized broadcasting.

Drawings

FIG. 1 is a workflow diagram of the background art;

FIG. 2 is a schematic representation of a three-dimensional model of the present invention;

FIG. 3 is a block diagram of the structure of the present invention;

FIG. 4 is a workflow diagram of user selection of voice real-time delivery;

fig. 5 is a flowchart of the user selecting a preset playlist to issue.

Detailed Description

The present invention will be described in further detail with reference to examples and drawings, but embodiments of the present invention are not limited thereto.

In the description of the present invention, it should be noted that, directions or positional relationships indicated by terms such as "center", "upper", "lower", "left", "right", "vertical", "longitudinal", "lateral", "horizontal", "inner", "outer", "front", "rear", "top", "bottom", etc., are directions or positional relationships based on those shown in the drawings, or are directions or positional relationships conventionally put in use of the inventive product, are merely for convenience of describing the present invention and for simplifying the description, and are not indicative or implying that the apparatus or element to be referred to must have a specific direction, be constructed and operated in a specific direction, and therefore should not be construed as limiting the present invention.

In the description of the present invention, it should also be noted that, unless explicitly specified and limited otherwise, the terms "disposed," "configured," "mounted," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention will be understood in specific cases by those of ordinary skill in the art.

Examples

As shown in fig. 3, the execution subject in the present embodiment may be a touch display device terminal including a three-dimensional model, and the voice processing method includes the steps of:

the obtaining of the confirmation information of the to-be-played voice equipment terminal may be that the user selects the equipment icon of the to-be-played voice equipment terminal in the three-dimensional model through touching, the equipment icon of the voice equipment terminal is a A, B, C … … G exit sign with voice broadcasting function or other equipment icons with voice broadcasting function as shown in fig. 2, or the user selects the equipment icon of the to-be-played voice in the three-dimensional model through mouse click.

the obtaining of the voice issuing instruction can be obtained by touching a device icon which is long in demand and is required to issue voice or double clicking a device icon which is required to issue voice to call out a voice issuing function after the user selects the device which is required to issue voice content in the three-dimensional model, and real-time voice broadcasting is selected according to the user demand, so that the voice real-time issuing instruction is obtained.

the current voice input in real time can be acquired by calling a pickup device or a pickup function in a three-dimensional model to pick up voice after the user selects real-time voice broadcasting.

the normalization processing is to normalize the tone and the speed of the voice after the pickup of the user is finished through a voice algorithm, so that the audio played after the delivery can be kept consistent.

Based on the technical scheme, the voice algorithm provided by the invention is as follows: firstly, determining a preset reference speech speed V0, and setting the preset reference speech speed V0 to 240-300 words/min; secondly, based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time; and determining the double speed M of the current real-time input voice after the normalization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the normalized voice is the current real-time input voice of the double speed M.

The preset reference speech speed can be set according to preset speech, the preset speech source can be standard broadcasting recorded by staff, the preset speech comprises a duration T and a word number W, and then the preset reference speech speed V0 = W/T.

In addition, the acquired voice issuing instruction can also be an issuing instruction for a preset broadcasting table; if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and voice in a preset playing list to a device terminal for playing the voice, wherein the voice in the preset playing list is recorded audio subjected to standardized processing.

the equipment confirming module is used for acquiring the confirmation information of the voice equipment terminal to be played;

The invention also provides a public transportation voice processing system, which comprises the processing device and a plurality of equipment playing terminals, wherein the equipment playing terminals are in communication connection with the three-dimensional model, and the equipment playing terminals in each area correspond to the equipment icons on the three-dimensional model.

The following describes the above embodiments through specific application scenarios:

when a worker needs to perform voice broadcasting in the G area as shown in fig. 2, the voice equipment icon in the G area is selected through touch of the touch screen, the issuing function is called out in a long-time mode, the issuing function comprises preset broadcasting list issuing and voice real-time issuing, the worker selects voice real-time issuing, the pickup function in the three-dimensional model is called for voice pickup, voice of the worker is standardized through a voice algorithm after pickup is completed, and after issuing is confirmed, the voice equipment terminal in the G area is issued.

The invention also provides an electronic device for public transportation voice processing, comprising: one or more processors; and a storage unit for storing one or more programs, which when executed by the one or more processors, enable the one or more processors to implement the aforementioned public transportation voice processing method.

The foregoing description of the preferred embodiment of the invention is not intended to limit the invention in any way, but rather to cover all modifications, equivalents, improvements and alternatives falling within the spirit and principles of the invention.

Claims

1. A public transportation voice processing method, comprising the steps of:

s1: confirming address information of a voice equipment terminal to be played;

s5: issuing a voice playing instruction and a standardized voice to a voice equipment terminal to be played;

the voice algorithm is as follows:

presetting a reference speech speed V0;

2. The public transportation voice processing method according to claim 1, wherein the preset reference voice velocity V0 is 240-300 words/min.

3. The public transportation voice processing method according to claim 1, wherein the preset reference speech rate is determined according to a preset voice.

4. A public transportation voice processing method according to claim 3, wherein the preset voice includes a duration T and a word number W, and the preset reference voice velocity v0=w/T.

5. The public transportation voice processing method according to claim 1, further comprising executing S2, wherein the voice issuing instruction is a preset playlist issuing instruction;

6. A public transportation speech processing device, characterized in that: the device comprises a device confirmation module, a voice command acquisition module, a pickup module, a standardization module, a issuing module and a three-dimensional model window module, wherein the three-dimensional model window module comprises address information of a voice device terminal to be played;

the voice algorithm is as follows:

presetting a reference speech speed V0;

determining a doubling speed M after the current real-time input voice is subjected to standardization processing according to V0 and V1, wherein the doubling speed M=V0/V1, namely the standardized voice is the current real-time input voice with the doubling speed M;

the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the voice equipment terminal to be played.

7. A public transportation speech processing system, characterized by: comprising the processing device of claim 6 and a number of terminals of the voice equipment to be played.

8. An electronic device for public transportation voice processing, comprising:

one or more processors;

a storage unit for storing one or more programs which, when executed by the one or more processors, enable the one or more processors to implement a public transportation speech processing method according to any one of claims 1 to 5.

9. A computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, is capable of realizing a public transportation speech processing method according to any one of claims 1 to 5.