CN114566163B - Public transport voice processing method, device, system, electronic equipment and medium - Google Patents
Public transport voice processing method, device, system, electronic equipment and medium Download PDFInfo
- Publication number
- CN114566163B CN114566163B CN202210170319.3A CN202210170319A CN114566163B CN 114566163 B CN114566163 B CN 114566163B CN 202210170319 A CN202210170319 A CN 202210170319A CN 114566163 B CN114566163 B CN 114566163B
- Authority
- CN
- China
- Prior art keywords
- voice
- issuing
- real
- instruction
- processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H20/00—Arrangements for broadcast or for distribution combined with broadcast
- H04H20/53—Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers
- H04H20/61—Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers for local area broadcast, e.g. instore broadcast
- H04H20/62—Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers for local area broadcast, e.g. instore broadcast for transportation systems, e.g. in vehicles
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Abstract
The invention discloses a public transportation voice processing method, which comprises the following steps: s1: acquiring confirmation information of a voice equipment terminal to be played; s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction; s3: acquiring current voice input in real time according to the voice real-time issuing instruction; s4: performing standardized processing on the current voice input in real time according to a voice algorithm; s5: and sending a voice playing instruction and the voice subjected to standardized processing to a voice equipment terminal to be played. The invention carries out standardized processing on the audio content broadcasted by different staff through the processing of the voice algorithm after pickup, so that the broadcasted audio voice and speech speed are kept consistent, standardized management on content broadcasting is realized for public traffic managers, and passengers feel specialization of public traffic through standardized broadcasting.
Description
Technical Field
The invention relates to the technical field of communication, in particular to a public transportation voice processing method, a device, a system, electronic equipment and a medium.
Background
As shown in fig. 1, the broadcasting mode of the voice broadcasting in the public transportation scene is single, the staff can only broadcast the voice at all stations, the voice evacuation cannot be performed according to the actual situation, the scene is limited, the management is inconvenient for the manager according to the actual situation, in addition, the voice broadcasting system does not process the audio, the content of the voice broadcasting is directly broadcasted after the speaking of the staff is finished, the speed of the voice of each person is different, the spread of the broadcasted content, such as speed, affects the overall image of the public transportation, and therefore the public transportation voice processing method, the device, the system, the electronic equipment and the medium are significant.
Disclosure of Invention
The invention aims to provide a main control system based on public transportation audio self-adaption so as to solve the problems in the background technology.
In order to solve the technical problems, the invention adopts the following scheme:
a public transportation voice processing method, comprising the steps of:
s1: acquiring confirmation information of a voice equipment terminal to be played;
s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;
s3: acquiring current voice input in real time according to the voice real-time issuing instruction;
s4: performing standardized processing on the current voice input in real time according to a voice algorithm;
s5: and sending a voice playing instruction and the voice subjected to standardized processing to a device terminal to be played.
Further, the voice algorithm is as follows:
presetting a reference speech speed V0;
based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;
and determining the double speed M of the current real-time input voice after the standardization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the voice after the standardization processing is the current real-time input voice of the double speed M.
Further, the preset reference speech speed V0 is 240-300 words/min.
Further, the preset reference speech rate may be determined according to a preset voice.
Further, the preset voice includes a duration T and a word number W, and a preset reference voice velocity v0=w/T.
Further, when the S2 is executed, the voice issuing instruction is a preset broadcasting table issuing instruction;
if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and presetting voices in a playing list to a device terminal to be played.
The invention also provides a public transportation voice processing device, which comprises a three-dimensional model, wherein the three-dimensional model comprises a device confirmation module, a voice instruction acquisition module, a pickup module, a standardization module and a issuing module;
the device confirmation module is used for confirming the device terminal of the voice to be played;
the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;
the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;
the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;
and the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the equipment terminal for playing the voice.
The invention also provides a public transportation voice processing system, which comprises the processing device and a plurality of voice equipment terminals, wherein the voice equipment terminals are in communication connection with the three-dimensional model.
The invention also provides an electronic device for public transportation voice processing, which comprises:
one or more processors;
and a storage unit for storing one or more programs, which when executed by the one or more processors, enable the one or more processors to implement the aforementioned public transportation voice processing method.
The present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, enables the aforementioned public transportation voice processing method to be implemented.
The invention has the beneficial effects that:
1. according to the invention, the playing equipment of the region can be selected in the three-dimensional model to accurately broadcast according to the region needing to be evacuated or the region needing to be played, so that the total station broadcasting is not needed, and management by a manager according to actual conditions is facilitated;
2. the invention carries out standardized processing on the audio content broadcasted by different staff through the processing of the voice algorithm after pickup, so that the broadcasted audio voice and speech speed are kept consistent, standardized management on content broadcasting is realized for public traffic managers, and passengers feel specialization of public traffic through standardized broadcasting.
Drawings
FIG. 1 is a workflow diagram of the background art;
FIG. 2 is a schematic representation of a three-dimensional model of the present invention;
FIG. 3 is a block diagram of the structure of the present invention;
FIG. 4 is a workflow diagram of user selection of voice real-time delivery;
fig. 5 is a flowchart of the user selecting a preset playlist to issue.
Detailed Description
The present invention will be described in further detail with reference to examples and drawings, but embodiments of the present invention are not limited thereto.
In the description of the present invention, it should be noted that, directions or positional relationships indicated by terms such as "center", "upper", "lower", "left", "right", "vertical", "longitudinal", "lateral", "horizontal", "inner", "outer", "front", "rear", "top", "bottom", etc., are directions or positional relationships based on those shown in the drawings, or are directions or positional relationships conventionally put in use of the inventive product, are merely for convenience of describing the present invention and for simplifying the description, and are not indicative or implying that the apparatus or element to be referred to must have a specific direction, be constructed and operated in a specific direction, and therefore should not be construed as limiting the present invention.
In the description of the present invention, it should also be noted that, unless explicitly specified and limited otherwise, the terms "disposed," "configured," "mounted," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention will be understood in specific cases by those of ordinary skill in the art.
Examples
As shown in fig. 3, the execution subject in the present embodiment may be a touch display device terminal including a three-dimensional model, and the voice processing method includes the steps of:
s1: acquiring confirmation information of a voice equipment terminal to be played;
the obtaining of the confirmation information of the to-be-played voice equipment terminal may be that the user selects the equipment icon of the to-be-played voice equipment terminal in the three-dimensional model through touching, the equipment icon of the voice equipment terminal is a A, B, C … … G exit sign with voice broadcasting function or other equipment icons with voice broadcasting function as shown in fig. 2, or the user selects the equipment icon of the to-be-played voice in the three-dimensional model through mouse click.
S2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;
the obtaining of the voice issuing instruction can be obtained by touching a device icon which is long in demand and is required to issue voice or double clicking a device icon which is required to issue voice to call out a voice issuing function after the user selects the device which is required to issue voice content in the three-dimensional model, and real-time voice broadcasting is selected according to the user demand, so that the voice real-time issuing instruction is obtained.
S3: acquiring current voice input in real time according to the voice real-time issuing instruction;
the current voice input in real time can be acquired by calling a pickup device or a pickup function in a three-dimensional model to pick up voice after the user selects real-time voice broadcasting.
S4: performing standardized processing on the current voice input in real time according to a voice algorithm;
the normalization processing is to normalize the tone and the speed of the voice after the pickup of the user is finished through a voice algorithm, so that the audio played after the delivery can be kept consistent.
S5: and sending a voice playing instruction and the voice subjected to standardized processing to a device terminal to be played.
Based on the technical scheme, the voice algorithm provided by the invention is as follows: firstly, determining a preset reference speech speed V0, and setting the preset reference speech speed V0 to 240-300 words/min; secondly, based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time; and determining the double speed M of the current real-time input voice after the normalization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the normalized voice is the current real-time input voice of the double speed M.
The preset reference speech speed can be set according to preset speech, the preset speech source can be standard broadcasting recorded by staff, the preset speech comprises a duration T and a word number W, and then the preset reference speech speed V0 = W/T.
In addition, the acquired voice issuing instruction can also be an issuing instruction for a preset broadcasting table; if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and voice in a preset playing list to a device terminal for playing the voice, wherein the voice in the preset playing list is recorded audio subjected to standardized processing.
The invention also provides a public transportation voice processing device, which comprises a three-dimensional model, wherein the three-dimensional model comprises a device confirmation module, a voice instruction acquisition module, a pickup module, a standardization module and a issuing module;
the equipment confirming module is used for acquiring the confirmation information of the voice equipment terminal to be played;
the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;
the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;
the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;
and the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the equipment terminal for playing the voice.
The invention also provides a public transportation voice processing system, which comprises the processing device and a plurality of equipment playing terminals, wherein the equipment playing terminals are in communication connection with the three-dimensional model, and the equipment playing terminals in each area correspond to the equipment icons on the three-dimensional model.
The following describes the above embodiments through specific application scenarios:
when a worker needs to perform voice broadcasting in the G area as shown in fig. 2, the voice equipment icon in the G area is selected through touch of the touch screen, the issuing function is called out in a long-time mode, the issuing function comprises preset broadcasting list issuing and voice real-time issuing, the worker selects voice real-time issuing, the pickup function in the three-dimensional model is called for voice pickup, voice of the worker is standardized through a voice algorithm after pickup is completed, and after issuing is confirmed, the voice equipment terminal in the G area is issued.
The invention also provides an electronic device for public transportation voice processing, comprising: one or more processors; and a storage unit for storing one or more programs, which when executed by the one or more processors, enable the one or more processors to implement the aforementioned public transportation voice processing method.
The present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, enables the aforementioned public transportation voice processing method to be implemented.
The foregoing description of the preferred embodiment of the invention is not intended to limit the invention in any way, but rather to cover all modifications, equivalents, improvements and alternatives falling within the spirit and principles of the invention.
Claims (9)
1. A public transportation voice processing method, comprising the steps of:
s1: confirming address information of a voice equipment terminal to be played;
s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;
s3: acquiring current voice input in real time according to the voice real-time issuing instruction;
s4: performing standardized processing on the current voice input in real time according to a voice algorithm;
s5: issuing a voice playing instruction and a standardized voice to a voice equipment terminal to be played;
the voice algorithm is as follows:
presetting a reference speech speed V0;
based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;
and determining the double speed M of the current real-time input voice after the standardization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the voice after the standardization processing is the current real-time input voice of the double speed M.
2. The public transportation voice processing method according to claim 1, wherein the preset reference voice velocity V0 is 240-300 words/min.
3. The public transportation voice processing method according to claim 1, wherein the preset reference speech rate is determined according to a preset voice.
4. A public transportation voice processing method according to claim 3, wherein the preset voice includes a duration T and a word number W, and the preset reference voice velocity v0=w/T.
5. The public transportation voice processing method according to claim 1, further comprising executing S2, wherein the voice issuing instruction is a preset playlist issuing instruction;
if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and presetting voices in a playing list to a device terminal to be played.
6. A public transportation speech processing device, characterized in that: the device comprises a device confirmation module, a voice command acquisition module, a pickup module, a standardization module, a issuing module and a three-dimensional model window module, wherein the three-dimensional model window module comprises address information of a voice device terminal to be played;
the equipment confirming module is used for acquiring the confirmation information of the voice equipment terminal to be played;
the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;
the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;
the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;
the voice algorithm is as follows:
presetting a reference speech speed V0;
based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;
determining a doubling speed M after the current real-time input voice is subjected to standardization processing according to V0 and V1, wherein the doubling speed M=V0/V1, namely the standardized voice is the current real-time input voice with the doubling speed M;
the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the voice equipment terminal to be played.
7. A public transportation speech processing system, characterized by: comprising the processing device of claim 6 and a number of terminals of the voice equipment to be played.
8. An electronic device for public transportation voice processing, comprising:
one or more processors;
a storage unit for storing one or more programs which, when executed by the one or more processors, enable the one or more processors to implement a public transportation speech processing method according to any one of claims 1 to 5.
9. A computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, is capable of realizing a public transportation speech processing method according to any one of claims 1 to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210170319.3A CN114566163B (en) | 2022-02-23 | 2022-02-23 | Public transport voice processing method, device, system, electronic equipment and medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210170319.3A CN114566163B (en) | 2022-02-23 | 2022-02-23 | Public transport voice processing method, device, system, electronic equipment and medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114566163A CN114566163A (en) | 2022-05-31 |
CN114566163B true CN114566163B (en) | 2023-05-02 |
Family
ID=81714096
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210170319.3A Active CN114566163B (en) | 2022-02-23 | 2022-02-23 | Public transport voice processing method, device, system, electronic equipment and medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114566163B (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101834940A (en) * | 2010-03-23 | 2010-09-15 | 中兴通讯股份有限公司 | Control method of voice service and voice service system |
CN104301399A (en) * | 2014-09-28 | 2015-01-21 | 深圳市星盘科技有限公司 | System and method for remotely controlling loudspeaker box through voice |
CN204390492U (en) * | 2015-01-13 | 2015-06-10 | 深圳市京华信息技术有限公司 | A kind of car networking traffic information broadcasting terminals and broadcasting system |
CN107205095A (en) * | 2017-07-25 | 2017-09-26 | 广东欧珀移动通信有限公司 | Player method, device and the terminal of voice messaging |
CN107682240A (en) * | 2017-09-27 | 2018-02-09 | 四川长虹电器股份有限公司 | A kind of distributed sound interactive system for intelligent domestic |
CN108563208A (en) * | 2018-06-28 | 2018-09-21 | 马雷明 | Intelligent domestic system and its control method |
CN111128159A (en) * | 2019-12-18 | 2020-05-08 | 上海智勘科技有限公司 | Method and system for realizing multi-channel message distribution of intelligent loudspeaker box |
CN113079201A (en) * | 2019-04-11 | 2021-07-06 | 创新先进技术有限公司 | Information processing system, method, device and equipment |
-
2022
- 2022-02-23 CN CN202210170319.3A patent/CN114566163B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101834940A (en) * | 2010-03-23 | 2010-09-15 | 中兴通讯股份有限公司 | Control method of voice service and voice service system |
CN104301399A (en) * | 2014-09-28 | 2015-01-21 | 深圳市星盘科技有限公司 | System and method for remotely controlling loudspeaker box through voice |
CN204390492U (en) * | 2015-01-13 | 2015-06-10 | 深圳市京华信息技术有限公司 | A kind of car networking traffic information broadcasting terminals and broadcasting system |
CN107205095A (en) * | 2017-07-25 | 2017-09-26 | 广东欧珀移动通信有限公司 | Player method, device and the terminal of voice messaging |
CN107682240A (en) * | 2017-09-27 | 2018-02-09 | 四川长虹电器股份有限公司 | A kind of distributed sound interactive system for intelligent domestic |
CN108563208A (en) * | 2018-06-28 | 2018-09-21 | 马雷明 | Intelligent domestic system and its control method |
CN113079201A (en) * | 2019-04-11 | 2021-07-06 | 创新先进技术有限公司 | Information processing system, method, device and equipment |
CN111128159A (en) * | 2019-12-18 | 2020-05-08 | 上海智勘科技有限公司 | Method and system for realizing multi-channel message distribution of intelligent loudspeaker box |
Also Published As
Publication number | Publication date |
---|---|
CN114566163A (en) | 2022-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8126155B2 (en) | Remote audio device management system | |
CN109658932B (en) | Equipment control method, device, equipment and medium | |
US20120197770A1 (en) | System and method for real time text streaming | |
CN106254311A (en) | Live broadcasting method and device, live data streams methods of exhibiting and device | |
CN112653902B (en) | Speaker recognition method and device and electronic equipment | |
CN113132787A (en) | Live content display method and device, electronic equipment and storage medium | |
EP2993860A1 (en) | Method, apparatus, and system for presenting communication information in video communication | |
WO2019071808A1 (en) | Video image display method, apparatus and system, terminal device, and storage medium | |
CN110769189B (en) | Video conference switching method and device and readable storage medium | |
CN108903521B (en) | Man-machine interaction method applied to intelligent picture frame and intelligent picture frame | |
CN112383809A (en) | Subtitle display method, device and storage medium | |
CN112711366A (en) | Image generation method and device and electronic equipment | |
US20180255163A1 (en) | Automatically delaying playback of a message | |
CN114566163B (en) | Public transport voice processing method, device, system, electronic equipment and medium | |
CN114227702A (en) | Intelligent conference guiding method and device based on robot and robot | |
CN112702468A (en) | Call control method and device | |
CN108418979B (en) | Telephone traffic continuation prompting method and device, computer equipment and storage medium | |
CN113891168B (en) | Subtitle processing method, subtitle processing device, electronic equipment and storage medium | |
CN113490136B (en) | Sound information processing method and device, computer storage medium and electronic equipment | |
WO2023216119A1 (en) | Audio signal encoding method and apparatus, electronic device and storage medium | |
CN113452853B (en) | Voice interaction method and device, electronic equipment and storage medium | |
CN106657312A (en) | Remote management method, apparatus and system | |
CN114594892A (en) | Remote interaction method, remote interaction device and computer storage medium | |
JP2007074443A (en) | Incoming side communication terminal, outgoing side communication terminal, server device, call back system, call back reservation system, call back control method and call back control program | |
CN209002108U (en) | Distribution infrastructure project Field Monitoring System |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |