CN114566163B - Public transport voice processing method, device, system, electronic equipment and medium - Google Patents

Public transport voice processing method, device, system, electronic equipment and medium Download PDF

Info

Publication number
CN114566163B
CN114566163B CN202210170319.3A CN202210170319A CN114566163B CN 114566163 B CN114566163 B CN 114566163B CN 202210170319 A CN202210170319 A CN 202210170319A CN 114566163 B CN114566163 B CN 114566163B
Authority
CN
China
Prior art keywords
voice
issuing
real
instruction
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210170319.3A
Other languages
Chinese (zh)
Other versions
CN114566163A (en
Inventor
赵丁漫
严军
邓秋雄
杨征宇
拜正斌
刘杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Zhiyuanhui Information Technology Co Ltd
Original Assignee
Chengdu Zhiyuanhui Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Zhiyuanhui Information Technology Co Ltd filed Critical Chengdu Zhiyuanhui Information Technology Co Ltd
Priority to CN202210170319.3A priority Critical patent/CN114566163B/en
Publication of CN114566163A publication Critical patent/CN114566163A/en
Application granted granted Critical
Publication of CN114566163B publication Critical patent/CN114566163B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H20/00Arrangements for broadcast or for distribution combined with broadcast
    • H04H20/53Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers
    • H04H20/61Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers for local area broadcast, e.g. instore broadcast
    • H04H20/62Arrangements specially adapted for specific applications, e.g. for traffic information or for mobile receivers for local area broadcast, e.g. instore broadcast for transportation systems, e.g. in vehicles
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Abstract

The invention discloses a public transportation voice processing method, which comprises the following steps: s1: acquiring confirmation information of a voice equipment terminal to be played; s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction; s3: acquiring current voice input in real time according to the voice real-time issuing instruction; s4: performing standardized processing on the current voice input in real time according to a voice algorithm; s5: and sending a voice playing instruction and the voice subjected to standardized processing to a voice equipment terminal to be played. The invention carries out standardized processing on the audio content broadcasted by different staff through the processing of the voice algorithm after pickup, so that the broadcasted audio voice and speech speed are kept consistent, standardized management on content broadcasting is realized for public traffic managers, and passengers feel specialization of public traffic through standardized broadcasting.

Description

Public transport voice processing method, device, system, electronic equipment and medium
Technical Field
The invention relates to the technical field of communication, in particular to a public transportation voice processing method, a device, a system, electronic equipment and a medium.
Background
As shown in fig. 1, the broadcasting mode of the voice broadcasting in the public transportation scene is single, the staff can only broadcast the voice at all stations, the voice evacuation cannot be performed according to the actual situation, the scene is limited, the management is inconvenient for the manager according to the actual situation, in addition, the voice broadcasting system does not process the audio, the content of the voice broadcasting is directly broadcasted after the speaking of the staff is finished, the speed of the voice of each person is different, the spread of the broadcasted content, such as speed, affects the overall image of the public transportation, and therefore the public transportation voice processing method, the device, the system, the electronic equipment and the medium are significant.
Disclosure of Invention
The invention aims to provide a main control system based on public transportation audio self-adaption so as to solve the problems in the background technology.
In order to solve the technical problems, the invention adopts the following scheme:
a public transportation voice processing method, comprising the steps of:
s1: acquiring confirmation information of a voice equipment terminal to be played;
s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;
s3: acquiring current voice input in real time according to the voice real-time issuing instruction;
s4: performing standardized processing on the current voice input in real time according to a voice algorithm;
s5: and sending a voice playing instruction and the voice subjected to standardized processing to a device terminal to be played.
Further, the voice algorithm is as follows:
presetting a reference speech speed V0;
based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;
and determining the double speed M of the current real-time input voice after the standardization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the voice after the standardization processing is the current real-time input voice of the double speed M.
Further, the preset reference speech speed V0 is 240-300 words/min.
Further, the preset reference speech rate may be determined according to a preset voice.
Further, the preset voice includes a duration T and a word number W, and a preset reference voice velocity v0=w/T.
Further, when the S2 is executed, the voice issuing instruction is a preset broadcasting table issuing instruction;
if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and presetting voices in a playing list to a device terminal to be played.
The invention also provides a public transportation voice processing device, which comprises a three-dimensional model, wherein the three-dimensional model comprises a device confirmation module, a voice instruction acquisition module, a pickup module, a standardization module and a issuing module;
the device confirmation module is used for confirming the device terminal of the voice to be played;
the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;
the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;
the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;
and the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the equipment terminal for playing the voice.
The invention also provides a public transportation voice processing system, which comprises the processing device and a plurality of voice equipment terminals, wherein the voice equipment terminals are in communication connection with the three-dimensional model.
The invention also provides an electronic device for public transportation voice processing, which comprises:
one or more processors;
and a storage unit for storing one or more programs, which when executed by the one or more processors, enable the one or more processors to implement the aforementioned public transportation voice processing method.
The present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, enables the aforementioned public transportation voice processing method to be implemented.
The invention has the beneficial effects that:
1. according to the invention, the playing equipment of the region can be selected in the three-dimensional model to accurately broadcast according to the region needing to be evacuated or the region needing to be played, so that the total station broadcasting is not needed, and management by a manager according to actual conditions is facilitated;
2. the invention carries out standardized processing on the audio content broadcasted by different staff through the processing of the voice algorithm after pickup, so that the broadcasted audio voice and speech speed are kept consistent, standardized management on content broadcasting is realized for public traffic managers, and passengers feel specialization of public traffic through standardized broadcasting.
Drawings
FIG. 1 is a workflow diagram of the background art;
FIG. 2 is a schematic representation of a three-dimensional model of the present invention;
FIG. 3 is a block diagram of the structure of the present invention;
FIG. 4 is a workflow diagram of user selection of voice real-time delivery;
fig. 5 is a flowchart of the user selecting a preset playlist to issue.
Detailed Description
The present invention will be described in further detail with reference to examples and drawings, but embodiments of the present invention are not limited thereto.
In the description of the present invention, it should be noted that, directions or positional relationships indicated by terms such as "center", "upper", "lower", "left", "right", "vertical", "longitudinal", "lateral", "horizontal", "inner", "outer", "front", "rear", "top", "bottom", etc., are directions or positional relationships based on those shown in the drawings, or are directions or positional relationships conventionally put in use of the inventive product, are merely for convenience of describing the present invention and for simplifying the description, and are not indicative or implying that the apparatus or element to be referred to must have a specific direction, be constructed and operated in a specific direction, and therefore should not be construed as limiting the present invention.
In the description of the present invention, it should also be noted that, unless explicitly specified and limited otherwise, the terms "disposed," "configured," "mounted," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention will be understood in specific cases by those of ordinary skill in the art.
Examples
As shown in fig. 3, the execution subject in the present embodiment may be a touch display device terminal including a three-dimensional model, and the voice processing method includes the steps of:
s1: acquiring confirmation information of a voice equipment terminal to be played;
the obtaining of the confirmation information of the to-be-played voice equipment terminal may be that the user selects the equipment icon of the to-be-played voice equipment terminal in the three-dimensional model through touching, the equipment icon of the voice equipment terminal is a A, B, C … … G exit sign with voice broadcasting function or other equipment icons with voice broadcasting function as shown in fig. 2, or the user selects the equipment icon of the to-be-played voice in the three-dimensional model through mouse click.
S2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;
the obtaining of the voice issuing instruction can be obtained by touching a device icon which is long in demand and is required to issue voice or double clicking a device icon which is required to issue voice to call out a voice issuing function after the user selects the device which is required to issue voice content in the three-dimensional model, and real-time voice broadcasting is selected according to the user demand, so that the voice real-time issuing instruction is obtained.
S3: acquiring current voice input in real time according to the voice real-time issuing instruction;
the current voice input in real time can be acquired by calling a pickup device or a pickup function in a three-dimensional model to pick up voice after the user selects real-time voice broadcasting.
S4: performing standardized processing on the current voice input in real time according to a voice algorithm;
the normalization processing is to normalize the tone and the speed of the voice after the pickup of the user is finished through a voice algorithm, so that the audio played after the delivery can be kept consistent.
S5: and sending a voice playing instruction and the voice subjected to standardized processing to a device terminal to be played.
Based on the technical scheme, the voice algorithm provided by the invention is as follows: firstly, determining a preset reference speech speed V0, and setting the preset reference speech speed V0 to 240-300 words/min; secondly, based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time; and determining the double speed M of the current real-time input voice after the normalization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the normalized voice is the current real-time input voice of the double speed M.
The preset reference speech speed can be set according to preset speech, the preset speech source can be standard broadcasting recorded by staff, the preset speech comprises a duration T and a word number W, and then the preset reference speech speed V0 = W/T.
In addition, the acquired voice issuing instruction can also be an issuing instruction for a preset broadcasting table; if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and voice in a preset playing list to a device terminal for playing the voice, wherein the voice in the preset playing list is recorded audio subjected to standardized processing.
The invention also provides a public transportation voice processing device, which comprises a three-dimensional model, wherein the three-dimensional model comprises a device confirmation module, a voice instruction acquisition module, a pickup module, a standardization module and a issuing module;
the equipment confirming module is used for acquiring the confirmation information of the voice equipment terminal to be played;
the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;
the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;
the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;
and the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the equipment terminal for playing the voice.
The invention also provides a public transportation voice processing system, which comprises the processing device and a plurality of equipment playing terminals, wherein the equipment playing terminals are in communication connection with the three-dimensional model, and the equipment playing terminals in each area correspond to the equipment icons on the three-dimensional model.
The following describes the above embodiments through specific application scenarios:
when a worker needs to perform voice broadcasting in the G area as shown in fig. 2, the voice equipment icon in the G area is selected through touch of the touch screen, the issuing function is called out in a long-time mode, the issuing function comprises preset broadcasting list issuing and voice real-time issuing, the worker selects voice real-time issuing, the pickup function in the three-dimensional model is called for voice pickup, voice of the worker is standardized through a voice algorithm after pickup is completed, and after issuing is confirmed, the voice equipment terminal in the G area is issued.
The invention also provides an electronic device for public transportation voice processing, comprising: one or more processors; and a storage unit for storing one or more programs, which when executed by the one or more processors, enable the one or more processors to implement the aforementioned public transportation voice processing method.
The present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, enables the aforementioned public transportation voice processing method to be implemented.
The foregoing description of the preferred embodiment of the invention is not intended to limit the invention in any way, but rather to cover all modifications, equivalents, improvements and alternatives falling within the spirit and principles of the invention.

Claims (9)

1. A public transportation voice processing method, comprising the steps of:
s1: confirming address information of a voice equipment terminal to be played;
s2: acquiring a voice issuing instruction, wherein the voice issuing instruction is a voice real-time issuing instruction;
s3: acquiring current voice input in real time according to the voice real-time issuing instruction;
s4: performing standardized processing on the current voice input in real time according to a voice algorithm;
s5: issuing a voice playing instruction and a standardized voice to a voice equipment terminal to be played;
the voice algorithm is as follows:
presetting a reference speech speed V0;
based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;
and determining the double speed M of the current real-time input voice after the standardization processing according to V0 and V1, wherein the double speed M=V0/V1, namely the voice after the standardization processing is the current real-time input voice of the double speed M.
2. The public transportation voice processing method according to claim 1, wherein the preset reference voice velocity V0 is 240-300 words/min.
3. The public transportation voice processing method according to claim 1, wherein the preset reference speech rate is determined according to a preset voice.
4. A public transportation voice processing method according to claim 3, wherein the preset voice includes a duration T and a word number W, and the preset reference voice velocity v0=w/T.
5. The public transportation voice processing method according to claim 1, further comprising executing S2, wherein the voice issuing instruction is a preset playlist issuing instruction;
if the acquired voice issuing instruction is the preset broadcasting table issuing instruction, executing Sn: and sending a voice playing instruction and presetting voices in a playing list to a device terminal to be played.
6. A public transportation speech processing device, characterized in that: the device comprises a device confirmation module, a voice command acquisition module, a pickup module, a standardization module, a issuing module and a three-dimensional model window module, wherein the three-dimensional model window module comprises address information of a voice device terminal to be played;
the equipment confirming module is used for acquiring the confirmation information of the voice equipment terminal to be played;
the voice command acquisition module is used for acquiring voice issuing commands, wherein the voice issuing commands comprise voice real-time issuing commands and preset broadcasting table issuing commands;
the sound pickup module is used for sending an instruction in real time according to the voice to acquire the voice input in real time currently;
the normalization module is used for performing normalization processing on the current voice input in real time according to a voice algorithm;
the voice algorithm is as follows:
presetting a reference speech speed V0;
based on the current voice input in real time, obtaining the voice speed V1 of the current voice input in real time;
determining a doubling speed M after the current real-time input voice is subjected to standardization processing according to V0 and V1, wherein the doubling speed M=V0/V1, namely the standardized voice is the current real-time input voice with the doubling speed M;
the issuing module is used for issuing a voice playing instruction and the standardized voice or the voice in the preset playing list to the voice equipment terminal to be played.
7. A public transportation speech processing system, characterized by: comprising the processing device of claim 6 and a number of terminals of the voice equipment to be played.
8. An electronic device for public transportation voice processing, comprising:
one or more processors;
a storage unit for storing one or more programs which, when executed by the one or more processors, enable the one or more processors to implement a public transportation speech processing method according to any one of claims 1 to 5.
9. A computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, is capable of realizing a public transportation speech processing method according to any one of claims 1 to 5.
CN202210170319.3A 2022-02-23 2022-02-23 Public transport voice processing method, device, system, electronic equipment and medium Active CN114566163B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210170319.3A CN114566163B (en) 2022-02-23 2022-02-23 Public transport voice processing method, device, system, electronic equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210170319.3A CN114566163B (en) 2022-02-23 2022-02-23 Public transport voice processing method, device, system, electronic equipment and medium

Publications (2)

Publication Number Publication Date
CN114566163A CN114566163A (en) 2022-05-31
CN114566163B true CN114566163B (en) 2023-05-02

Family

ID=81714096

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210170319.3A Active CN114566163B (en) 2022-02-23 2022-02-23 Public transport voice processing method, device, system, electronic equipment and medium

Country Status (1)

Country Link
CN (1) CN114566163B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101834940A (en) * 2010-03-23 2010-09-15 中兴通讯股份有限公司 Control method of voice service and voice service system
CN104301399A (en) * 2014-09-28 2015-01-21 深圳市星盘科技有限公司 System and method for remotely controlling loudspeaker box through voice
CN204390492U (en) * 2015-01-13 2015-06-10 深圳市京华信息技术有限公司 A kind of car networking traffic information broadcasting terminals and broadcasting system
CN107205095A (en) * 2017-07-25 2017-09-26 广东欧珀移动通信有限公司 Player method, device and the terminal of voice messaging
CN107682240A (en) * 2017-09-27 2018-02-09 四川长虹电器股份有限公司 A kind of distributed sound interactive system for intelligent domestic
CN108563208A (en) * 2018-06-28 2018-09-21 马雷明 Intelligent domestic system and its control method
CN111128159A (en) * 2019-12-18 2020-05-08 上海智勘科技有限公司 Method and system for realizing multi-channel message distribution of intelligent loudspeaker box
CN113079201A (en) * 2019-04-11 2021-07-06 创新先进技术有限公司 Information processing system, method, device and equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101834940A (en) * 2010-03-23 2010-09-15 中兴通讯股份有限公司 Control method of voice service and voice service system
CN104301399A (en) * 2014-09-28 2015-01-21 深圳市星盘科技有限公司 System and method for remotely controlling loudspeaker box through voice
CN204390492U (en) * 2015-01-13 2015-06-10 深圳市京华信息技术有限公司 A kind of car networking traffic information broadcasting terminals and broadcasting system
CN107205095A (en) * 2017-07-25 2017-09-26 广东欧珀移动通信有限公司 Player method, device and the terminal of voice messaging
CN107682240A (en) * 2017-09-27 2018-02-09 四川长虹电器股份有限公司 A kind of distributed sound interactive system for intelligent domestic
CN108563208A (en) * 2018-06-28 2018-09-21 马雷明 Intelligent domestic system and its control method
CN113079201A (en) * 2019-04-11 2021-07-06 创新先进技术有限公司 Information processing system, method, device and equipment
CN111128159A (en) * 2019-12-18 2020-05-08 上海智勘科技有限公司 Method and system for realizing multi-channel message distribution of intelligent loudspeaker box

Also Published As

Publication number Publication date
CN114566163A (en) 2022-05-31

Similar Documents

Publication Publication Date Title
US8126155B2 (en) Remote audio device management system
CN109658932B (en) Equipment control method, device, equipment and medium
US20120197770A1 (en) System and method for real time text streaming
CN106254311A (en) Live broadcasting method and device, live data streams methods of exhibiting and device
CN112653902B (en) Speaker recognition method and device and electronic equipment
CN113132787A (en) Live content display method and device, electronic equipment and storage medium
EP2993860A1 (en) Method, apparatus, and system for presenting communication information in video communication
WO2019071808A1 (en) Video image display method, apparatus and system, terminal device, and storage medium
CN110769189B (en) Video conference switching method and device and readable storage medium
CN108903521B (en) Man-machine interaction method applied to intelligent picture frame and intelligent picture frame
CN112383809A (en) Subtitle display method, device and storage medium
CN112711366A (en) Image generation method and device and electronic equipment
US20180255163A1 (en) Automatically delaying playback of a message
CN114566163B (en) Public transport voice processing method, device, system, electronic equipment and medium
CN114227702A (en) Intelligent conference guiding method and device based on robot and robot
CN112702468A (en) Call control method and device
CN108418979B (en) Telephone traffic continuation prompting method and device, computer equipment and storage medium
CN113891168B (en) Subtitle processing method, subtitle processing device, electronic equipment and storage medium
CN113490136B (en) Sound information processing method and device, computer storage medium and electronic equipment
WO2023216119A1 (en) Audio signal encoding method and apparatus, electronic device and storage medium
CN113452853B (en) Voice interaction method and device, electronic equipment and storage medium
CN106657312A (en) Remote management method, apparatus and system
CN114594892A (en) Remote interaction method, remote interaction device and computer storage medium
JP2007074443A (en) Incoming side communication terminal, outgoing side communication terminal, server device, call back system, call back reservation system, call back control method and call back control program
CN209002108U (en) Distribution infrastructure project Field Monitoring System

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant