CN110797024A - VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display - Google Patents

VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display Download PDF

Info

Publication number
CN110797024A
CN110797024A CN201911083849.9A CN201911083849A CN110797024A CN 110797024 A CN110797024 A CN 110797024A CN 201911083849 A CN201911083849 A CN 201911083849A CN 110797024 A CN110797024 A CN 110797024A
Authority
CN
China
Prior art keywords
voice
voice information
unit
system based
display
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911083849.9A
Other languages
Chinese (zh)
Inventor
林彬
俞雁韬
崔昆涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian Maritime University
Original Assignee
Dalian Maritime University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian Maritime University filed Critical Dalian Maritime University
Priority to CN201911083849.9A priority Critical patent/CN110797024A/en
Publication of CN110797024A publication Critical patent/CN110797024A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/45Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/278Subtitling

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Quality & Reliability (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a VHF marine safety communication system based on voice recognition and caption display, which comprises: the voice preprocessing unit carries out framing processing on the received voice information, then carries out windowing processing on the received voice information, and then carries out fast Fourier transform and cache processing on the windowed voice signal; a storage unit for receiving the processed voice information transmitted by the voice preprocessing unit and storing the voice in real time; and the voice recognition unit receives the processed voice information transmitted by the voice preprocessing unit, and transmits the characters to the display unit for playback display. The system can convert the audio frequency being played into corresponding characters through a voice recognition technology, and the characters are displayed on a caption display screen in a caption mode, so that a user can receive audio information and obtain character information at the same time, and the content of maritime communication is enriched.

Description

VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display
Technical Field
The invention relates to the technical field of marine communication, in particular to a VHF marine safety communication system based on voice recognition and subtitle display.
Background
Very High Frequency (VHF) communication is an important component of marine radio communication, and currently, VHF radiotelephones equipped for GMDSS marine vessels generally only have radiotelephone and DSC communication functions, and are two-way communication systems, with a transmission communication frequency range of 156.025MHz to 157.425MHz, a reception communication frequency range of 156.025MHz to 162.025MHz, and voice signals as communication contents.
At present, communication between ships on the near sea surface and between ships and banks is mainly realized through voice communication of VHF communication equipment, communication contents are single voice signals, and due to the fact that voice information has the characteristic of instantaneity and certain noise interference exists in the sea navigation environment, a user is easy to have the situation that the voice information is not completely heard when the user uses the existing VHF equipment for communication, and the situation that corresponding ship collision avoiding scenes need to be made in time is extremely unfavorable, and therefore the safety of sea traffic is endangered.
Disclosure of Invention
According to the problems existing in the prior art, the invention discloses a VHF marine safety communication system based on voice recognition and caption display, which comprises the following specific schemes:
a microphone for receiving audio information played on the marine vessel;
the voice preprocessing unit is used for receiving the voice information transmitted by the loudspeaker, performing framing processing on the received voice information and then windowing processing on the voice information, and performing fast Fourier transform and cache processing on a windowed voice signal;
a storage unit for receiving the processed voice information transmitted by the voice preprocessing unit and storing the voice in real time;
the voice recognition unit receives the processed voice information transmitted by the voice preprocessing unit, and converts the received voice information into a character form through feature extraction and mode matching;
the voice recognition unit transmits the characters to the display unit for playback display.
Further, the signal after frame windowing is set as
Figure BDA0002264780010000011
The frequency spectrum of the kth frequency point of the ith frame signal obtained by performing fast Fourier transform on the ith frame signal is as follows:
Figure BDA0002264780010000021
Figure BDA0002264780010000022
has an amplitude spectrum of
Figure BDA0002264780010000023
XR(i, k) is the real part of the k frequency point of the ith frame signal, XIAnd (i, k) is the imaginary part of the k frequency point of the ith frame signal.
Figure BDA0002264780010000024
Has a power spectrum of
G(i,k)=[XR(i,k)2+XI(i,k)2],k=0,1,…,N-1 (4)
Figure BDA0002264780010000025
Total power of
Figure BDA0002264780010000026
And performing frame windowing on the incoming signals, calculating the amplitude spectrum and the power spectrum and caching the amplitude spectrum and the power spectrum.
Further, the loudspeaker carries out denoising and howling elimination processing on the received voice information.
Due to the adoption of the technical scheme, the VHF marine safety communication system based on voice recognition and caption display is additionally provided with a voice playback function, so that a user can conveniently play historical call records by selecting a playback mode, and the user can conveniently confirm the incompletely understood voice; the audio which is being played can be converted into corresponding characters through a voice recognition technology, and the characters are displayed on a caption display screen in a caption mode, so that a user can obtain character information while receiving audio information, the content of marine communication is enriched, the accuracy of understanding marine traffic information by the user is improved in a mode of combining voice and characters, the situation that the user does not timely process information such as ship collision avoidance and the like because the user does not completely hear transient voice information is prevented, and the marine traffic safety is better guaranteed; in the practical application process, two display screens, namely a main display screen and a subtitle display screen, are designed on the system, and work information and the audio corresponding subtitles are respectively displayed, so that better use experience is provided for users.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments described in the present application, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a schematic diagram of the system of the present disclosure;
Detailed Description
In order to make the technical solutions and advantages of the present invention clearer, the following describes the technical solutions in the embodiments of the present invention clearly and completely with reference to the drawings in the embodiments of the present invention:
a VHF maritime secure communication system based on speech recognition and caption display as shown in fig. 1, comprising: the device comprises a loudspeaker, a voice preprocessing unit, a storage unit, a voice recognition unit and a display unit. The microphone is used for receiving audio information played on a marine vessel and carrying out denoising and howling elimination processing on the voice. The voice preprocessing unit carries out framing processing on the received voice information, then carries out windowing processing on the received voice information, and then carries out fast Fourier transform and cache processing on the windowed voice signal. The storage module stores the cached voice in real time, and the voice recognition unit converts the received voice information into a character form through feature extraction and mode recognition and displays the character form on the display unit.
Further, when performing fast fourier transform on a speech signal, setting the signals after framing and windowing as x to (n), and performing fast fourier transform on the signals to obtain the frequency spectrum of the kth frequency point of the ith frame signal as:
Figure BDA0002264780010000032
has an amplitude spectrum of
XR(i, k) is the real part of the k frequency point of the ith frame signal, XIAnd (i, k) is the imaginary part of the k frequency point of the ith frame signal.
Figure BDA0002264780010000034
Has a power spectrum of
G(i,k)=[XR(i,k)2+XI(i,k)2],k=0,1,…,N-1 (4)
Figure BDA0002264780010000035
Total power of
And performing frame windowing on each frame of incoming signals, calculating a magnitude spectrum and a power spectrum, and caching.
Because the system has the voice playback function and can convert the audio frequency being played into corresponding characters and display the characters on the caption display screen, the information processing process comprises the following steps that on one hand, the system stores the received audio frequency and then plays the audio frequency for subsequent playback, and on the other hand, the received audio frequency is converted into the characters through the voice recognition unit and displayed on the caption display screen; the voice recognition technology enables a user to obtain subtitle information when receiving audio information, enriches the content of marine communication, improves the accuracy of understanding marine traffic information by the user in a mode of combining voice and characters, prevents the user from not processing information such as ship collision avoidance and the like in time because the user does not completely hear transient voice information, and better guarantees the marine traffic safety.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art should be considered to be within the technical scope of the present invention, and the technical solutions and the inventive concepts thereof according to the present invention should be equivalent or changed within the scope of the present invention.

Claims (3)

1. A VHF maritime secure communication system based on speech recognition and caption display, comprising:
a microphone for receiving audio information played on the marine vessel;
the voice preprocessing unit is used for receiving the voice information transmitted by the loudspeaker, performing framing processing on the received voice information and then windowing processing on the voice information, and performing fast Fourier transform and cache processing on a windowed voice signal;
a storage unit for receiving the processed voice information transmitted by the voice preprocessing unit and storing the voice in real time;
the voice recognition unit is used for receiving the processed voice information transmitted by the voice preprocessing unit and converting the received voice information into a character form;
the voice recognition unit transmits the characters to the display unit for playback display.
2. A VHF maritime security communication system based on speech recognition and caption display as claimed in claim 1, further characterized in that: setting the signal after windowing as
Figure FDA0002264778000000011
The frequency spectrum of the kth frequency point of the ith frame signal obtained by performing fast Fourier transform on the ith frame signal is as follows:
Figure FDA0002264778000000013
has an amplitude spectrum of
Figure FDA0002264778000000014
XR(i, k) is the real part of the k frequency point of the ith frame signal, XIAnd (i, k) is the imaginary part of the k frequency point of the ith frame signal.
Figure FDA0002264778000000015
Has a power spectrum of
G(i,k)=[XR(i,k)2+XI(i,k)2],k=0,1,…,N-1 (4)
Figure FDA0002264778000000016
Total power of
Figure FDA0002264778000000021
And performing frame windowing on the incoming signals, calculating the amplitude spectrum and the power spectrum and caching the amplitude spectrum and the power spectrum.
3. A VHF maritime security communication system based on speech recognition and caption display as claimed in claim 1, further characterized in that: and the loudspeaker carries out denoising and howling elimination processing on the received voice information.
CN201911083849.9A 2019-11-07 2019-11-07 VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display Pending CN110797024A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911083849.9A CN110797024A (en) 2019-11-07 2019-11-07 VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911083849.9A CN110797024A (en) 2019-11-07 2019-11-07 VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display

Publications (1)

Publication Number Publication Date
CN110797024A true CN110797024A (en) 2020-02-14

Family

ID=69443290

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911083849.9A Pending CN110797024A (en) 2019-11-07 2019-11-07 VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display

Country Status (1)

Country Link
CN (1) CN110797024A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111951794A (en) * 2020-07-29 2020-11-17 深圳星标科技股份有限公司 Ground station automatic response method, ground station automatic response device, computer equipment and storage medium thereof

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040181404A1 (en) * 2003-03-01 2004-09-16 Shedd Jonathan Elias Weather radio with speech to text recognition of audio forecast and display summary of weather
CN1798167A (en) * 2004-12-31 2006-07-05 乐金电子(中国)研究开发中心有限公司 Mobile terminal with noise-identification communication-variation function and method of varying the same
CN202276337U (en) * 2011-10-13 2012-06-13 张明亮 VHF/UHF digital intelligent emergency receiving terminal
CN202330723U (en) * 2011-11-18 2012-07-11 交通运输部天津水运工程科学研究所 AIS (automatic identification system) shipborne terminal system based on Beidou satellite navigation
CN103136978A (en) * 2013-03-01 2013-06-05 上海海事大学 Ship traffic management and ship driving comprehensive imitator system
US8688092B1 (en) * 2007-03-26 2014-04-01 Callwave Communications, Llc Methods and systems for managing telecommunications and for translating voice messages to text messages
US20150172766A1 (en) * 2013-12-12 2015-06-18 Samsung Electronics Co., Ltd. Image display apparatus, method for driving image display apparatus, method for displaying an image, and computer readable recording medium therefor
CN105577882A (en) * 2015-05-28 2016-05-11 东莞酷派软件技术有限公司 Information display method and user terminal
CN106385548A (en) * 2016-09-05 2017-02-08 努比亚技术有限公司 Mobile terminal and method for generating video captions

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040181404A1 (en) * 2003-03-01 2004-09-16 Shedd Jonathan Elias Weather radio with speech to text recognition of audio forecast and display summary of weather
CN1798167A (en) * 2004-12-31 2006-07-05 乐金电子(中国)研究开发中心有限公司 Mobile terminal with noise-identification communication-variation function and method of varying the same
US8688092B1 (en) * 2007-03-26 2014-04-01 Callwave Communications, Llc Methods and systems for managing telecommunications and for translating voice messages to text messages
CN202276337U (en) * 2011-10-13 2012-06-13 张明亮 VHF/UHF digital intelligent emergency receiving terminal
CN202330723U (en) * 2011-11-18 2012-07-11 交通运输部天津水运工程科学研究所 AIS (automatic identification system) shipborne terminal system based on Beidou satellite navigation
CN103136978A (en) * 2013-03-01 2013-06-05 上海海事大学 Ship traffic management and ship driving comprehensive imitator system
US20150172766A1 (en) * 2013-12-12 2015-06-18 Samsung Electronics Co., Ltd. Image display apparatus, method for driving image display apparatus, method for displaying an image, and computer readable recording medium therefor
CN105577882A (en) * 2015-05-28 2016-05-11 东莞酷派软件技术有限公司 Information display method and user terminal
CN106385548A (en) * 2016-09-05 2017-02-08 努比亚技术有限公司 Mobile terminal and method for generating video captions

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
窦路: "VTS雷达信号及VHF信号数据记录与回放的研究", 《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111951794A (en) * 2020-07-29 2020-11-17 深圳星标科技股份有限公司 Ground station automatic response method, ground station automatic response device, computer equipment and storage medium thereof

Similar Documents

Publication Publication Date Title
US4351062A (en) Method and apparatus for suppressing digital error noise in digital communication
US8515748B2 (en) Mobile phone communication gap recovery
JPS59210758A (en) Digital hand-free telephone set
MY132748A (en) Apparatus for speech-based generation, audio translation, and manipulation of text messages over voice lines.
CN107527623A (en) Screen transmission method, device, electronic equipment and computer-readable recording medium
CN106569773A (en) Terminal and voice interaction processing method
EP1397796A1 (en) Speech quality indication
CN110797024A (en) VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display
CN110389743A (en) Car audio system and vehicle
CN104981870A (en) Speech enhancement device
CN104505096A (en) Method and device using music to transmit hidden information
CN106685575B (en) A kind of device for preventing from being eavesdropped using mobile phone
JPH0946233A (en) Sound encoding method/device and sound decoding method/ device
CN103366757A (en) Communication system and method with echo cancellation mechanism
CN106160687A (en) A kind of volume adjustment device and method, relevant device
US5602913A (en) Robust double-talk detection
US20030028379A1 (en) System for converting electronic content to a transmittable signal and transmitting the resulting signal
KR102607120B1 (en) Sound data noise canceling method and apparatus, electronic device , computer readable storage medium and computer program
CN107967919A (en) Eliminate the method, device and mobile terminal of TDD noises
GB1593835A (en) Reduction of adjacent channel interference in stereo receivers
CN205028649U (en) Ware is sheltered to multichannel sound
US20140372111A1 (en) Voice recognition enhancement
CN105791937A (en) Audio/video processing method and related equipment
JPS6384216A (en) Voice/data multiplexer
CN210634505U (en) Vehicle-mounted machine system and vehicle-mounted entertainment system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200214

RJ01 Rejection of invention patent application after publication