CN110797024A - VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display
- Publication number
- CN110797024A
- Authority
- CN
- China
- Prior art keywords
- voice
- voice information
- unit
- system based
- display
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/45—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/278—Subtitling
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Quality & Reliability (AREA)
- Telephonic Communication Services (AREA)
Abstract
The invention discloses a VHF maritime safety communication system based on voice recognition and subtitle display, comprising: a voice preprocessing unit that frames the received voice information, windows it, and then applies a fast Fourier transform to the windowed voice signal and caches the result; a storage unit that receives the processed voice information from the voice preprocessing unit and stores the voice in real time; and a voice recognition unit that receives the processed voice information from the voice preprocessing unit, converts it into text, and transmits the text to a display unit for display. Through voice recognition, the system converts the audio being played into the corresponding text and shows it as subtitles on a subtitle display screen, so that the user receives the audio and the text at the same time, which enriches the content of maritime communication.
Description
Technical Field
The invention relates to the technical field of maritime communication, and in particular to a VHF maritime safety communication system based on voice recognition and subtitle display.
Background
Very high frequency (VHF) communication is an important component of maritime radio communication. At present, the VHF radiotelephones fitted on GMDSS vessels generally provide only radiotelephone and DSC communication functions. They are two-way communication systems with a transmit frequency range of 156.025 MHz to 157.425 MHz and a receive frequency range of 156.025 MHz to 162.025 MHz, and the communication content is a voice signal.
At present, communication between ships in coastal waters, and between ship and shore, relies mainly on voice communication over VHF equipment, and the communication content is voice alone. Because voice is transient and the marine navigation environment is noisy, a user of existing VHF equipment can easily fail to hear a message in full. This is extremely unfavorable in situations that call for a timely collision-avoidance response, and it endangers the safety of maritime traffic.
Disclosure of Invention
In view of the problems in the prior art, the invention discloses a VHF maritime safety communication system based on voice recognition and subtitle display. The specific scheme comprises:
a microphone for receiving the audio information played on the vessel;
a voice preprocessing unit for receiving the voice information transmitted by the microphone, framing and then windowing the received voice information, and performing a fast Fourier transform on the windowed voice signal and caching the result;
a storage unit for receiving the processed voice information transmitted by the voice preprocessing unit and storing the voice in real time;
a voice recognition unit for receiving the processed voice information transmitted by the voice preprocessing unit and converting it into text through feature extraction and pattern matching;
and a display unit, to which the voice recognition unit transmits the text for display.
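The scheme leaves the feature set and the matching method open. As a minimal sketch of the simplest form such a feature-extraction and matching step could take, the following assumes each utterance has already been reduced to a fixed-length feature vector and picks the stored template closest to it. The names `distance`, `nearest_template` and `TEMPLATES`, the vectors, and the phrases are all hypothetical and not taken from the patent; a practical recognizer would be far more elaborate.

```python
import math
from typing import Dict, Sequence


def distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_template(features: Sequence[float],
                     templates: Dict[str, Sequence[float]]) -> str:
    """Return the text label of the stored template closest to `features`."""
    if not templates:
        raise ValueError("no templates to match against")
    return min(templates, key=lambda text: distance(features, templates[text]))


# Hypothetical templates: text label -> reference feature vector.
TEMPLATES = {
    "alter course to starboard": [0.9, 0.1, 0.3],
    "reduce speed": [0.2, 0.8, 0.5],
}

# The text that would be sent on to the display unit.
caption = nearest_template([0.85, 0.15, 0.25], TEMPLATES)
print(caption)
```

Nearest-template matching is used here only because it is short enough to read at a glance; it stands in for whatever matching method an actual implementation would use.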
Further, let the framed and windowed signal be $\tilde{x}_i(n)$. Performing a fast Fourier transform on it gives the spectrum at the k-th frequency point of the i-th frame signal:

$$X(i,k) = \sum_{n=0}^{N-1} \tilde{x}_i(n)\, e^{-j 2\pi nk/N} = X_R(i,k) + j\,X_I(i,k), \quad k = 0, 1, \ldots, N-1$$

where $X_R(i,k)$ is the real part and $X_I(i,k)$ is the imaginary part of the k-th frequency point of the i-th frame signal. The corresponding squared-magnitude (power) spectrum is

$$G(i,k) = X_R(i,k)^2 + X_I(i,k)^2, \quad k = 0, 1, \ldots, N-1 \tag{4}$$

Each incoming frame is windowed, and its amplitude spectrum and power spectrum are calculated and cached.
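The text mentions both an amplitude spectrum and a power spectrum, while equation (4) gives only the sum of squares. Assuming the usual definitions (the source does not spell out the amplitude expression at this point), the two are related by:

$$|X(i,k)| = \sqrt{X_R(i,k)^2 + X_I(i,k)^2} = \sqrt{G(i,k)}, \quad k = 0, 1, \ldots, N-1$$

so caching $G(i,k)$ is enough to recover the amplitude spectrum with one square root per frequency point.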
Further, the microphone performs denoising and howling (acoustic feedback) elimination on the received voice information.
With the above technical scheme, the VHF maritime safety communication system based on voice recognition and subtitle display gains a voice playback function: by selecting playback mode, the user can replay historical call records and confirm speech that was not fully understood. Voice recognition converts the audio being played into the corresponding text, which is shown as subtitles on the subtitle display screen, so the user obtains the text while receiving the audio. This enriches the content of maritime communication, and combining voice with text improves the accuracy with which the user understands maritime traffic information. It prevents the user from failing to act in time on information such as ship collision avoidance because a transient voice message was not heard in full, and so better safeguards maritime traffic safety. In practical application, the system has two display screens, a main display screen and a subtitle display screen, which show the working information and the subtitles for the audio respectively, giving the user a better experience.
Drawings
To illustrate the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings needed in the description of the embodiments or the prior art are briefly introduced below. The drawings described below are only some embodiments of the present application; a person skilled in the art can obtain other drawings from them without creative effort.
FIG. 1 is a schematic diagram of the system of the present disclosure.
Detailed Description
To make the technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings:
As shown in FIG. 1, a VHF maritime safety communication system based on voice recognition and subtitle display comprises a microphone, a voice preprocessing unit, a storage unit, a voice recognition unit and a display unit. The microphone receives the audio information played on the vessel and performs denoising and howling elimination on the voice. The voice preprocessing unit frames the received voice information, windows it, and then applies a fast Fourier transform to the windowed voice signal and caches the result. The storage unit stores the cached voice in real time. The voice recognition unit converts the received voice information into text through feature extraction and pattern recognition, and the text is shown on the display unit.
Further, when the fast Fourier transform is applied to the voice signal, let the framed and windowed signal be $\tilde{x}_i(n)$. The transform gives the spectrum at the k-th frequency point of the i-th frame signal:

$$X(i,k) = \sum_{n=0}^{N-1} \tilde{x}_i(n)\, e^{-j 2\pi nk/N} = X_R(i,k) + j\,X_I(i,k), \quad k = 0, 1, \ldots, N-1$$

where $X_R(i,k)$ is the real part and $X_I(i,k)$ is the imaginary part of the k-th frequency point of the i-th frame signal. The corresponding squared-magnitude (power) spectrum is

$$G(i,k) = X_R(i,k)^2 + X_I(i,k)^2, \quad k = 0, 1, \ldots, N-1 \tag{4}$$

Each incoming frame is windowed, and its amplitude spectrum and power spectrum are calculated and cached.
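The framing, windowing, transform and caching steps described above can be sketched in a few lines of standard-library Python. This is an illustration under stated assumptions, not the patented implementation: the patent does not name a window function, frame length or frame overlap, so the Hamming window, `frame_len=8` and `hop=4` are placeholders, and the function names are hypothetical. The radix-2 routine requires the frame length to be a power of two.

```python
import cmath
import math


def hamming(n_samples):
    """Hamming window coefficients (the window type is an assumption)."""
    if n_samples == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (n_samples - 1))
            for n in range(n_samples)]


def frames(signal, frame_len, hop):
    """Split samples into overlapping frames; a trailing partial frame is dropped."""
    return [signal[start:start + frame_len]
            for start in range(0, len(signal) - frame_len + 1, hop)]


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def power_spectra(signal, frame_len=8, hop=4):
    """Return the cached G(i, k) = X_R(i, k)^2 + X_I(i, k)^2 per frame i and bin k."""
    window = hamming(frame_len)
    cache = []
    for frame in frames(signal, frame_len, hop):
        windowed = [s * w for s, w in zip(frame, window)]
        spectrum = fft(windowed)
        cache.append([c.real ** 2 + c.imag ** 2 for c in spectrum])
    return cache
```

For example, `power_spectra(samples, frame_len=256, hop=128)` would return one list of 256 power values for each half-overlapping frame of `samples`; taking the square root of each value gives the amplitude spectrum.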
Because the system has a voice playback function and can convert the audio being played into the corresponding text and show it on the subtitle display screen, information is processed along two paths. On one path, the system stores the received audio so that it can be played back later; on the other, the voice recognition unit converts the received audio into text, which is shown on the subtitle display screen. Voice recognition thus lets the user obtain subtitles while receiving the audio, which enriches the content of maritime communication. Combining voice with text improves the accuracy with which the user understands maritime traffic information, prevents the user from failing to act in time on information such as ship collision avoidance because a transient voice message was not heard in full, and better safeguards maritime traffic safety.
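The two processing paths, storing audio for later playback and converting it to text for the subtitle screen, can be pictured with a small record log. The sketch below is only an illustration: the class and field names are hypothetical, the recognizer is a stand-in callable supplied by the caller, and the fixed capacity is an assumption, since the patent does not describe how call records are organized.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Deque, List


@dataclass
class CallRecord:
    timestamp: float  # seconds, supplied by the caller
    audio: bytes      # stored audio for later playback
    caption: str      # text produced by the recognizer


class CaptionedCallLog:
    """Keeps the most recent calls so they can be replayed with their captions."""

    def __init__(self, recognize: Callable[[bytes], str], capacity: int = 100):
        self._recognize = recognize
        # A bounded deque discards the oldest record once capacity is reached.
        self._records: Deque[CallRecord] = deque(maxlen=capacity)

    def receive(self, timestamp: float, audio: bytes) -> str:
        """Store the audio and return the caption for the subtitle screen."""
        caption = self._recognize(audio)
        self._records.append(CallRecord(timestamp, audio, caption))
        return caption

    def playback(self, last_n: int = 1) -> List[CallRecord]:
        """Return the last_n most recent records, oldest first."""
        if last_n <= 0:
            return []
        return list(self._records)[-last_n:]
```

Keeping the caption next to the stored audio means a replayed call can show the same subtitles again without running recognition a second time, which is one possible design choice rather than something the source specifies.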
The above describes only a preferred embodiment of the present invention, but the scope of protection of the present invention is not limited thereto. Any equivalent substitution or change that a person skilled in the art makes, within the technical scope disclosed by the present invention, according to the technical solution and inventive concept of the present invention shall fall within the scope of protection of the present invention.
Claims (3)
1. A VHF maritime safety communication system based on voice recognition and subtitle display, comprising:
a microphone for receiving the audio information played on the vessel;
a voice preprocessing unit for receiving the voice information transmitted by the microphone, framing and then windowing the received voice information, and performing a fast Fourier transform on the windowed voice signal and caching the result;
a storage unit for receiving the processed voice information transmitted by the voice preprocessing unit and storing the voice in real time;
a voice recognition unit for receiving the processed voice information transmitted by the voice preprocessing unit and converting the received voice information into text;
and a display unit, to which the voice recognition unit transmits the text for display.
2. The VHF maritime safety communication system based on voice recognition and subtitle display as claimed in claim 1, further characterized in that: the framed and windowed signal is denoted $\tilde{x}_i(n)$, and performing a fast Fourier transform on it gives the spectrum at the k-th frequency point of the i-th frame signal:

$$X(i,k) = \sum_{n=0}^{N-1} \tilde{x}_i(n)\, e^{-j 2\pi nk/N} = X_R(i,k) + j\,X_I(i,k), \quad k = 0, 1, \ldots, N-1$$

where $X_R(i,k)$ is the real part and $X_I(i,k)$ is the imaginary part of the k-th frequency point of the i-th frame signal, and the corresponding squared-magnitude (power) spectrum is

$$G(i,k) = X_R(i,k)^2 + X_I(i,k)^2, \quad k = 0, 1, \ldots, N-1 \tag{4}$$

each incoming frame being windowed, and its amplitude spectrum and power spectrum being calculated and cached.
3. The VHF maritime safety communication system based on voice recognition and subtitle display as claimed in claim 1, further characterized in that: the microphone performs denoising and howling elimination on the received voice information.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911083849.9A CN110797024A (en) | 2019-11-07 | 2019-11-07 | VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911083849.9A CN110797024A (en) | 2019-11-07 | 2019-11-07 | VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110797024A (en) | 2020-02-14 |
Family
ID=69443290
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911083849.9A Pending CN110797024A (en) | 2019-11-07 | 2019-11-07 | VHF (very high frequency) maritime safety communication system based on voice recognition and subtitle display |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110797024A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111951794A (en) * | 2020-07-29 | 2020-11-17 | 深圳星标科技股份有限公司 | Ground station automatic response method, ground station automatic response device, computer equipment and storage medium thereof |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040181404A1 (en) * | 2003-03-01 | 2004-09-16 | Shedd Jonathan Elias | Weather radio with speech to text recognition of audio forecast and display summary of weather |
CN1798167A (en) * | 2004-12-31 | 2006-07-05 | 乐金电子(中国)研究开发中心有限公司 | Mobile terminal with noise-identification communication-variation function and method of varying the same |
CN202276337U (en) * | 2011-10-13 | 2012-06-13 | 张明亮 | VHF/UHF digital intelligent emergency receiving terminal |
CN202330723U (en) * | 2011-11-18 | 2012-07-11 | 交通运输部天津水运工程科学研究所 | AIS (automatic identification system) shipborne terminal system based on Beidou satellite navigation |
CN103136978A (en) * | 2013-03-01 | 2013-06-05 | 上海海事大学 | Ship traffic management and ship driving comprehensive imitator system |
US8688092B1 (en) * | 2007-03-26 | 2014-04-01 | Callwave Communications, Llc | Methods and systems for managing telecommunications and for translating voice messages to text messages |
US20150172766A1 (en) * | 2013-12-12 | 2015-06-18 | Samsung Electronics Co., Ltd. | Image display apparatus, method for driving image display apparatus, method for displaying an image, and computer readable recording medium therefor |
CN105577882A (en) * | 2015-05-28 | 2016-05-11 | 东莞酷派软件技术有限公司 | Information display method and user terminal |
CN106385548A (en) * | 2016-09-05 | 2017-02-08 | 努比亚技术有限公司 | Mobile terminal and method for generating video captions |
Non-Patent Citations (1)
Title |
---|
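窦路 (Dou Lu): "Research on data recording and playback of VTS radar signals and VHF signals", China Excellent Doctoral and Master's Theses Full-text Database (Master's), Information Science and Technology Series * |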
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20200214 |