CN105141919A - Monitoring terminal device remotely controlled by voice - Google Patents
Monitoring terminal device remotely controlled by voice Download PDFInfo
- Publication number
- CN105141919A CN105141919A CN201510550089.3A CN201510550089A CN105141919A CN 105141919 A CN105141919 A CN 105141919A CN 201510550089 A CN201510550089 A CN 201510550089A CN 105141919 A CN105141919 A CN 105141919A
- Authority
- CN
- China
- Prior art keywords
- voice
- module
- terminal device
- remote
- monitor terminal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Telephonic Communication Services (AREA)
Abstract
The invention provides a monitoring terminal device remotely controlled by voice. The monitoring terminal device comprises a control and storage module, a sound acquisition and output module, a high-speed image and video acquisition and display module, a wireless communication module and a power supply and management module, wherein the sound acquisition and output module, a high-speed pass voice processing module, the wireless communication module and the power supply and management module are connected with the control and storage module respectively to realize data interaction. The monitoring terminal device is characterized in that local or remote monitoring equipment is controlled on the basis of voice through the control and storage module and the sound acquisition and output module, so that unmanned operation of the monitoring equipment, including startup, video recording and shutdown of the monitoring equipment, is realized, and online, real-time and multi-directional monitoring of a site is realized. Through adoption of the monitoring terminal device, multi-directional, online and real-time site monitoring is realized by a law enforcement officer or a rescue worker, so that management personnel can learn about site information rapidly, comprehensively and systematically.
Description
Technical field
The present invention relates to a kind of supervising device, particularly a kind of monitor terminal device of Voice-remote-control.
Background technology
The subject matter that current monitor terminal device exists has:
1) on-the-spot in law enforcement or emergency relief, the time concerns life and the property of a lot of people, and field condition is very complicated, and manual operation is very inconvenient, and wastes time, and likely can have influence on the quality time of law enforcement or rescue;
2) terminal installation manually starts, and when fortuitous event appears in operating personnel or scene, may ensure that on-site supervision continues to carry out smoothly;
3) site environment very severe sometimes, has very large harm to the health of the person.
In view of this, the monitor terminal device that a kind of Voice-remote-control is provided is necessary, to solve the problem.
Summary of the invention
The object of the invention is: in order to solve prior art Problems existing, thus a kind of monitor terminal device of Voice-remote-control is provided, on the basis of current monitor terminal (comprising recorder, individual soldier etc.), increase Voice-remote-control module, the unlatching to Local or Remote monitor terminal, automatic camera and recording function can be realized, and have selection, timely, fast automaticly upload electronic evidence to command centre, ensure that on duty and administrative staff are fast, comprehensively, system understands field data.
For achieving the above object, the present invention adopts following technical scheme: a kind of monitor terminal device of Voice-remote-control, comprise: control and memory module, sound collection output module, high speed image video acquisition display module, wireless communication module, power supply and administration module, sound collection output module, pass through speech processing module at a high speed, wireless communication module, power supply and administration module are connected with control and memory module and realize data interaction respectively, it is characterized in that, by controlling and memory module and sound collection output module, based on voice, Local or Remote watch-dog is controlled, realize unattended to watch-dog, comprise the unlatching of watch-dog, video recording and closedown, realize carrying out online to scene, in real time, multi-faceted monitoring.
The monitor terminal device of Voice-remote-control as above, is characterized in that, sound collection output module comprises speech processing module, and its algorithm realization comprises dynamic monitoring, voice enhancement algorithm, sound effect algorithms, speech recognition, and voice wake up and Voice command.
The monitor terminal device of Voice-remote-control as above, is characterized in that, the workflow of described speech processing module is:
1) dynamic monitoring: the detection realizing voice signal;
2) voice enhancement algorithm: realize the preliminary treatment to signal, described voice enhancement algorithm mainly comprises data acquisition, wavelet analysis, the enhancing of filtering processed voice, binaryzation, is convenient to follow-up signal and is accurately detected;
3) sound effect algorithms: eliminate the interference of ambient noise, makes the voice signal distortion that detects less;
4) speech recognition: voice signal, after above-mentioned process, identifies the voice signal of Local or Remote;
5) voice wake up: recognize voice signal above-mentioned, are converted to the control command of monitor terminal device.
The monitor terminal device of Voice-remote-control as above, is characterized in that, what described voice woke up realizes principle is, to the voice signal of input, carry out phonetic algorithm process, result is exported to and wakes execution up, it is that voice wake algorithm up that described voice wake key up, and its handling process is:
1. acoustic feature extracts: usually choose the MFCC feature used in speech recognition as acoustic feature;
2. wake word up to detect: will the acoustic feature that obtains be extracted, adopt the acoustic model of training to calculate acoustic score waking up on word Sampling network, comprise the word that will detect in the path that acoustic score has most, then determine to detect to wake word up;
3. wake word up to confirm: the word that wakes up of detection confirms waking up on word network, finally confirmed score, the thresholding of this score and setting judges, final confirm that score is greater than and arranges thresholding and then confirm successfully, otherwise confirm unsuccessfully, confirm that unsuccessfully then coming back to the 1st step extracts acoustic model.
The monitor terminal device of Voice-remote-control as above, is characterized in that, described monitor terminal device, except being connected with high in the clouds, uplink data, to high in the clouds, is browsed outside the data of high in the clouds, also can be wireless interconnected with other law enforcement device, realize speech talkback, video communication and remote monitoring function.
The invention has the beneficial effects as follows: the monitor terminal device of Voice-remote-control of the present invention, staff is controlled Local or Remote wireless monitoring terminal by voice, realizes the automatic unlatching to Local or Remote terminal, automatic video recording the function uploaded in time; Ensure that law enfrocement official and scene are in the unexpected situation of appearance, on-site supervision task is not still interrupted; Simultaneously, replace operating terminal by hand by voice, greatly improve the operating efficiency of monitor terminal, for the valuable time is got in law enforcement or emergency relief, this device all has major application prospect in departments such as public security, law court, fire-fighting, hospital, prisons.
Compared to prior art, tool of the present invention has the following advantages:
1) at law enforcement or emergency relief scene, replace manual operation with voice, not only bring great convenience to operating personnel, also can get the quality time to law enforcement or rescue;
2) when fortuitous event appears in operating personnel or scene, this device automatically can be opened Local or Remote monitor terminal, record a video and uplink data, thus ensures that on-site supervision is not interrupted by the external world;
3) environment very severe at the scene, when having very large harm to the health of the person, it is unattended that this device can realize Local or Remote monitor terminal, guarantees the safety of security personnel's life and property.
Law enfrocement official or rescue personnel utilize this device, realize multi-faceted, online, the real-time monitoring site of law enfrocement official, thus ensure that administrative staff are quick, comprehensive, system understands field data; This is regulate the law enforcement to handle a case or the management process of emergency relief; Strengthen controling effectively to in-situ processing links; Realize the regulation and standardization of law enforcement or rescue work, modernization; Improve implementation quality and work efficiency etc. comprehensively, provide strong instrument.
Accompanying drawing explanation
Fig. 1 is the Voice-remote-control functional hardware connection layout of monitor terminal device of the present invention.
Fig. 2 is the realization flow figure of the speech processing module of monitor terminal device of the present invention.
Embodiment
In order to understand the present invention better, illustrate content of the present invention further below in conjunction with embodiment, but content of the present invention is not only confined to the following examples.Those skilled in the art can make various changes or modifications the present invention, and these equivalent form of values are equally within claims limited range listed by the application.
As shown in Figure 1, this device comprises: control and memory module, wireless communication module, high speed image video acquisition display module, sound collection output module, power supply and administration module.
Control and memory module comprise: the button of management and control individual soldier law enforcement device, camera, screen, microphone, loudspeaker, the modules such as wireless module; Storage device is built-in flash and high speed DDR memory, also can use the SD card of each type; This device also can support USB and all kinds of SD card interface, various transducer (as: gravity, temperature, gyroscope etc.), by USB interface, can support the terminal of USB interface, as USB flash disk, USB interface hard disk, and the camera of USB, USB keyboard and mouse etc.
Wireless communication module can support various radio connection, comprise the support of the wireless network of operator, as supported the GSM/TD-SCDMA/TD-LTE etc. of China Mobile, the GSM/WCDMA/FDD-LTE etc. that supports the GSM/WCDMA/FDD-LTE of CHINAUNICOM etc., support the GSM/CDMA/FDD-LTE of China Telecom etc., support Overseas Carriers respectively.This wireless communication module also supports positioning function, as supported GPS/GLONASS/BEIDOU/GALILEO etc. and realizing positioning function, convenient to position at server end, alert analysis, personal scheduling etc., support satellite and mobile network's calibration, for staff provides evidence record more accurately simultaneously.
High speed image video acquisition display module is responsible for taking pictures or recording a video and interface display, this device supports multiple video recording and display device, 360 degree of pan-shots are carried out by multiple video recording equipment, the high definition photography of 120 frames/second, and continuous vari-focus ability, in conjunction with the algorithm for image enhancement that this device is exclusive, the multiple photos combination of high-speed and continuous being taken obtains the photo of more high definition, this device can also pass through face recognition algorithm, image enhaucament is carried out for facial characteristics, multiple photos is adopted to carry out image co-registration, the photo of final acquisition each side excellent effect, and retain the different picture of one group of effect as a reference.This device also can by various wireline interface external camera or display device, and connected mode includes but not limited to WIFI, CVBS, USB, HDMI etc.
Sound collection output module is the key modules of this device, hardware device comprises: microphone, loudspeaker, and speech processing module, also support many mike systems, this system also supports directed recording audio simultaneously, has the ability eliminating the aspects such as ambient noise, wherein the most key is speech processing module, and its algorithm realization flow chart as shown in Figure 2.Flow chart comprises: voice enhancement algorithm, detection of dynamic and sound effect algorithms, speech recognition, and voice wake up and speech control module etc., hereafter does concrete introduction to implementation method.
Dynamic monitoring: the detection realizing voice signal.Local or Remote voice signal can be caught in scene fast changing at the scene.
Voice enhancement algorithm: realize the preliminary treatment to signal.This algorithm mainly comprises the parts such as data acquisition, wavelet analysis, the enhancing of filtering processed voice, binaryzation, is convenient to follow-up signal and is accurately detected.
Sound effect algorithms: eliminate the interference of ambient noise, makes the voice signal distortion that detects less.
Speech recognition: voice signal, after above-mentioned process, identifies the voice signal of Local or Remote.
Because speech recognition technology developed recently is very fast, technology innovation is very fast, traditional off-line speech recognition technology discrimination and ease for use all very poor, comparatively large by environmental interference during use, substantially can not meet the demand of accurately monitoring.The speech recognition technology of this device, adopt wireless communication technology, upload the data to high in the clouds when local recognition effect is bad and identify, use high in the clouds latest algorithm and database and semantics recognition, realize identification and the understanding of natural-sounding, thus obtain voice signal accurately in time, fast.
Voice wake up: recognize voice signal above-mentioned, are converted to the control command of monitor terminal device.It realizes principle, to the voice signal of input, carries out phonetic algorithm process, result is exported to and wake execution up.The key of this module is that voice wake algorithm up, and its handling process is:
1. acoustic feature extracts: usually choose the MFCC(Mel frequency cepstrum coefficient used in speech recognition) feature is as acoustic feature.
2. wake word up to detect: will the acoustic feature that obtains be extracted, adopt the acoustic model of training to calculate acoustic score waking up on word Sampling network, comprise the word that will detect in the path that acoustic score has most, then determine to detect to wake word up.
3. wake word up to confirm: the word that wakes up of detection confirms waking up on word network, and finally confirmed score, the thresholding of this score and setting judges, final confirmation score is greater than and arranges thresholding and then confirm successfully, otherwise confirms unsuccessfully.Confirm that unsuccessfully then coming back to the 1st step extracts acoustic model.
The monitor terminal device with radio communication function that this device describes, except being connected with high in the clouds, uplink data, to high in the clouds, is browsed outside the data of high in the clouds, also can be wireless interconnected with other law enforcement device, realizes the functions such as speech talkback, video communication, remote monitoring.Bluetooth earphone, wireless camera, wireless display, wireless input/output unit can also be connected, the environment for use of great expanding monitoring terminal installation and function, and in conjunction with special application program, realize various different function.
The originality of this device is embodied in and utilizes speech processing algorithm technology, the combining wireless communication technology, realizes carrying out Voice command to Local or Remote monitor terminal.This for it is pressed for time, bad environments, fast changing monitoring site have important practical significance.
The present invention is that law enfrocement official or rescue personnel utilize this device, realizes multi-faceted, online, the real-time monitoring site of law enfrocement official, thus ensures that administrative staff are quick, comprehensive, system understands field data; This is regulate the law enforcement to handle a case or the management process of emergency relief; Strengthen controling effectively to in-situ processing links; Realize the regulation and standardization of law enforcement or rescue work, modernization; Improve implementation quality and work efficiency etc. comprehensively, provide strong instrument.
The content be not described in detail in this specification belongs to the known prior art of professional and technical personnel in the field.
Claims (5)
1. the monitor terminal device of a Voice-remote-control, comprise: control and memory module, sound collection output module, high speed image video acquisition display module, wireless communication module, power supply and administration module, sound collection output module, pass through speech processing module at a high speed, wireless communication module, power supply and administration module are connected with control and memory module and realize data interaction respectively, it is characterized in that, by controlling and memory module and sound collection output module, based on voice, Local or Remote watch-dog is controlled, realize unattended to watch-dog, comprise the unlatching of watch-dog, video recording and closedown, realize carrying out online to scene, in real time, multi-faceted monitoring.
2. the monitor terminal device of Voice-remote-control according to claim 1, it is characterized in that, sound collection output module comprises speech processing module, and its algorithm realization comprises dynamic monitoring, voice enhancement algorithm, sound effect algorithms, speech recognition, voice wake up and Voice command.
3. the monitor terminal device of Voice-remote-control according to claim 2, is characterized in that, the workflow of described speech processing module is:
1) dynamic monitoring: the detection realizing voice signal;
2) voice enhancement algorithm: realize the preliminary treatment to signal, described voice enhancement algorithm mainly comprises data acquisition, wavelet analysis, the enhancing of filtering processed voice, binaryzation, is convenient to follow-up signal and is accurately detected;
3) sound effect algorithms: eliminate the interference of ambient noise, makes the voice signal distortion that detects less;
4) speech recognition: voice signal, after above-mentioned process, identifies the voice signal of Local or Remote;
5) voice wake up: recognize voice signal above-mentioned, are converted to the control command of monitor terminal device.
4. the monitor terminal device of Voice-remote-control according to claim 3, it is characterized in that, what described voice woke up realizes principle is, to the voice signal of input, carry out phonetic algorithm process, result exported to and wake execution up, it is that voice wake algorithm up that described voice wake key up, and its handling process is:
1. acoustic feature extracts: usually choose the MFCC feature used in speech recognition as acoustic feature;
2. wake word up to detect: will the acoustic feature that obtains be extracted, adopt the acoustic model of training to calculate acoustic score waking up on word Sampling network, comprise the word that will detect in the path that acoustic score has most, then determine to detect to wake word up;
3. wake word up to confirm: the word that wakes up of detection confirms waking up on word network, finally confirmed score, the thresholding of this score and setting judges, final confirm that score is greater than and arranges thresholding and then confirm successfully, otherwise confirm unsuccessfully, confirm that unsuccessfully then coming back to the 1st step extracts acoustic model.
5. the monitor terminal device of Voice-remote-control according to claim 1, it is characterized in that, described monitor terminal device, except being connected with high in the clouds, uplink data is to high in the clouds, browse outside the data of high in the clouds, also can be wireless interconnected with other law enforcement device, realize speech talkback, video communication and remote monitoring function.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510550089.3A CN105141919A (en) | 2015-09-01 | 2015-09-01 | Monitoring terminal device remotely controlled by voice |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510550089.3A CN105141919A (en) | 2015-09-01 | 2015-09-01 | Monitoring terminal device remotely controlled by voice |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105141919A true CN105141919A (en) | 2015-12-09 |
Family
ID=54727115
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510550089.3A Pending CN105141919A (en) | 2015-09-01 | 2015-09-01 | Monitoring terminal device remotely controlled by voice |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105141919A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106297777A (en) * | 2016-08-11 | 2017-01-04 | 广州视源电子科技股份有限公司 | A kind of method and apparatus waking up voice service up |
CN106791689A (en) * | 2017-01-04 | 2017-05-31 | 深圳源诚技术有限公司 | A kind of intelligent back vision mirror multi-cam monitoring system and its implementation |
CN107222711A (en) * | 2017-05-27 | 2017-09-29 | 北方民族大学 | Monitoring system, method and the client of warehoused cargo |
CN108154878A (en) * | 2017-12-12 | 2018-06-12 | 北京小米移动软件有限公司 | Control the method and device of monitoring device |
CN108171951A (en) * | 2018-01-03 | 2018-06-15 | 李文清 | A kind of Intelligent home remote controller based on bluetooth |
CN114327350A (en) * | 2021-12-22 | 2022-04-12 | 合肥德铭电子有限公司 | Voice interaction system based on wireless endoscope camera shooting and use method thereof |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050080620A1 (en) * | 2003-10-09 | 2005-04-14 | General Electric Company | Digitization of work processes using wearable wireless devices capable of vocal command recognition in noisy environments |
CN1703923A (en) * | 2002-10-18 | 2005-11-30 | 中国科学院声学研究所 | Portable digital mobile communication apparatus and voice control method and system thereof |
CN101345668A (en) * | 2008-08-22 | 2009-01-14 | 中兴通讯股份有限公司 | Control method and apparatus for monitoring equipment |
CN101742110A (en) * | 2008-11-10 | 2010-06-16 | 天津三星电子有限公司 | Video camera set by speech recognition system |
CN102999161A (en) * | 2012-11-13 | 2013-03-27 | 安徽科大讯飞信息科技股份有限公司 | Implementation method and application of voice awakening module |
CN204156991U (en) * | 2014-10-11 | 2015-02-11 | 武汉同迅智能科技有限公司 | A kind of law court's executive system with on-site law-enforcing equipment |
-
2015
- 2015-09-01 CN CN201510550089.3A patent/CN105141919A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1703923A (en) * | 2002-10-18 | 2005-11-30 | 中国科学院声学研究所 | Portable digital mobile communication apparatus and voice control method and system thereof |
US20050080620A1 (en) * | 2003-10-09 | 2005-04-14 | General Electric Company | Digitization of work processes using wearable wireless devices capable of vocal command recognition in noisy environments |
CN101345668A (en) * | 2008-08-22 | 2009-01-14 | 中兴通讯股份有限公司 | Control method and apparatus for monitoring equipment |
CN101742110A (en) * | 2008-11-10 | 2010-06-16 | 天津三星电子有限公司 | Video camera set by speech recognition system |
CN102999161A (en) * | 2012-11-13 | 2013-03-27 | 安徽科大讯飞信息科技股份有限公司 | Implementation method and application of voice awakening module |
CN204156991U (en) * | 2014-10-11 | 2015-02-11 | 武汉同迅智能科技有限公司 | A kind of law court's executive system with on-site law-enforcing equipment |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106297777A (en) * | 2016-08-11 | 2017-01-04 | 广州视源电子科技股份有限公司 | A kind of method and apparatus waking up voice service up |
CN106297777B (en) * | 2016-08-11 | 2019-11-22 | 广州视源电子科技股份有限公司 | A kind of method and apparatus waking up voice service |
CN106791689A (en) * | 2017-01-04 | 2017-05-31 | 深圳源诚技术有限公司 | A kind of intelligent back vision mirror multi-cam monitoring system and its implementation |
CN106791689B (en) * | 2017-01-04 | 2020-06-09 | 深圳源诚技术有限公司 | Intelligent rearview mirror multi-camera monitoring system |
CN107222711A (en) * | 2017-05-27 | 2017-09-29 | 北方民族大学 | Monitoring system, method and the client of warehoused cargo |
CN108154878A (en) * | 2017-12-12 | 2018-06-12 | 北京小米移动软件有限公司 | Control the method and device of monitoring device |
CN108171951A (en) * | 2018-01-03 | 2018-06-15 | 李文清 | A kind of Intelligent home remote controller based on bluetooth |
CN114327350A (en) * | 2021-12-22 | 2022-04-12 | 合肥德铭电子有限公司 | Voice interaction system based on wireless endoscope camera shooting and use method thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105141919A (en) | Monitoring terminal device remotely controlled by voice | |
US11625910B2 (en) | Methods and apparatus to operate a mobile camera for low-power usage | |
US11158067B1 (en) | Neighborhood alert mode for triggering multi-device recording, multi-camera locating, and multi-camera event stitching for audio/video recording and communication devices | |
US20180233010A1 (en) | Neighborhood alert mode for triggering multi-device recording, multi-camera motion tracking, and multi-camera event stitching for audio/video recording and communication devices | |
CN204856812U (en) | Automatic warning robot patroles | |
CN205247182U (en) | Intelligent building system | |
CN102081813B (en) | Face identification intelligent safety door | |
CN103167160A (en) | System and method for achieving awakening and unlocking of mobile phone based on technology of temperature sense and human face recognition | |
CN103281223A (en) | Modernized intelligent home security system | |
JP3173008U (en) | Entrance monitoring device | |
US20180151039A1 (en) | Neighborhood Security Cameras | |
CN101472066A (en) | Near-end control method of image viewfinding device and image viewfinding device applying the method | |
CN102509369A (en) | Embedded intelligent entrance guard system | |
CN206115454U (en) | Face identification is dull and stereotyped | |
US11393108B1 (en) | Neighborhood alert mode for triggering multi-device recording, multi-camera locating, and multi-camera event stitching for audio/video recording and communication devices | |
CN109143882A (en) | A kind of control method and device | |
CN207096984U (en) | A kind of chemical illumination immunity analysis instrument inspection data inquiry unit | |
CN202007615U (en) | Face identification intelligent safety door | |
US11032762B1 (en) | Saving power by spoofing a device | |
CN203288020U (en) | Anti-theft system based on face recognition | |
CN212463229U (en) | Management and education conversation integrated terminal equipment | |
CN209199178U (en) | A kind of low-power consumption that human face data can issue networking human face recognition door lock system | |
CN208686432U (en) | It is a kind of based on wireless portable emergency rescue system | |
CN202663500U (en) | Wireless command mobile video integrated machine | |
CN112861775B (en) | Deep neural network-based consultation personnel identification recording system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20151209 |
|
WD01 | Invention patent application deemed withdrawn after publication |