CN110366017A - Smart television voice camera device and smart television set - Google Patents
- Publication number
- CN110366017A (application CN201910493460.5A)
- Authority
- CN
- China
- Prior art keywords
- voice
- signal
- smart television
- cam device
- transmission module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/94—Hardware or software architectures specially adapted for image or video understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/42203—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/4223—Cameras
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/426—Internal components of the client ; Characteristics thereof
- H04N21/42607—Internal components of the client ; Characteristics thereof for processing the incoming bitstream
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Software Systems (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The invention discloses a smart television voice camera device and a smart television set. The device is connected to the TV main chip through a transmission module as an external device, and comprises a voice camera module of integrated design. The voice camera module acquires external sound signals and image signals, while the transmission module receives an echo reference signal from the TV main chip and passes it to the voice camera module. The voice camera module cancels echo according to the echo reference signal, performs image recognition according to the image signal, and generates a main signal that is sent to the TV main chip through the transmission module. Because the smart television voice camera device is independent of the television set and connects to it through the transmission module, no hardware change to the TV mainboard is needed; connecting the device directly is enough to make a conventional TV intelligent.
Description
Technical field
The present invention relates to the field of smart televisions, and in particular to a smart television voice camera device and a smart television set.
Background art
Artificial intelligence is currently a new sales growth point for televisions. It is used more and more widely on TVs and has greatly enriched their usage scenarios. Speech recognition and image recognition are two important entry points for artificial intelligence: speech recognition is implemented with a far-field voice array, and image recognition is implemented with a camera module.
In existing artificial intelligence television sets, implementing the voice function requires placing the far-field voice array inside the television housing in the form of a small board.
However, placing the far-field voice array inside the television housing not only increases the thickness of the TV back panel and makes the appearance unattractive; for complete sets that have already been sold, it also means a conventional TV cannot be upgraded to an artificial intelligence TV at the hardware level.
The prior art therefore needs to be improved.
Summary of the invention
In view of the above deficiencies of the prior art, the purpose of the present invention is to provide a smart television voice camera device and a smart television set. The smart television voice camera device is independent of the television set and connects to it through a transmission module, so no hardware change to the TV mainboard is needed; connecting the device directly is enough to make a conventional TV intelligent.
To achieve the above object, the present invention adopts the following technical solutions:
In one aspect, the present invention provides a smart television voice camera device that is connected to the TV main chip through a transmission module as an external device. The device comprises a voice camera module of integrated design, which is connected to the TV main chip through the transmission module as an external device. The voice camera module acquires external sound signals and image signals, while the transmission module receives an echo reference signal from the TV main chip and passes it to the voice camera module. The voice camera module cancels echo according to the echo reference signal, performs image recognition according to the image signal, and generates a main signal that is sent to the TV main chip through the transmission module. Because the smart television voice camera device is independent of the television set and connects to it through a USB interface and an earphone interface, no hardware change to the TV mainboard is needed; connecting this circuit directly is enough to make a conventional TV intelligent.
Optionally, in an embodiment of the present invention, the voice camera module comprises a power supply unit, a sound processing unit, a camera module and a hub. The sound processing unit obtains the external sound signal and the echo reference signal and performs echo processing; after echo processing, the signal is converted into a first signal and sent to the hub. The camera module acquires the image signal, performs image processing, and generates a second signal that is sent to the hub. The hub processes the first signal and the second signal to generate the main signal, which is sent to the TV main chip through the transmission module. In this embodiment, the units and components of the voice camera module perform echo processing and image processing, which ultimately gives a conventional television speech recognition and image recognition and makes it intelligent.
Optionally, in an embodiment of the present invention, the power supply unit is connected to the sound processing unit, the camera module and the hub to provide each with the voltage it requires. The power supply unit obtains its supply voltage through its connection to the transmission module. The power supply unit comprises a first voltage regulator and a second voltage regulator: with the transmission module acting as the power source, the first and second voltage regulators convert the voltage in succession into different voltages to power the sound processing unit and the hub. In this embodiment, the first and second voltage regulators in the power supply unit perform a two-stage conversion of the USB interface voltage to meet the different power requirements of the units and components.
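The two-stage conversion described above can be sketched as a small model. This is only an illustration under stated assumptions: the 5 V input is the nominal USB bus voltage, while the 3.3 V and 1.8 V outputs, the dropout figures and the names `Regulator` and `power_rails` are hypothetical and do not come from the patent, which does not state the rail voltages.

```python
from dataclasses import dataclass


@dataclass
class Regulator:
    """A fixed-output voltage regulator (all values here are hypothetical)."""
    name: str
    v_out: float    # regulated output voltage, in volts
    dropout: float  # minimum headroom above v_out the input must provide

    def convert(self, v_in: float) -> float:
        # A regulator can only hold its output if the input has enough headroom.
        if v_in < self.v_out + self.dropout:
            raise ValueError(f"{self.name}: input {v_in} V is too low")
        return self.v_out


def power_rails(v_usb: float = 5.0) -> dict:
    """Derive two lower rails from the USB supply in two successive stages."""
    first = Regulator("first regulator", v_out=3.3, dropout=0.3)    # assumed rail
    second = Regulator("second regulator", v_out=1.8, dropout=0.2)  # assumed rail
    stage1 = first.convert(v_usb)
    stage2 = second.convert(stage1)  # the second stage is fed by the first
    return {"usb": v_usb, "stage1": stage1, "stage2": stage2}


rails = power_rails()
print(rails)
```

Chaining the regulators means each unit can draw from whichever rail matches its requirement, while the whole device still needs only the single supply pin of the USB interface.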
Optionally, in an embodiment of the present invention, the voice camera module further comprises analog silicon microphones, which acquire the external sound signal and convert it into an analog signal that is passed to the sound processing unit for echo processing. In this embodiment, the sound processing unit performs analog-to-digital conversion of the echo reference signal and, in combination with the analog silicon microphones that convert the external sound into an analog signal, also performs analog-to-digital conversion of the external sound signal.
Optionally, in an embodiment of the present invention, the sound processing unit comprises an MCU and an analog-to-digital converter. The analog-to-digital converter simultaneously receives the external sound signal from the analog silicon microphones and the echo reference signal from the earphone interface, and passes them to the MCU, which performs echo cancellation and generates the first signal for the hub. In this embodiment, the analog-to-digital converter digitizes the external sound signal and the echo reference signal, and the MCU then performs echo cancellation, enabling accurate speech recognition.
Optionally, in an embodiment of the present invention, the hub expands one interface into two interfaces, receives the first signal and the second signal, merges them into the main signal, and sends the main signal to the TV main chip.
A smart television set comprises a TV main chip, a transmission module and the smart television voice camera device described in any of the above, connected to one another. When the TV main chip is connected to the smart television voice camera device through the transmission module, the TV main chip obtains, through the transmission module, the main signal sent by the smart television voice camera device, enabling the television set to perform speech recognition or image recognition.
Optionally, in an embodiment of the present invention, the transmission module is a wired transmission module or a wireless transmission module.
Optionally, in an embodiment of the present invention, the wired transmission module comprises a wired transmission interface and an earphone interface. The earphone interface is connected to the smart television voice camera device and sends the echo reference signal to it; the wired transmission interface is connected to the smart television voice camera device and receives the main signal sent by it.
Compared with the prior art, the smart television voice camera device and smart television set provided by the present invention connect to the TV main chip through a transmission module as an external device. The smart television voice camera device comprises a voice camera module of integrated design, which is connected to the TV main chip through the transmission module as an external device. The voice camera module acquires external sound signals and image signals, while the transmission module receives an echo reference signal from the TV main chip and passes it to the voice camera module. The voice camera module cancels echo according to the echo reference signal, performs image recognition according to the image signal, and generates a main signal that is sent to the TV main chip through the transmission module. Because the smart television voice camera device is independent of the television set and connects to it through the transmission module, no hardware change to the TV mainboard is needed; connecting the device directly is enough to make a conventional television intelligent.
Brief description of the drawings
Fig. 1 is a structural block diagram of the smart television set provided by the present invention;
Fig. 2 is a structural block diagram of the smart television set with a wired transmission module provided by the present invention;
Fig. 3 is a structural block diagram of the smart television set with a wireless transmission module provided by the present invention;
Fig. 4 is a structural block diagram of the smart television set with a wired transmission interface provided by the present invention;
Fig. 5 is a structural block diagram of a specific embodiment of the smart television voice camera device provided by the present invention;
Fig. 6 is a circuit diagram of the smart television voice camera device provided by the present invention.
Detailed description of embodiments
In the prior art, placing the far-field voice microphone array inside the television housing increases the thickness of the set and cannot be adapted to conventional televisions. To address these shortcomings, the present invention makes the smart television voice camera device independent of the television set and connects it to the set through a transmission module, so no hardware change to the TV mainboard is needed; connecting the device directly is enough to make a conventional TV intelligent.
To make the purpose, technical solution and effects of the present invention clearer and more definite, the present invention is described further below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the present invention and do not limit it.
Echo cancellation in the present invention means removing the sound emitted by the TV's own loudspeaker. When the user is listening to music or watching a show, sound is produced not only when the user speaks a voice command but also by the TV itself. The far-field voice array, however, should respond only to the user's spoken commands and not to the sound from the TV's own loudspeaker. Echo cancellation therefore takes the sound signal captured by the far-field voice array, removes the component emitted by the TV's own loudspeaker using a specific algorithm, and only then records the sound, which effectively improves the user wake-up rate.
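The patent does not name the "specific algorithm" used for echo cancellation. As an illustration only, the sketch below uses a normalized least-mean-squares (NLMS) adaptive filter, a common textbook choice that is swapped in here and is not taken from the patent. The function name `nlms_echo_cancel`, the tap count, the step size and the synthetic signals are all assumptions.

```python
import random


def nlms_echo_cancel(mic, ref, taps=8, mu=0.5, eps=1e-8):
    """Remove from `mic` the part that can be predicted from `ref`.

    mic: samples picked up by the microphone (user speech plus TV echo)
    ref: echo reference samples, i.e. the audio the TV itself is playing
    """
    weights = [0.0] * taps   # running estimate of the speaker-to-mic echo path
    history = [0.0] * taps   # most recent reference samples, newest first
    residual = []
    for d, x in zip(mic, ref):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        e = d - echo_estimate                      # what is left after removing echo
        power = sum(h * h for h in history) + eps  # normalizes the step size
        weights = [w + mu * e * h / power for w, h in zip(weights, history)]
        residual.append(e)
    return residual


# Synthetic check: the microphone hears only a delayed, attenuated copy of the TV audio.
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
mic = [0.0, 0.0] + [0.6 * r for r in ref[:-2]]
residual = nlms_echo_cancel(mic, ref)

energy_in = sum(v * v for v in mic[-200:])
energy_out = sum(v * v for v in residual[-200:])
print(energy_out < 1e-6 * energy_in)
```

In this idealized, noise-free case the filter learns the echo path and the residual energy collapses. A real device would also have to cope with the user speaking while the TV is playing, which this sketch does not address.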
Referring to Fig. 1, the present invention provides a smart television set comprising a TV main chip 100, a transmission module 200 and a smart television voice camera device 300, connected to one another. When the TV main chip 100 is connected to the smart television voice camera device 300 through the transmission module 200, the TV main chip 100 obtains, through the transmission module 200, the main signal sent by the smart television voice camera device 300, enabling the television set to perform speech recognition or image recognition. Specifically, the TV main chip 100 is located inside the television set and is connected to the smart television voice camera device 300 through the transmission module 200. The smart television voice camera device 300 is thus an external device of the smart television set, so the set can perform far-field speech recognition and image recognition without any change to its hardware structure, which makes the television set intelligent.
Referring to Fig. 2 or Fig. 3, optionally, in an embodiment of the present invention, the transmission module 200 is a wired transmission module 210 or a wireless transmission module 220.
Specifically, with continued reference to Fig. 4 and Fig. 5, the wired transmission module 210 comprises a wired transmission interface 211 and an earphone interface 212. The earphone interface 212 is connected to the smart television voice camera device 300 and sends the echo reference signal to it; the wired transmission interface 211 is connected to the smart television voice camera device 300 and receives the main signal sent by it. Optionally, in an embodiment of the present invention, the wired transmission interface 211 is a USB interface 213. It should be noted that the wired transmission interface 211 may also be another interface with a transmission function, such as an HDMI interface or an AV interface.
Specifically, with continued reference to Fig. 3, the wireless transmission module 220 comprises at least two wireless connectors, located in the television set and in the smart television voice camera device 300 respectively. The wireless connector in the television set is wirelessly connected to the wireless connector in the smart television voice camera device 300. The wireless connector in the television set sends the echo reference signal to the wireless connector in the smart television voice camera device 300 and receives the main signal sent back by it. It should be noted that the wireless connector may be a WIFI module, a Bluetooth module or the like; it only needs to support wireless data transmission, and no specific limitation is imposed here.
Referring to Fig. 4 and Fig. 5, based on the above smart television set, the present invention also provides a smart television voice camera device 300, which is connected to the TV main chip through the transmission module 200 as an external device. The smart television voice camera device 300 comprises a voice camera module 310 of integrated design, which is connected to the TV main chip through the transmission module 200 as an external device. The voice camera module 310 acquires external sound signals and image signals, while the transmission module 200 receives an echo reference signal from the TV main chip and passes it to the voice camera module 310. The voice camera module 310 cancels echo according to the echo reference signal, performs image recognition according to the image signal, and generates a main signal that is sent to the TV main chip through the transmission module 200. Because the smart television voice camera device 300 is independent of the television set and connects to it through the transmission module, no hardware change to the TV mainboard is needed; connecting the device directly is enough to make a conventional TV intelligent.
In a specific implementation, the transmission module 200 in this embodiment comprises a USB interface 213 and an earphone interface 212, since most televisions currently on the market have both. The speech recognition function and the image recognition function are integrated in one module to form the voice camera module 310, which is independent of the television set and connects to it through the USB interface 213 and the earphone interface 212. When a user at a distance issues a voice command, the voice camera module 310 receives the external sound signal and at the same time receives the echo reference signal sent by the TV. It performs echo cancellation using the echo reference signal to remove the sound emitted by the television itself, so that the user's voice command is recognized accurately. This embodiment integrates the speech recognition function and the image function on one circuit board in the form of a TV peripheral, which avoids hardware changes to the TV mainboard, does not increase the thickness of the television set, fits most conventional televisions on the market, and easily makes a conventional TV intelligent.
Referring to Fig. 6, in particular, the USB interface 213 is a USB2.0 interface 214, which is a commonly used interface: most televisions on the market that have a USB interface 213 provide a USB2.0 interface 214. It should be noted that the USB interface 213 is not limited to the USB2.0 interface 214; a USB3.0 interface or another USB interface may also be chosen. The output interface of the voice camera module 310 can be adjusted to match the interface type of the television set. Those skilled in the art can make this substitution as needed, and no limitation is imposed here.
Optionally, in an embodiment of the present invention, the main signal is obtained by processing the first signal and the second signal. Merging the two USB signals into one saves a communication port.
Specifically, with continued reference to Fig. 6, the voice camera module 310 comprises a power supply unit 401, a sound processing unit 402, a camera module CM and a hub USB_HUB. The sound processing unit 402 obtains the external sound signal and the echo reference signal and performs echo processing; after echo processing, the signal is converted into the first signal and sent to the hub USB_HUB. The camera module CM acquires the image signal, performs image processing, and generates the second signal, which is sent to the hub USB_HUB. The hub USB_HUB processes the first signal and the second signal to generate the main signal, which is sent to the TV main chip 100 through the USB2.0 interface 214. In this embodiment, the units and components of the voice camera module perform echo processing and image processing, which ultimately gives a conventional television speech recognition and image recognition and makes it intelligent.
Further, the first pin USB_VCC of the USB2.0 interface 214 is connected to the power supply unit 401, the sound processing unit 402, the third pin CM_VCC of the camera module and the seventh pin VCC of the hub. The first pin OUT1 and the second pin OUT2 of the hub are connected to the second pin USB_D+ and the third pin USB_D- of the USB2.0 interface 214, respectively. The third pin IN1 and the fourth pin IN2 of the hub are connected to the first pin CM_OUT1 and the second pin CM_OUT2 of the camera module. The fifth pin IN3 and the sixth pin IN4 of the hub are each connected to the sound processing unit 402. The eighth pin VCC1 of the hub is connected to the power supply unit 401. The first pin Earphone_L and the second pin Earphone_R of the earphone interface 212 are connected to the sound processing unit 402.
Optionally, with continued reference to Fig. 6, in an embodiment of the present invention, the hub USB_HUB can expand one USB interface into at most four USB interfaces. It receives the first signal and the second signal, merges them into the main signal, and sends the main signal to the TV main chip 100. In this embodiment, the hub USB_HUB converges four USB signals into two USB signals, which saves interface terminals and allows the device to be fitted to conventional televisions.
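The effect of the hub, two device streams sharing one connection to the TV, can be pictured with the toy model below. It is a deliberately simplified sketch: a real USB hub forwards transactions scheduled by the host rather than interleaving packets in turn, and the function `merge_upstream`, the stream names and the packet labels are hypothetical.

```python
from itertools import zip_longest


def merge_upstream(queues):
    """Interleave packets from several downstream devices onto one upstream link.

    queues maps a device name to the list of packets it has ready to send.
    Each output entry is tagged with its source so the receiver can split
    the streams apart again.
    """
    merged = []
    for round_of_packets in zip_longest(*queues.values()):
        for name, packet in zip(queues.keys(), round_of_packets):
            if packet is not None:  # this device had nothing left to send
                merged.append((name, packet))
    return merged


main_signal = merge_upstream({
    "first_signal": ["a0", "a1", "a2"],  # audio from the sound processing unit
    "second_signal": ["v0", "v1"],       # video from the camera module
})
print(main_signal)
```

Tagging each packet with its source mirrors the point that the TV main chip still sees two separate functions, audio and camera, even though only one physical USB port on the television is occupied.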
Optionally, in an embodiment of the present invention, the voice camera module 310 further comprises analog silicon microphones, which acquire the external sound signal and convert it into an analog signal that is passed to the sound processing unit 402 for echo processing. In this embodiment, the sound processing unit 402 performs analog-to-digital conversion of the echo reference signal and, in combination with the analog silicon microphones that convert the external sound into an analog signal, also performs analog-to-digital conversion of the external sound signal.
Optionally, with continued reference to Fig. 6, in an embodiment of the present invention, there are two analog silicon microphones: a first analog silicon microphone MIC1 and a second analog silicon microphone MIC2. It should be noted that the number of analog silicon microphones in this embodiment serves only to illustrate the technical solution of the present invention and does not limit it.
The sound processing unit 402 comprises a microprocessor MCU and an analog-to-digital converter ADC. The analog-to-digital converter ADC simultaneously receives the external sound signal from the analog silicon microphones and the echo reference signal from the earphone interface 212, and passes them to the microprocessor MCU, which performs echo cancellation and generates a USB signal for the hub USB_HUB. In this embodiment, the analog-to-digital converter ADC digitizes the external sound signal and the echo reference signal, and the microprocessor MCU then performs echo cancellation, enabling accurate speech recognition.
Specifically, with continued reference to Fig. 6, the first pin MCU_OUT1 and the second pin MCU_OUT2 of the microprocessor MCU are connected to the fifth pin IN3 and the sixth pin IN4 of the hub. The third pin MCU_IN1 and the fourth pin MCU_IN2 of the microprocessor MCU are connected to the first pin ADC_OUT1 and the second pin ADC_OUT2 of the analog-to-digital converter, respectively. The fifth pin MCU_VCC and the sixth pin MCU_VCC1 of the microprocessor MCU are connected to the first pin USB_VCC of the USB interface 213 and to the power supply unit 401, respectively. The third pin ADC_IN1 and the fourth pin ADC_IN2 of the analog-to-digital converter are connected to the first analog silicon microphone MIC1 and the second analog silicon microphone MIC2, respectively. The fifth pin ADC_IN3 and the sixth pin ADC_IN4 of the analog-to-digital converter are connected to the second pin Earphone_R and the first pin Earphone_L of the earphone interface 212, respectively. The seventh pin ADC_VCC1 and the eighth pin ADC_VCC2 of the analog-to-digital converter are each connected to the power supply unit 401.
In a specific implementation, the first analog silicon microphone MIC1 and the second analog silicon microphone MIC2 acquire the external sound signal and pass it to the analog-to-digital converter ADC, while the TV main chip 100 feeds the echo reference signal to the analog-to-digital converter ADC through the earphone interface 212. After passing through the analog-to-digital converter ADC, the external sound signal from the analog silicon microphones and the echo reference signal from the earphone interface 212 are converted into an I2S signal and sent to the microprocessor MCU for processing. The microprocessor MCU converts the I2S signal into the first signal and sends it to the hub USB_HUB for processing. Meanwhile, the camera module CM generates the second signal, which is also fed to the hub USB_HUB for processing. The hub USB_HUB processes the two USB signals into a single USB signal and inputs it to the TV main chip 100.
In particular, when selecting microphones, the embodiment of the present invention takes into account that a digital silicon microphone converts the voice signal into a digital signal, whereas an analog silicon microphone converts the voice signal into an analog signal; a digital silicon microphone therefore performs one more step, analog-to-digital conversion, than an analog silicon microphone. That is, a digital silicon microphone is an analog silicon microphone with a dedicated analog-to-digital converter ADC packaged inside it to convert the analog signal into a digital signal. Because of this additional ADC, a digital silicon microphone costs 50% more than an analog silicon microphone. In the embodiment of the present invention, sound processing unit 402 already uses an analog-to-digital converter ADC to receive the echo reference signal transmitted by earphone interface 212; that is, voice camera module 310 itself already contains an ADC. Moreover, this ADC has four inputs, of which receiving the echo reference signal through earphone interface 212 occupies only two, leaving the other two idle. The embodiment connects the first analog silicon microphone MIC1 and the second analog silicon microphone MIC2 to the two idle inputs, receives the external voice signal through MIC1 and MIC2, and then has the ADC perform analog-to-digital conversion on that signal. This achieves the function of a digital silicon microphone while saving 50% of the cost.
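The reuse of the two idle ADC inputs can be sketched as a simple channel-allocation check. This is a minimal illustration, not the device's firmware; `allocate_inputs` and the dictionary layout are hypothetical, while the input names follow the pin names used in the description.

```python
ADC_INPUTS = ["ADC_IN1", "ADC_IN2", "ADC_IN3", "ADC_IN4"]  # four-input ADC


def allocate_inputs(already_used, new_sources):
    """Assign new analog sources to idle ADC inputs.

    already_used: mapping of occupied input name -> source.
    new_sources: list of sources that still need an input.
    Raises ValueError if there are fewer idle inputs than new sources.
    """
    idle = [ch for ch in ADC_INPUTS if ch not in already_used]
    if len(new_sources) > len(idle):
        raise ValueError("not enough idle ADC inputs")
    return dict(zip(idle, new_sources))


# The echo reference from the earphone interface occupies two inputs,
# so the two analog silicon microphones take the two that remain idle.
echo_inputs = {"ADC_IN3": "Earphone_R", "ADC_IN4": "Earphone_L"}
mic_inputs = allocate_inputs(echo_inputs, ["MIC1", "MIC2"])
```

A third microphone would raise `ValueError`, which mirrors the point that exactly two inputs were free.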
Further, in the embodiment of the present invention, the echo reference signal that sound processing unit 402 needs for echo cancellation is taken directly from the signal of earphone interface 212 on the TV mainboard. Earphone interface 212 is a standard TV interface and is present on most TVs. The peak amplitude of a standard earphone signal is about 800 mV, and the analog-to-digital converter ADC requires its input signal amplitude to be less than 1 V, which the earphone signal satisfies. The earphone signal can therefore serve directly as the reference signal for echo cancellation and be connected straight to the analog-to-digital converter ADC, with no step-down circuit required.
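The amplitude argument above is a one-line comparison, sketched here with the figures from the text (about 800 mV peak, input limit of 1 V). The function names are hypothetical.

```python
def needs_attenuation(peak_mv, adc_limit_mv=1000.0):
    """True if the signal peak reaches or exceeds the ADC input limit."""
    return peak_mv >= adc_limit_mv


def headroom_mv(peak_mv, adc_limit_mv=1000.0):
    """Margin in millivolts between the signal peak and the ADC input limit."""
    return adc_limit_mv - peak_mv


earphone_peak_mv = 800.0  # approximate peak of a standard earphone signal
```

With these numbers, `needs_attenuation(earphone_peak_mv)` is false and about 200 mV of margin remains, which is why no step-down stage is placed in front of the ADC.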
Optionally, referring again to Fig. 6, in the embodiment of the present invention the power supply unit 401 is connected to sound processing unit 402 and to the hub USB_HUB to provide each with the corresponding voltage; the power supply unit 401 obtains the required supply voltage through its connection to USB interface 213. The power supply unit 401 comprises a first voltage regulator LDO1 and a second voltage regulator LDO2: with USB interface 213 as the power source, the first voltage regulator LDO1 and the second voltage regulator LDO2 successively convert the supply into different voltages to power sound processing unit 402 and the hub USB_HUB. In other words, LDO1 and LDO2 in the power supply unit perform a two-stage conversion of the voltage from USB interface 213 to meet the different supply requirements of each unit and device.
Specifically, the input terminal LDO1_IN of the first voltage regulator is connected to the first pin USB_VCC of the USB interface; the output terminal LDO1_OUT of the first voltage regulator is connected to the input terminal LDO2_IN of the second voltage regulator, the eighth pin VCC1 of the hub, the sixth pin MCU_VCC1 of the microprocessor, and the seventh pin ADC_VCC1 of the analog-to-digital converter; and the output terminal LDO2_OUT of the second voltage regulator is connected to the eighth pin ADC_VCC2 of the analog-to-digital converter.
In a specific implementation, the input supply of the entire smart TV voice camera device 300 is the 5 V on the first pin USB_VCC of the USB 2.0 interface, which the first voltage regulator LDO1 and the second voltage regulator LDO2 successively convert into 3.3 V and 1.8 V to power each chip. The hub USB_HUB and the microprocessor MCU are each supplied with two voltages, 5 V and 3.3 V; camera module CM is supplied with 5 V; and the analog-to-digital converter ADC is supplied with two voltages, 3.3 V and 1.8 V.
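The two-stage supply described above can be modeled as a small rail tree and checked against each chip's stated voltages. This is a sketch under assumptions: the rail names `V3P3` and `V1P8`, the tuple layout, and both helper functions are hypothetical; the voltages and the chip requirements are the ones given in the text.

```python
SOURCE_RAILS = {"USB_VCC": 5.0}  # 5 V from the first pin of the USB 2.0 interface

# (regulator, input rail, output rail, output volts); rail names are illustrative
REGULATORS = [
    ("LDO1", "USB_VCC", "V3P3", 3.3),
    ("LDO2", "V3P3", "V1P8", 1.8),
]

# Supply voltages each chip needs, as stated in the description
LOADS = {
    "USB_HUB": [5.0, 3.3],
    "MCU": [5.0, 3.3],
    "CM": [5.0],
    "ADC": [3.3, 1.8],
}


def build_rails(source_rails, regulators):
    """Derive every rail by applying the regulators in order."""
    rails = dict(source_rails)
    for name, vin, vout, volts in regulators:
        if vin not in rails:
            raise KeyError(f"{name}: input rail {vin} is not available")
        if volts >= rails[vin]:
            raise ValueError(f"{name}: a linear regulator can only step down")
        rails[vout] = volts
    return rails


def unmet_loads(rails, loads):
    """Return {chip: [missing voltages]} for chips whose needs are not covered."""
    available = set(rails.values())
    missing = {}
    for chip, volts in loads.items():
        lacking = [v for v in volts if v not in available]
        if lacking:
            missing[chip] = lacking
    return missing


rails = build_rails(SOURCE_RAILS, REGULATORS)
```

`unmet_loads(rails, LOADS)` returns an empty dictionary here, meaning the 5 V, 3.3 V, and 1.8 V rails together cover every chip listed.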
In conclusion smart television voice cam device provided by the invention and intelligent TV set, pass through transmission mould
Block is connect in the form of external equipment with TV master chip, and described device includes voice camera module, the voice camera
Integrated modular setting, and connect in the form of external equipment with TV master chip transmission module;The voice camera
Module is acquired external voice signal and picture signal, while the transmission module is believed from the reference of TV chip reception of echoes
Number voice camera module is reached, echo is eliminated according to echo reference signal by voice camera module and is believed according to image
Number image recognition is carried out, and generates main signal and TV master chip is reached by transmission module.The present invention is by by smart television language
Sound cam device independently of television set outside, and described device is attached by transmission module and television set, Jin Erwu
Hardware change need to be carried out to TV motherboard, traditional tv intelligence can easily be realized by directly connecting this device.
It should be understood that those of ordinary skill in the art may make equivalent substitutions or changes according to the technical solution of the present invention and its inventive concept, and all such changes or substitutions shall fall within the protection scope of the appended claims of the present invention.
Claims (10)
1. A smart TV voice camera device, connected to a TV main chip through a transmission module in the form of an external device, characterized in that the device comprises a voice camera module; the voice camera module is provided in integrated form and is connected to the TV main chip through the transmission module in the form of an external device; the voice camera module captures an external voice signal and an image signal, while the transmission module receives an echo reference signal from the TV main chip and passes it to the voice camera module; the voice camera module cancels echo according to the echo reference signal, performs image recognition according to the image signal, and generates a main signal that is passed to the TV main chip through the transmission module.
2. The smart TV voice camera device according to claim 1, characterized in that the voice camera module comprises a power supply unit, a sound processing unit, a camera module, and a hub; the sound processing unit obtains the external voice signal and the echo reference signal and performs echo processing, and the echo-processed signal is converted into a first signal and passed to the hub; the camera module captures the image signal, performs image processing, and generates a second signal that is passed to the hub; the hub processes the first signal and the second signal to generate the main signal, which is passed to the TV main chip through the transmission module.
3. The smart TV voice camera device according to claim 2, characterized in that the power supply unit is connected to the sound processing unit, the camera module, and the hub to provide each with the corresponding voltage; the power supply unit obtains the required supply voltage through its connection to the transmission module.
4. The smart TV voice camera device according to claim 3, characterized in that the power supply unit comprises a first voltage regulator and a second voltage regulator; with the transmission module as the power source, the first voltage regulator and the second voltage regulator successively convert the supply into different voltages to power the sound processing unit and the hub.
5. The smart TV voice camera device according to claim 2, characterized in that the voice camera module further comprises an analog silicon microphone, which captures the external voice signal and converts it into an analog signal that is passed to the sound processing unit for echo processing.
6. The smart TV voice camera device according to claim 5, characterized in that the sound processing unit comprises an MCU and an analog-to-digital converter; the analog-to-digital converter simultaneously receives the external voice signal through the analog silicon microphone and the echo reference signal transmitted by the earphone interface, and passes them to the MCU, which performs echo cancellation and generates the second signal to the hub.
7. The smart TV voice camera device according to claim 2, characterized in that the hub expands one interface into two interfaces, receives the first signal and the second signal simultaneously, merges the first signal and the second signal into the main signal, and passes the main signal to the TV main chip.
8. A smart TV, characterized in that the TV comprises an interconnected TV main chip, a transmission module, and the smart TV voice camera device according to any one of claims 1-7; the smart TV voice camera device is provided in integrated form as an external device; when the TV main chip is connected to the smart TV voice camera device through the transmission module, the TV main chip obtains, through the transmission module, the main signal transmitted by the smart TV voice camera device, enabling the TV to perform speech recognition or image recognition.
9. The smart TV according to claim 8, characterized in that the transmission module is a wired transmission module or a wireless transmission module.
10. The smart TV according to claim 9, characterized in that the wired transmission module comprises a wired transmission interface and an earphone interface; the earphone interface is connected to the smart TV voice camera device and sends the echo reference signal to the smart TV voice camera device; the wired transmission interface is connected to the smart TV voice camera device and receives the main signal transmitted by the smart TV voice camera device.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910493460.5A CN110366017A (en) | 2019-06-06 | 2019-06-06 | A kind of smart television voice cam device and intelligent TV set |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910493460.5A CN110366017A (en) | 2019-06-06 | 2019-06-06 | A kind of smart television voice cam device and intelligent TV set |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110366017A (en) | 2019-10-22 |
Family
ID=68216837
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910493460.5A Pending CN110366017A (en) | 2019-06-06 | 2019-06-06 | A kind of smart television voice cam device and intelligent TV set |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110366017A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112584022A (en) * | 2020-12-14 | 2021-03-30 | 深圳康佳电子科技有限公司 | Camera device and display system |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008041878A2 (en) * | 2006-10-04 | 2008-04-10 | Micronas Nit | System and procedure of hands free speech communication using a microphone array |
CN101431629A (en) * | 2008-12-05 | 2009-05-13 | 深圳创维-Rgb电子有限公司 | System and method for controlling television set through voice |
CN102833634A (en) * | 2012-09-12 | 2012-12-19 | 康佳集团股份有限公司 | Implementation method for television speech recognition function and television |
CN203057371U (en) * | 2012-11-08 | 2013-07-10 | 康佳集团股份有限公司 | Camera device having echo elimination function and television set |
CN203590328U (en) * | 2013-11-08 | 2014-05-07 | 康佳集团股份有限公司 | Signal acquisition device |
CN104796692A (en) * | 2014-01-20 | 2015-07-22 | 宁波舜宇光电信息有限公司 | Method and system for testing echo cancellation of television audio acquisition device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106255003B (en) | | Audio processor in the operation control method and terminal device of earphone noise reduction |
CN106708763A (en) | | Head-mounted display device and data transmission system of intelligent host |
CN104135601A (en) | | User customizable camera and general camera system thereof |
CN207283696U (en) | | A kind of USB3.1 Mobynebs integrator |
CN102497609A (en) | | Wireless network sound system |
CN102325230B (en) | | Eliminate processing method, system and the digital microphone of echo |
CN106658328B (en) | | A kind of earphone test circuit |
CN110366017A (en) | | A kind of smart television voice cam device and intelligent TV set |
CN102591609B (en) | | Remote management device and remote management system |
CN206313908U (en) | | A kind of high definition movable video monitoring equipment |
CN106303829B (en) | | Double nip head circuit and its control method |
CN209328491U (en) | | It is a kind of can integrated control LED display screen system device |
CN103313029A (en) | | Terminal for video conference |
CN112929722B (en) | | External equipment for voice control of set top box |
CN214338053U (en) | | External device for voice control of set top box |
CN203590328U (en) | | Signal acquisition device |
CN109274958A (en) | | A kind of STB audio frequency and video multichannel automatic switchover testing cassete |
CN213586241U (en) | | Far-field voice interaction device and electronic equipment |
CN208572125U (en) | | A kind of audio data conversion equipment and audio data transmission system |
CN209216081U (en) | | Interface conversion circuit, interface convertor, charger baby, audio-visual devices and intelligent terminal |
CN209299429U (en) | | A kind of video coding circuit with multiplex roles |
CN207232955U (en) | | Audio and video calling device and electronic equipment |
CN204720157U (en) | | A kind of Instrument Digital signal processing apparatus based on wireless transmission |
CN206442460U (en) | | A kind of Portable HDMI video capture device based on FPGA |
CN105282639B (en) | | Earphone microphone interface control system and earphone microphone interface control method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | RJ01 | Rejection of invention patent application after publication | Application publication date: 20191022 |