CN107831684A - Mouth-shape-to-speech transposition implemented with machine vision - Google Patents

Mouth-shape-to-speech transposition implemented with machine vision

Info

Publication number
CN107831684A
Authority
CN
China
Prior art keywords
module
information analysis
transposition
mouth
picture recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610823857.2A
Inventor
尚佐旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Siboke Technology Development Co Ltd
Original Assignee
Tianjin Siboke Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-09-16
Filing date
2016-09-16
Publication date
2018-03-23
Application filed by Tianjin Siboke Technology Development Co Ltd
Priority to CN201610823857.2A
Publication of CN107831684A
Legal status: Pending

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00 Programme-control systems
    • G05B19/02 Programme-control systems electric
    • G05B19/04 Programme control other than numerical control, i.e. in sequence controllers or logic controllers
    • G05B19/042 Programme control other than numerical control, i.e. in sequence controllers or logic controllers using digital processors
    • G05B19/0423 Input/output
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00 Program-control systems
    • G05B2219/20 Pc systems
    • G05B2219/25 Pc structure of the system
    • G05B2219/25314 Modular structure, modules

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Automation & Control Theory (AREA)
  • Manipulator (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

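This disclosure describes a mouth-shape-to-speech transposition implemented with machine vision, comprising an image detection module, an image recognition module, an information analysis module, and a sound-production drive module. The signal output of the image detection module is connected to the signal input of the image recognition module; the signal output of the image recognition module is connected to the signal input of the information analysis module; and the signal output of the information analysis module is connected to the signal input of the sound-production drive module. The image detection module consists of a camera module, the image recognition module consists of a machine vision module, the information analysis module consists of a microprocessor and an artificial neural network module, and the sound-production drive module consists of a formant synthesis module.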

Description

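Mouth-shape-to-speech transposition implemented with machine vision
Technical Field
The present invention relates to the technical fields of acoustics, image recognition, and electronics, and in particular to a mouth-shape-to-speech transposition implemented with machine vision.
Background Art
The prior art does not yet fully solve the technical problem addressed by the present invention. The mouth-shape-to-speech transposition implemented with machine vision is well suited to market demand and provides a technical solution of practical value.
Summary of the Invention
The mouth-shape-to-speech transposition implemented with machine vision comprises an image detection module, an image recognition module, an information analysis module, and a sound-production drive module. The signal output of the image detection module is connected to the signal input of the image recognition module; the signal output of the image recognition module is connected to the signal input of the information analysis module; and the signal output of the information analysis module is connected to the signal input of the sound-production drive module.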
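The hardware of the present invention consists of a camera module, a machine vision module, a microprocessor and artificial neural network module, and a formant synthesis module.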
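The patent names an artificial neural network module but gives no architecture, features, or training procedure. As a minimal sketch under stated assumptions, the following shows what a small feed-forward classifier over two mouth-shape features could look like. The feature pair (normalized mouth width and height), the three vowel labels, and the hand-picked, untrained weights are all hypothetical and are not taken from the patent.

```python
import math
from typing import List, Sequence

# Hypothetical label set; the patent does not list the sounds it recognizes.
LABELS = ["a", "i", "u"]

# Hand-picked, untrained weights chosen only to make the example readable.
# Hidden units: [width, height, 1 - width - height] after ReLU.
W_HIDDEN = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
B_HIDDEN = [0.0, 0.0, 1.0]
# Output logits: "a" follows height, "i" follows width, "u" follows "small mouth".
W_OUT = [[0.0, 2.0, 0.0], [2.0, 0.0, 0.0], [0.0, 0.0, 2.0]]
B_OUT = [0.0, 0.0, 0.0]


def dense(x: Sequence[float], weights: Sequence[Sequence[float]],
          biases: Sequence[float]) -> List[float]:
    """Fully connected layer: one weighted sum plus bias per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def relu(values: Sequence[float]) -> List[float]:
    return [max(0.0, v) for v in values]


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax (subtracts the maximum logit first)."""
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def probabilities(features: Sequence[float]) -> List[float]:
    """Forward pass: features -> hidden layer -> class probabilities."""
    hidden = relu(dense(features, W_HIDDEN, B_HIDDEN))
    return softmax(dense(hidden, W_OUT, B_OUT))


def predict(features: Sequence[float]) -> str:
    """Return the most probable label for (width, height) in [0, 1]."""
    probs = probabilities(features)
    return LABELS[probs.index(max(probs))]
```

In a real system the weights would be learned from labeled mouth images rather than written by hand, and the input would likely be a richer feature vector than two numbers.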
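The advantages of the present invention are good quality at low cost, advanced technology, and intelligent, reliable operation.
Brief Description of the Drawings
Fig. 1 is a block diagram of the present invention.
Detailed Description
As shown in Fig. 1, in the mouth-shape-to-speech transposition implemented with machine vision, the signal output of the image detection module is connected to the signal input of the image recognition module, the signal output of the image recognition module is connected to the signal input of the information analysis module, and the signal output of the information analysis module is connected to the signal input of the sound-production drive module.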
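The signal chain described here can be sketched as four functions wired output to input. The patent specifies only the order of the modules, so every name, the toy frame format, the rule-based stand-in for the analysis stage, and the formant targets (approximate textbook values for two vowels) are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Frame = List[List[int]]            # grayscale image as rows of pixel intensities
Features = Tuple[float, float]     # (normalized mouth width, normalized mouth height)
Formants = Tuple[float, float]     # (F1, F2) targets in Hz

# Approximate textbook formant values, used here only as placeholders.
FORMANT_TARGETS: Dict[str, Formants] = {"a": (730.0, 1090.0), "i": (270.0, 2290.0)}


def fake_camera() -> Frame:
    """Stand-in for the camera module: dark frame with a bright 2x2 mouth opening."""
    return [
        [0, 0, 0, 0, 0, 0],
        [0, 0, 255, 255, 0, 0],
        [0, 0, 255, 255, 0, 0],
        [0, 0, 0, 0, 0, 0],
    ]


def mouth_features(frame: Frame) -> Features:
    """Stand-in for the machine vision module: bounding box of bright pixels."""
    rows = [r for r, row in enumerate(frame) if any(p > 127 for p in row)]
    cols = [c for c in range(len(frame[0])) if any(row[c] > 127 for row in frame)]
    if not rows or not cols:
        return (0.0, 0.0)
    height = (rows[-1] - rows[0] + 1) / len(frame)
    width = (cols[-1] - cols[0] + 1) / len(frame[0])
    return (width, height)


def classify(features: Features) -> str:
    """Stand-in for the information analysis module (a rule, not a trained network)."""
    width, height = features
    if height == 0.0:
        return "silence"
    return "a" if height >= width else "i"


def formant_targets(label: str) -> Formants:
    """Stand-in for the sound-production drive module: look up synthesis targets."""
    return FORMANT_TARGETS.get(label, (0.0, 0.0))


@dataclass
class Pipeline:
    detect: Callable[[], Frame]
    recognize: Callable[[Frame], Features]
    analyze: Callable[[Features], str]
    drive: Callable[[str], Formants]

    def step(self) -> Formants:
        """Pass one frame through the four modules in the order the patent gives."""
        return self.drive(self.analyze(self.recognize(self.detect())))


if __name__ == "__main__":
    pipeline = Pipeline(fake_camera, mouth_features, classify, formant_targets)
    print(pipeline.step())
```

Keeping each stage behind a plain callable mirrors the block diagram: any stage can be swapped (for example, a real camera reader for `fake_camera`) without touching the others.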
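The image detection module consists of a camera module.
The image recognition module consists of a machine vision module.
The information analysis module consists of a microprocessor and an artificial neural network module.
The sound-production drive module consists of a formant synthesis module.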
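The patent does not say how its formant synthesis module works. One common approach, shown here as a hedged sketch rather than the patent's method, is cascade formant synthesis: a periodic impulse train stands in for the glottal source and is passed through a chain of second-order digital resonators, one per formant. The function names, default pitch, sample rate, and the formant frequencies and bandwidths in the usage note are illustrative assumptions.

```python
import math
from typing import List, Sequence, Tuple


def resonator(signal: Sequence[float], freq_hz: float, bandwidth_hz: float,
              sample_rate: int) -> List[float]:
    """Second-order resonator y[n] = a*x[n] + b*y[n-1] + c*y[n-2].

    The coefficients place a pole pair at freq_hz with the given bandwidth,
    and a = 1 - b - c normalizes the gain at 0 Hz to one.
    """
    t = 1.0 / sample_rate
    c = -math.exp(-2.0 * math.pi * bandwidth_hz * t)
    b = 2.0 * math.exp(-math.pi * bandwidth_hz * t) * math.cos(2.0 * math.pi * freq_hz * t)
    a = 1.0 - b - c
    y1 = y2 = 0.0
    out: List[float] = []
    for x in signal:
        y = a * x + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def impulse_train(pitch_hz: float, duration_s: float, sample_rate: int) -> List[float]:
    """Crude voiced source: one unit impulse per pitch period."""
    n = int(duration_s * sample_rate)
    period = max(1, round(sample_rate / pitch_hz))
    return [1.0 if i % period == 0 else 0.0 for i in range(n)]


def synthesize_vowel(formants: Sequence[Tuple[float, float]], pitch_hz: float = 120.0,
                     duration_s: float = 0.2, sample_rate: int = 16000) -> List[float]:
    """Filter the source through one resonator per (frequency, bandwidth) pair."""
    signal = impulse_train(pitch_hz, duration_s, sample_rate)
    for freq_hz, bandwidth_hz in formants:
        signal = resonator(signal, freq_hz, bandwidth_hz, sample_rate)
    peak = max(abs(s) for s in signal) or 1.0
    return [s / peak for s in signal]   # normalize to the range [-1, 1]
```

For example, `synthesize_vowel([(730.0, 90.0), (1090.0, 110.0)])` returns 0.2 s of samples with resonances near 730 Hz and 1090 Hz; the samples could then be scaled to 16-bit integers and written out with the standard-library `wave` module.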
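One embodiment of the present invention has been described in detail above, but the content described is only a preferred embodiment and shall not be regarded as limiting the scope of implementation of the invention. All equivalent changes and improvements made within the scope of this application shall remain within the coverage of this patent.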

Claims (3)

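1. A mouth-shape-to-speech transposition implemented with machine vision, comprising an image detection module, an image recognition module, an information analysis module, and a sound-production drive module, characterized in that: the signal output of the image detection module is connected to the signal input of the image recognition module, the signal output of the image recognition module is connected to the signal input of the information analysis module, and the signal output of the information analysis module is connected to the signal input of the sound-production drive module.
2. The mouth-shape-to-speech transposition implemented with machine vision according to claim 1, characterized in that the image detection module consists of a camera module.
3. The mouth-shape-to-speech transposition implemented with machine vision according to claim 1, characterized in that the image recognition module consists of a machine vision module.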
CN201610823857.2A 2016-09-16 2016-09-16 Mouth-shape-to-speech transposition implemented with machine vision Pending CN107831684A (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610823857.2A CN107831684A (zh) 2016-09-16 2016-09-16 Mouth-shape-to-speech transposition implemented with machine vision

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610823857.2A CN107831684A (zh) 2016-09-16 2016-09-16 Mouth-shape-to-speech transposition implemented with machine vision

Publications (1)

Publication Number Publication Date
CN107831684A (zh) 2018-03-23

Family

ID=61643635

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610823857.2A Pending CN107831684A (zh) 2016-09-16 2016-09-16 Mouth-shape-to-speech transposition implemented with machine vision

Country Status (1)

Country Link
CN (1) CN107831684A (zh)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
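CN101241656A (zh) * 2008-03-11 2008-08-13 黄中伟 Computer-aided training method for mouth-shape recognition ability
CN101510256A (zh) * 2009-03-20 2009-08-19 深圳华为通信技术有限公司 Method and device for converting mouth-shape language
CN101826216A (zh) * 2010-03-31 2010-09-08 中国科学院自动化研究所 Automatic generation system for Chinese mouth-shape animation of a character
CN101930747A (zh) * 2010-07-30 2010-12-29 四川微迪数字技术有限公司 Method and device for converting speech into mouth-shape images
CN102117115A (zh) * 2009-12-31 2011-07-06 上海量科电子科技有限公司 System and implementation method for selecting text input by lip language
CN102324035A (zh) * 2011-08-19 2012-01-18 广东好帮手电子科技股份有限公司 Method and system for applying mouth-shape-assisted speech recognition in vehicle navigation
CN102368198A (zh) * 2011-10-04 2012-03-07 上海量明科技发展有限公司 Method and system for prompting information through lip images
CN103092329A (zh) * 2011-10-31 2013-05-08 南开大学 Lip-language input method based on lip-reading technology
CN103905873A (zh) * 2014-04-08 2014-07-02 Tianjin Siboke Technology Development Co Ltd Television remote control based on mouth-shape recognition technology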

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180323