WO2017124225A1 - Person tracking method and system for a video network conference - Google Patents

Person tracking method and system for a video network conference

Info

Publication number
WO2017124225A1
WO2017124225A1 PCT/CN2016/071205 CN2016071205W WO2017124225A1 WO 2017124225 A1 WO2017124225 A1 WO 2017124225A1 CN 2016071205 W CN2016071205 W CN 2016071205W WO 2017124225 A1 WO2017124225 A1 WO 2017124225A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
image
location
pronunciation
list
Prior art date
Application number
PCT/CN2016/071205
Other languages
English (en)
French (fr)
Inventor
王晓光
Original Assignee
王晓光
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 王晓光
Priority to CN201680000269.7A (CN105684422A)
Priority to PCT/CN2016/071205 (WO2017124225A1)
Publication of WO2017124225A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/52 Network services specially adapted for the location of the user terminal
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules
    • H04N23/62 Control of parameters via user interfaces
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules
    • H04N23/66 Remote control of cameras or camera parts, e.g. by remote control devices
    • H04N23/661 Transmitting camera control signals through networks, e.g. control via the Internet

Definitions

  • The present invention relates to the field of images and networks, and in particular to a person tracking method and system for a video network conference.
  • Video refers to the various technologies that capture, record, process, store, transmit, and reproduce a series of still images as electrical signals.
  • When successive images change at more than 24 frames per second, the human eye cannot distinguish the individual still images, owing to persistence of vision; the result looks smooth and continuous, and such a continuous sequence of pictures is called video.
  • Video technology was first developed for television systems, but it has since evolved into a variety of formats that let consumers record video. Advances in network technology have also made it possible for recorded video segments to exist on the Internet as streaming media that computers can receive and play. Video and film are different technologies; film uses photography to capture moving images as a series of still photographs.
  • The application provides a person tracking method for a video network conference, which addresses the poor conference experience of prior-art technical solutions.
  • A person tracking method for a video network conference, comprising the following steps: configuring a location list of users, locations, and voices; receiving a user's utterance and identifying the user from the utterance; looking up the user's location in the location list and adjusting the image to the corresponding location.
  • The method further includes:
  • If the user cannot be identified by voice, the image is adjusted to the position of the sound-producing microphone.
  • The method further includes:
  • The image is enlarged and displayed.
  • A person tracking system for a video network conference, comprising:
  • A configuration unit for configuring a location list of users, locations, and voices.
  • A receiving unit configured to receive a user's utterance and identify the user from the utterance.
  • An adjusting unit configured to look up the user's location in the location list and adjust the image to the corresponding location.
  • The adjusting unit is further configured to adjust the image to the position of the sound-producing microphone if the user cannot be identified by voice.
  • The system further includes:
  • A magnifying unit for displaying the image enlarged.
  • The technical solution provided by the present invention configures a location list of users, locations, and voices; receives a user's utterance and identifies the user from it; looks up the user's location in the location list; and adjusts the image to the corresponding location. It therefore has the advantage of a good conference experience.
  • FIG. 1 is a flowchart of a person tracking method for a video network conference according to a first preferred embodiment of the present invention.
  • FIG. 2 is a structural diagram of a person tracking system for a video network conference according to a second preferred embodiment of the present invention.
  • FIG. 1 shows a person tracking method for a video network conference according to a first preferred embodiment of the present invention. As shown in FIG. 1, the method includes the following steps:
  • Step S101: configure a location list of users, locations, and voices.
  • Step S102: receive a user's utterance and identify the user from the utterance.
  • Step S103: look up the user's location in the location list and adjust the image to the corresponding location.
  • The technical solution provided by the present invention configures a location list of users, locations, and voices; receives a user's utterance and identifies the user from it; looks up the user's location in the location list; and adjusts the image to the corresponding location. It therefore has the advantage of a good conference experience.
  • The foregoing method may further include:
  • If the user cannot be identified by voice, the image is adjusted to the position of the sound-producing microphone.
  • The foregoing method may further include:
  • The image is enlarged and displayed.
  • FIG. 2 shows a person tracking system for a video network conference according to a second preferred embodiment of the present invention.
  • The system includes:
  • The configuration unit 201 is configured to configure a location list of users, locations, and voices.
  • The receiving unit 202 is configured to receive a user's utterance and identify the user from the utterance.
  • The adjusting unit 203 is configured to look up the user's location in the location list and adjust the image to the corresponding location.
  • The technical solution provided by the present invention configures a location list of users, locations, and voices; receives a user's utterance and identifies the user from it; looks up the user's location in the location list; and adjusts the image to the corresponding location. It therefore has the advantage of a good conference experience.
  • The adjusting unit 203 is further configured to adjust the image to the position of the sound-producing microphone if the user cannot be identified by voice.
  • The above system may further include:
  • The magnifying unit 204 is configured to display the image enlarged.
  • The program may be stored in a computer-readable storage medium, and the storage medium may include a flash drive, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
  • ROM: Read-Only Memory
  • RAM: Random Access Memory

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

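A person tracking method and system for a video network conference. The method comprises the following steps: configuring a location list of users, locations, and voices (101); receiving a user's utterance and identifying the user from the utterance (102); looking up the user's location in the location list and adjusting the image to the corresponding location (103). The above technical solution has the advantage of a good conference experience.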

Description

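Person tracking method and system for a video network conference
Technical Field
The present invention relates to the field of images and networks, and in particular to a person tracking method and system for a video network conference.
Background Art
Video refers broadly to the technologies that capture, record, process, store, transmit, and reproduce a series of still images as electrical signals. When successive images change at more than 24 frames per second, the human eye, owing to persistence of vision, cannot distinguish the individual still frames; the result appears as a smooth, continuous visual effect, and such a continuous sequence of frames is called video. Video technology was first developed for television systems, but it has since evolved into a variety of formats that let consumers record video. Advances in network technology have also made it possible for recorded video segments to exist on the Internet as streaming media that computers can receive and play. Video and film are different technologies; film uses photography to capture moving images as a series of still photographs.
Existing video conferences generally combine images and voice, but the image displayed in an existing video network conference cannot be directed at the speaker in real time, so the conference experience is poor.
Technical Problem
The present application provides a person tracking method for a video network conference, which addresses the poor conference experience of prior-art technical solutions.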
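Technical Solution
In a first aspect, a person tracking method for a video network conference is provided, the method comprising the following steps:
configuring a location list of users, locations, and voices;
receiving a user's utterance, and identifying the user from the utterance;
looking up the user's location in the location list, and adjusting the image to the corresponding location.
Optionally, the method further comprises:
if the user cannot be identified by voice, adjusting the image to the position of the sound-producing microphone.
Optionally, the method further comprises:
displaying the image enlarged.
In a second aspect, a person tracking system for a video network conference is provided, the system comprising:
a configuration unit, configured to configure a location list of users, locations, and voices;
a receiving unit, configured to receive a user's utterance and identify the user from the utterance;
an adjusting unit, configured to look up the user's location in the location list and adjust the image to the corresponding location.
Optionally, the adjusting unit is further configured to adjust the image to the position of the sound-producing microphone if the user cannot be identified by voice.
Optionally, the system further comprises:
a magnifying unit, configured to display the image enlarged.
Beneficial Effects
The technical solution provided by the present invention configures a location list of users, locations, and voices; receives a user's utterance and identifies the user from it; looks up the user's location in the location list; and adjusts the image to the corresponding location. It therefore has the advantage of a good conference experience.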
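Brief Description of the Drawings
To explain the technical solutions of the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. Evidently, the drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a flowchart of a person tracking method for a video network conference according to a first preferred embodiment of the present invention;
FIG. 2 is a structural diagram of a person tracking system for a video network conference according to a second preferred embodiment of the present invention.
Embodiments of the Invention
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Evidently, the described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.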
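Referring to FIG. 1, FIG. 1 shows a person tracking method for a video network conference according to a first preferred embodiment of the present invention. As shown in FIG. 1, the method includes the following steps:
Step S101: configure a location list of users, locations, and voices;
Step S102: receive a user's utterance, and identify the user from the utterance;
Step S103: look up the user's location in the location list, and adjust the image to the corresponding location.
The technical solution provided by the present invention configures a location list of users, locations, and voices; receives a user's utterance and identifies the user from it; looks up the user's location in the location list; and adjusts the image to the corresponding location. It therefore has the advantage of a good conference experience.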
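Steps S101 to S103 can be illustrated with a minimal Python sketch. The patent does not specify how voices are stored or compared, so everything concrete here is an assumption made for illustration: the `Entry` record, the (pan, tilt) position format, the toy voice feature vectors, and the cosine-similarity threshold of 0.8 are all hypothetical.

```python
from dataclasses import dataclass
from math import sqrt
from typing import List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Entry:
    """One row of the location list: user, location, and voice reference."""
    user: str
    position: Tuple[float, float]   # hypothetical (pan, tilt) angles in degrees
    voiceprint: Tuple[float, ...]   # hypothetical voice feature vector


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify_user(location_list: List[Entry], utterance: Sequence[float],
                  threshold: float = 0.8) -> Optional[Entry]:
    """S102: return the best-matching entry, or None if no match is close enough."""
    best = max(location_list, key=lambda e: cosine(e.voiceprint, utterance),
               default=None)
    if best is None or cosine(best.voiceprint, utterance) < threshold:
        return None
    return best


def track_speaker(location_list: List[Entry],
                  utterance: Sequence[float]) -> Optional[Tuple[float, float]]:
    """S103: look up the identified user's location; None means 'not recognized'."""
    entry = identify_user(location_list, utterance)
    return entry.position if entry else None


# S101: configure the location list (illustrative values only).
LOCATION_LIST = [
    Entry("alice", (-30.0, 0.0), (1.0, 0.0, 0.0)),
    Entry("bob", (25.0, 5.0), (0.0, 1.0, 0.0)),
]
```

A real system would replace the toy vectors with features from a speaker-identification model; the list lookup itself stays the same.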
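Optionally, after step S103 the above method may further include:
if the user cannot be identified by voice, adjusting the image to the position of the sound-producing microphone.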
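The microphone fallback can be sketched as a small selection function. This is only an illustration under assumptions: the patent does not say how microphone positions are stored, so the `mic_positions` mapping, the microphone identifiers, and the (pan, tilt) position format are hypothetical.

```python
from typing import Dict, Optional, Tuple

Position = Tuple[float, float]  # hypothetical (pan, tilt) angles in degrees


def choose_target(recognized_user: Optional[str],
                  active_mic: str,
                  user_positions: Dict[str, Position],
                  mic_positions: Dict[str, Position]) -> Optional[Position]:
    """Prefer the recognized user's configured location; otherwise fall back
    to the position of the microphone that is currently picking up sound."""
    if recognized_user is not None and recognized_user in user_positions:
        return user_positions[recognized_user]
    # Fallback branch: the voice could not be matched to a configured user.
    return mic_positions.get(active_mic)


USER_POSITIONS = {"alice": (-30.0, 0.0)}
MIC_POSITIONS = {"mic-1": (-28.0, 0.0), "mic-2": (20.0, 0.0)}
```

Here `choose_target(None, "mic-2", USER_POSITIONS, MIC_POSITIONS)` selects the position of `mic-2`, while a recognized `"alice"` selects her configured location regardless of which microphone is active.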
Optionally, after step S103 the above method may further include:
displaying the image enlarged.
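The patent does not say how the enlargement is performed. One possible reading, assumed here purely for illustration, is a digital zoom: crop a window around the speaker and scale it up to the full display. The sketch below computes only the crop window; the function name, the pixel coordinates, and the zoom factor are hypothetical.

```python
from typing import Tuple


def zoom_window(frame_w: int, frame_h: int,
                center_x: int, center_y: int,
                zoom: float) -> Tuple[int, int, int, int]:
    """Return (left, top, width, height) of the crop that, when scaled back
    to the full frame, shows the area around (center_x, center_y) enlarged."""
    if zoom < 1.0:
        raise ValueError("zoom must be at least 1.0")
    crop_w = int(frame_w / zoom)
    crop_h = int(frame_h / zoom)
    # Center the window on the speaker, then clamp it to the frame borders.
    left = min(max(center_x - crop_w // 2, 0), frame_w - crop_w)
    top = min(max(center_y - crop_h // 2, 0), frame_h - crop_h)
    return left, top, crop_w, crop_h
```

Clamping keeps the crop inside the frame when the speaker sits near an edge, at the cost of the speaker no longer being exactly centered.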
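Referring to FIG. 2, FIG. 2 shows a person tracking system for a video network conference according to a second preferred embodiment of the present invention. The system includes:
a configuration unit 201, configured to configure a location list of users, locations, and voices;
a receiving unit 202, configured to receive a user's utterance and identify the user from the utterance;
an adjusting unit 203, configured to look up the user's location in the location list and adjust the image to the corresponding location.
The technical solution provided by the present invention configures a location list of users, locations, and voices; receives a user's utterance and identifies the user from it; looks up the user's location in the location list; and adjusts the image to the corresponding location. It therefore has the advantage of a good conference experience.
Optionally, the adjusting unit 203 is further configured to adjust the image to the position of the sound-producing microphone if the user cannot be identified by voice.
Optionally, the above system may further include:
a magnifying unit 204, configured to display the image enlarged.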
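The division into units 201 to 204 might be mirrored in software roughly as follows. This is a sketch under assumptions, not the patent's implementation: the class and method names are hypothetical, the speaker recognizer is injected as a plain callable standing in for a real model, and positions are assumed to be (pan, tilt) pairs.

```python
from typing import Callable, Dict, Optional, Tuple

Position = Tuple[float, float]  # hypothetical (pan, tilt) angles in degrees


class ConfigurationUnit:
    """Unit 201: holds the configured user-to-location mapping."""

    def __init__(self) -> None:
        self.locations: Dict[str, Position] = {}

    def configure(self, user: str, position: Position) -> None:
        self.locations[user] = position


class ReceivingUnit:
    """Unit 202: receives audio and identifies the user from it."""

    def __init__(self, recognizer: Callable[[bytes], Optional[str]]) -> None:
        # The recognizer is a stand-in for a real speaker-identification model.
        self.recognizer = recognizer

    def receive(self, audio: bytes) -> Optional[str]:
        return self.recognizer(audio)


class AdjustingUnit:
    """Unit 203: targets the user's location, or the microphone as a fallback."""

    def __init__(self, configuration: ConfigurationUnit) -> None:
        self.configuration = configuration

    def adjust(self, user: Optional[str], mic_position: Position) -> Position:
        if user is not None and user in self.configuration.locations:
            return self.configuration.locations[user]
        return mic_position


class MagnifyingUnit:
    """Unit 204: chooses the enlargement factor (never below 1.0)."""

    def magnify(self, factor: float) -> float:
        return max(1.0, factor)


class TrackingSystem:
    """Wires the four units together in the order the description gives."""

    def __init__(self, recognizer: Callable[[bytes], Optional[str]]) -> None:
        self.configuration = ConfigurationUnit()
        self.receiving = ReceivingUnit(recognizer)
        self.adjusting = AdjustingUnit(self.configuration)
        self.magnifying = MagnifyingUnit()

    def on_audio(self, audio: bytes, mic_position: Position,
                 zoom: float = 2.0) -> Tuple[Position, float]:
        user = self.receiving.receive(audio)
        target = self.adjusting.adjust(user, mic_position)
        return target, self.magnifying.magnify(zoom)
```

Injecting the recognizer keeps the unit structure testable without audio hardware; for example, a stub such as `lambda audio: "alice" if audio == b"alice-voice" else None` is enough to exercise both the lookup and the fallback path.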
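It should be noted that, for brevity, each of the foregoing method embodiments is described as a series of combined actions. A person skilled in the art should understand, however, that the present invention is not limited by the described order of actions, because according to the present invention some steps may be performed in another order or simultaneously. A person skilled in the art should also understand that the embodiments described in this specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
In the above embodiments, the description of each embodiment has its own emphasis; for parts not described in detail in one embodiment, refer to the related descriptions of the other embodiments.
A person of ordinary skill in the art will understand that all or part of the steps in the methods of the above embodiments can be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, and the storage medium may include a flash drive, read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disc, or the like.
The content download method and related devices and systems provided by the embodiments of the present invention have been described in detail above. Specific examples are used herein to explain the principles and implementations of the present invention, and the description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. At the same time, a person of ordinary skill in the art may, following the idea of the present invention, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.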

Claims (6)

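  1. A person tracking method for a video network conference, characterized in that the method comprises the following steps:
    configuring a location list of users, locations, and voices;
    receiving a user's utterance, and identifying the user from the utterance;
    looking up the user's location in the location list, and adjusting the image to the corresponding location.
  2. The method according to claim 1, characterized in that the method further comprises:
    if the user cannot be identified by voice, adjusting the image to the position of the sound-producing microphone.
  3. The method according to claim 1, characterized in that the method further comprises:
    displaying the image enlarged.
  4. A person tracking system for a video network conference, characterized in that the system comprises:
    a configuration unit, configured to configure a location list of users, locations, and voices;
    a receiving unit, configured to receive a user's utterance and identify the user from the utterance;
    an adjusting unit, configured to look up the user's location in the location list and adjust the image to the corresponding location.
  5. The system according to claim 4, characterized in that
    the adjusting unit is further configured to adjust the image to the position of the sound-producing microphone if the user cannot be identified by voice.
  6. The system according to claim 4, characterized in that the system further comprises:
    a magnifying unit, configured to display the image enlarged.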
PCT/CN2016/071205 2016-01-18 2016-01-18 Person tracking method and system for a video network conference WO2017124225A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201680000269.7A CN105684422A (zh) 2016-01-18 2016-01-18 Person tracking method and system for a video network conference
PCT/CN2016/071205 WO2017124225A1 (zh) 2016-01-18 2016-01-18 Person tracking method and system for a video network conference

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/071205 WO2017124225A1 (zh) 2016-01-18 2016-01-18 Person tracking method and system for a video network conference

Publications (1)

Publication Number Publication Date
WO2017124225A1 (zh) 2017-07-27

Family

ID=56215753

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/071205 WO2017124225A1 (zh) 2016-01-18 2016-01-18 Person tracking method and system for a video network conference

Country Status (2)

Country Link
CN (1) CN105684422A (zh)
WO (1) WO2017124225A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7152191B2 * 2018-05-30 2022-10-12 シャープ株式会社 Operation support device, operation support system, and operation support method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102256098A * 2010-05-18 2011-11-23 宝利通公司 Videoconferencing endpoint having multiple voice-tracking cameras
EP2566194A1 * 2010-11-26 2013-03-06 Huawei Device Co., Ltd. Method and device for processing audio in video communication
CN103581606A * 2012-08-09 2014-02-12 北京博威康技术有限公司 Multimedia acquisition device and method
CN103841357A * 2012-11-21 2014-06-04 中兴通讯股份有限公司 Microphone array sound source localization method, device, and system based on video tracking
CN104238576A * 2014-09-17 2014-12-24 厦门亿联网络技术股份有限公司 Multi-microphone-based video conference camera positioning method

Also Published As

Publication number Publication date
CN105684422A (zh) 2016-06-15

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 16885498; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 Ep: PCT application non-entry in European phase (Ref document number: 16885498; Country of ref document: EP; Kind code of ref document: A1)