KR101686833B1 - System Providing Conference Image Among Several User - Google Patents

System Providing Conference Image Among Several User

Info

Publication number
KR101686833B1
Authority
KR
South Korea
Prior art keywords
speaker
image
control means
information
input device
Prior art date
Application number
KR1020150066004A
Other languages
Korean (ko)
Other versions
KR20160133224A (en)
Inventor
우정수
조영대
Original Assignee
주식회사 우현디지털
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 우현디지털
Priority to KR1020150066004A
Publication of KR20160133224A
Application granted
Publication of KR101686833B1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H04N7/152 Multipoint control units therefor
    • H04N5/232
    • H04N5/23241
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/18 Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N9/00 Details of colour television systems
    • H04N9/12 Picture reproducers
    • H04N9/31 Projection devices for colour picture display, e.g. using electronic spatial light modulators [ESLM]

Abstract

The present invention relates to a multi-party conference image providing system capable of tracking a speaker, and more particularly to a multi-party conference image providing system that tracks the speaker and provides video of the speaker.

Description

{System Providing Conference Image}

The present invention relates to a multi-party conference image providing system capable of tracking a speaker, and more particularly to a multi-party conference image providing system that tracks the speaker and provides video of the speaker.

In the case of a multi-party conference, the conference data is displayed on the screen by using a monitor or a projector in order to efficiently transmit the contents to the participating users.

When a teleconference or meeting is transmitted or recorded, the video shot by a camera installed in the conference room can be sent or stored. However, because the speaker who is presenting or speaking cannot be selected, it is difficult to convey the conference efficiently.

In addition, because only the overall video of the conference room can be recorded, the speaker cannot be clearly distinguished when the recording is viewed, which makes the content of the meeting hard to follow.

The present invention has been made to solve the above problems. Its object is to provide a multi-party conference image providing system capable of tracking a speaker, which analyzes information transmitted from a speaker recognition means, extracts the position of the speaker, and provides the image of the speaker's position from a 360-degree camera.

To accomplish the above object, the multi-party conference image providing system capable of tracking a speaker according to the present invention includes: a speaker recognition means for tracking the position of a speaker among users participating in a multi-party conference; a 360-degree camera that captures all users participating in the conference; a control means that extracts the position of the speaker by analyzing information transmitted from the speaker recognition means, extracts the image of the speaker's position from the image of the 360-degree camera, and controls the video output of the speaker; and a display means for outputting the extracted image of the speaker.

The speaker recognition means comprises a speaker input device installed at the desk where each user participating in the conference is seated. A speaker input device is installed at every seat of the users participating in the multi-party conference, has location information embedded in it, and includes a local communication module capable of transmitting data to the control means. When a speaker presses the speaker input device, the location information stored in the device is transmitted to the control means, which tracks the position of the speaker.

Alternatively, the speaker recognition means comprises a plurality of directional microphones that recognize voice, so that the speaker is tracked and recognized automatically.

N directional microphones are installed at regular intervals around the outer periphery of the 360-degree camera. When the speaker speaks, at least one directional microphone facing the direction of the voice is activated, and the direction of the speaker is tracked by analyzing the signals input to the directional microphones.

The control means comprises: a speaker position extraction module for extracting the speaker's position from the information transmitted from the speaker recognition means; and an image control module for extracting and outputting, from the images of the 360-degree camera, the image mapped to the extracted position information of the speaker.

The display means is a 4K2K (UHD) monitor that displays four FHD (1920 × 1080p) screens (four channels) simultaneously: the 360-degree screens of the 360-degree cameras on two channels (local and remote), the presentation material needed for the conference on one channel, and a mobile image, provided through mobile registration to give the speaker or presenter more flexibility, on one channel.

As described above, the multi-party conference image providing system according to the present invention analyzes the information transmitted from the speaker recognition means to extract the position of the speaker and outputs the image of the speaker's position from the 360-degree camera, so that the conference content can be conveyed effectively to a remote site during a remote conference.

FIG. 1 is a system configuration diagram schematically illustrating a multi-party conference image providing system capable of tracking a speaker according to a preferred embodiment of the present invention.
FIG. 2 is a schematic illustration of recognizing a speaker with a speaker input device according to a preferred embodiment of the present invention.
FIG. 3 schematically shows a directional microphone arranged in a 360-degree camera according to a preferred embodiment of the present invention.
FIG. 4 schematically shows tracking of the direction of a speaker using the directional microphones of FIG. 3.
FIG. 5 is a schematic view of a 4 CH image output through a display unit according to a preferred embodiment of the present invention.

Hereinafter, specific embodiments of the present invention will be described in detail with reference to the drawings.

FIG. 1 is a system configuration diagram schematically illustrating a multi-party conference image providing system capable of tracking a speaker according to a preferred embodiment of the present invention.

Referring to FIG. 1, the multi-party conference image providing system capable of tracking a speaker according to the present invention includes a speaker recognition means 10 for tracking a speaker among users participating in a multi-party conference; a 360-degree camera 20 that captures all users participating in the conference; a control means 30 that extracts the position of the speaker by analyzing information transmitted from the speaker recognition means, extracts the image of the speaker's position from the image of the 360-degree camera, and controls the video output of the speaker; and a display means 40 for outputting the extracted image of the speaker.

In a multi-party conference, users are generally seated in a circle or an ellipse so that the participants can see one another's faces. When the 360-degree camera is placed at the upper center, its shooting range covers 360 degrees, so all participants can be captured.

The speaker recognition means 10 recognizes the position of a speaker and may be of an input type or a tracking type.

In the input type, when the speaker presses the speaker input device 110 installed at the desk where the participating user is seated, the position information of the speaker input device is transmitted to the control means, which extracts the position of the speaker. In this method the speaker actively reports his or her own location.

The speaker input device 110 may be configured as a button. It has built-in location information and includes a local communication module capable of transmitting data to the control means. The control means 30 extracts, from the images of the 360-degree camera, the image of the speaker corresponding to the transmitted position information and outputs it through the display means 40.

A speaker input device 110 may be installed at every seat where users participating in the multi-party conference are seated. Each speaker input device 110 stores the location information of its installed position in advance, and when the speaker presses the button, that location information is sent to the control means 30, which extracts the location of the speaker.

For example, as shown in FIG. 2, when speaker A presses the speaker input device installed on the table at his or her seat, the speaker input device 110 is activated and its built-in position information is transmitted to the control means 30, which can confirm the position of speaker A from the transmitted location information.

Here, the location information may be replaced with the identification information of the speaker input device 110. If the identification information of each speaker input device is mapped to its location information at initial setup and stored in the control means 30, the position can be specified from the identification information alone.

Therefore, the image corresponding to the position of the speaker A among the images photographed from the 360-degree camera 20 can be extracted and controlled to be outputted through the display means 40.
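The input-type lookup, in which a device identifier is mapped to a seat position at initial setup, might be sketched as follows. This is only an illustration: the device identifiers, the choice of a bearing in degrees as the "location information", and the function name are assumptions and do not come from the patent.

```python
from typing import Dict, Optional

# Hypothetical initial setup: device ID -> seat bearing in degrees,
# measured from an arbitrary reference direction of the 360-degree camera.
SEAT_BEARINGS: Dict[str, float] = {
    "dev-A": 0.0,
    "dev-B": 90.0,
    "dev-C": 180.0,
    "dev-D": 270.0,
}


def locate_speaker(device_id: str,
                   table: Dict[str, float] = SEAT_BEARINGS) -> Optional[float]:
    """Return the stored bearing for the pressed device, or None if unknown."""
    return table.get(device_id)
```

With such a table held by the control means, a button press only needs to carry the device identifier; an unknown identifier yields `None` so the caller can keep the current view.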

The tracking type, on the other hand, may consist of directional microphones 120 that recognize voice, so that the speaker is tracked and recognized automatically.

More specifically, a plurality of directional microphones 120 may be provided to track audio in all directions. For example, as shown in FIG. 3, four directional microphones can be installed.

FIG. 3 schematically shows that a speaker's position is tracked through a directional microphone when speech is recognized from the speaker.

Referring to FIG. 3, when a speaker at a specific position speaks, the directional microphone 120 facing the speaker recognizes the speech while the remaining directional microphones do not, and the direction of the activated directional microphone is taken as the direction of the speaker.

Therefore, the control means 30 takes the direction of the activated directional microphone 120 as the position of the speaker, extracts the image in that direction from the image of the 360-degree camera 20, and outputs it. That is, the position of the speaker is tracked using the directionality detected by the directional microphones 120.

Referring to FIG. 4, when the speaker is positioned between two directional microphones 120, two of the four directional microphones are activated. If the two signal strengths are equal, the center direction between the two directional microphones is estimated as the direction of the speaker; if they differ, the direction is tracked by comparing the signal strengths.

That is, relative to the center direction between the two directional microphones, the speaker is estimated to lie toward the microphone with the larger signal, and the precise direction is tracked from the ratio of the signal strengths.

If three or more directional microphones are activated, the direction of the speaker can be tracked by comparing and analyzing the signals of the two directional microphones with the largest signals, excluding the microphones with smaller signals.
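The direction estimate described for one, two, or more activated microphones might be sketched as follows. The patent states only that the ratio of the signal sizes is used; the linear weighting between the two strongest microphones, the activation threshold, and all names here are assumptions made for illustration.

```python
from typing import List, Optional, Sequence


def mic_bearings(n: int) -> List[float]:
    """Bearings (degrees) of n microphones spaced evenly around the camera."""
    return [i * 360.0 / n for i in range(n)]


def estimate_bearing(levels: Sequence[float],
                     threshold: float = 0.1) -> Optional[float]:
    """Estimate the speaker bearing from per-microphone signal levels.

    levels[i] is the level at microphone i; a microphone counts as
    activated when its level reaches the (assumed) threshold.
    """
    bearings = mic_bearings(len(levels))
    active = [i for i, v in enumerate(levels) if v >= threshold]
    if not active:
        return None  # nobody is speaking
    if len(active) == 1:
        return bearings[active[0]]
    # With two or more active microphones, keep only the two strongest.
    a, b = sorted(active, key=lambda i: levels[i], reverse=True)[:2]
    # Signed shortest arc from microphone a to microphone b, in [-180, 180).
    arc = (bearings[b] - bearings[a] + 180.0) % 360.0 - 180.0
    # Assumed linear weighting: equal levels give the midpoint; otherwise
    # the estimate moves toward the microphone with the larger signal.
    weight_b = levels[b] / (levels[a] + levels[b])
    return (bearings[a] + arc * weight_b) % 360.0
```

For four microphones, equal levels on microphones 0 and 1 give the midpoint of 45 degrees, while levels of 3 and 1 give 22.5 degrees, closer to the stronger microphone.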

The 360-degree camera 20 adopts a spherical fisheye lens or a 360-degree viewing angle lens capable of 360-degree surround view, thereby enabling 360-degree image capture.

Using a dewarp function, once the position of the speaker has been extracted, the 360-degree camera 20 dewarps and outputs only the image corresponding to the position of the speaker.

The specific configuration and operation of the 360-degree camera are obvious to those skilled in the art and fall outside the features of the present invention, so a detailed description is omitted.

The control means 30 comprises a speaker position extraction module that extracts the position of the speaker from the information transmitted from the speaker recognition means, and an image control module that extracts the image mapped to the extracted position information from the images of the 360-degree camera and outputs it to the display means.
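The patent does not describe the dewarp algorithm itself. As a rough sketch of the idea, the following unwraps only the sector of a circular fisheye image around the speaker's bearing, using a simple polar mapping with nearest-neighbour sampling. It assumes a grayscale image stored as nested lists, an optical axis at the image center, and output dimensions of at least 2 × 2; a real lens would need a calibrated projection model, and all names are hypothetical.

```python
import math
from typing import List

Image = List[List[int]]  # grayscale image as rows of pixel values


def dewarp_sector(fisheye: Image, center_deg: float, fov_deg: float = 90.0,
                  out_w: int = 16, out_h: int = 8) -> Image:
    """Unwrap the sector of the fisheye image centered on center_deg."""
    h, w = len(fisheye), len(fisheye[0])
    cx, cy = (w - 1) / 2.0, (h - 1) / 2.0
    r_max = min(cx, cy)
    out: Image = []
    for row in range(out_h):
        # The top output row samples the outer rim, the bottom row the center.
        r = r_max * (1.0 - row / (out_h - 1))
        line = []
        for col in range(out_w):
            # Sweep the angle across the field of view from left to right.
            ang = math.radians(center_deg - fov_deg / 2
                               + fov_deg * col / (out_w - 1))
            x = min(max(int(round(cx + r * math.cos(ang))), 0), w - 1)
            y = min(max(int(round(cy + r * math.sin(ang))), 0), h - 1)
            line.append(fisheye[y][x])
        out.append(line)
    return out
```

Only the pixels inside the requested sector are sampled, which matches the idea of dewarping just the speaker's region rather than the whole 360-degree frame.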

The image control module displays a 360-degree image around the extracted speaker's position to provide a speaker-centered conference system.
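One way to read "a 360-degree image around the extracted speaker's position" is that an unwrapped panorama is rotated so the speaker sits in the middle of the frame. The sketch below follows that reading, which is an assumption; the panorama is taken to be a list of pixel rows whose columns span 0 to 360 degrees, and the function name is hypothetical.

```python
from typing import List

Image = List[List[int]]  # panorama as rows of pixel values


def center_on_bearing(panorama: Image, bearing_deg: float) -> Image:
    """Rotate the panorama horizontally so bearing_deg lands mid-frame."""
    width = len(panorama[0])
    # Column that currently shows the speaker's bearing.
    speaker_col = int(round((bearing_deg % 360.0) / 360.0 * width)) % width
    # Shift so that this column moves to the middle column.
    shift = speaker_col - width // 2
    return [row[shift:] + row[:shift] for row in panorama]
```

Because the panorama wraps around, the rotation loses no pixels; a speaker at 0 degrees simply moves from the left edge to the center column.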

The display means 40 may be implemented as a display device such as a TV, a monitor, or a projector in order to output an image photographed through the 360-degree camera.

In particular, as shown in FIG. 5, a 4K2K (UHD) monitor can display four FHD (1920 × 1080p) screens simultaneously: the 360-degree screens of the 360-degree cameras on two channels (local and remote), the presentation material on one channel, and a mobile image, provided through mobile registration to give the speaker or presenter more flexibility, on one channel.

For the mobile image, content displayed on a mobile device held by the speaker or presenter, such as a memo or a note, is synchronized through a wired or wireless connection, and the mobile image may be assigned to one channel and output.
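The four-channel arrangement amounts to splitting a 3840 × 2160 UHD frame into four 1920 × 1080 quadrants. A minimal sketch of that arithmetic follows; which channel occupies which quadrant is an assumption (FIG. 5 is not reproduced here), as are the channel names.

```python
from typing import Dict, Tuple

Rect = Tuple[int, int, int, int]  # x, y, width, height in pixels


def quad_layout(width: int = 3840, height: int = 2160) -> Dict[str, Rect]:
    """Split a UHD frame into four equal quadrants, one per channel."""
    w, h = width // 2, height // 2
    return {
        "local_360": (0, 0, w, h),      # 360-degree view of the local room
        "remote_360": (w, 0, w, h),     # 360-degree view of the remote room
        "presentation": (0, h, w, h),   # conference presentation material
        "mobile": (w, h, w, h),         # synchronized mobile-device image
    }
```

Each quadrant is exactly FHD at the default size, so each source can be shown at native 1920 × 1080 resolution without scaling.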

While the present invention has been described in connection with what are presently considered to be practical exemplary embodiments, it is to be understood that the invention is not limited to the disclosed embodiments. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention.

10: speaker recognition means
20: 360-degree camera
30: control means
40: display means

Claims (6)

A speaker recognition means for tracking the position of a speaker among users participating in a multi-party conference;
A 360-degree camera that captures all users participating in the multi-party conference;
A control means for extracting the position of the speaker by analyzing information transmitted from the speaker recognition means, extracting the image of the speaker's position from the image of the 360-degree camera, and controlling the video output of the speaker; and
A display means for outputting the extracted image of the speaker;
Wherein the speaker recognition means comprises a speaker input device installed at the desk where each user participating in the conference is seated;
Wherein the speaker input device is installed at every seat of the users participating in the multi-party conference, has location information embedded therein, includes a local communication module capable of transmitting data to the control means, and, when the speaker presses the speaker input device, transmits the location information to the control means; and
Wherein the control means extracts, from the images of the 360-degree camera, the video of the speaker corresponding to the transmitted position information, and thereby tracks the position of the speaker.
(Deleted)
(Deleted)
(Deleted)
The system according to claim 1,
Wherein the control means comprises:
A speaker position extraction module for extracting the position of the speaker from the information transmitted from the speaker recognition means; and
An image control module for extracting and outputting, from the images of the 360-degree camera, the image mapped to the extracted position information of the speaker.
The system according to claim 1,
Wherein the display means displays four FHD (1920 × 1080p) screens (four channels) simultaneously: the 360-degree screens of the 360-degree cameras on two channels (local and remote), the presentation material on one channel, and a mobile image, provided through mobile registration to give the speaker or presenter more flexibility, on one channel.
KR1020150066004A 2015-05-12 2015-05-12 System Providing Conference Image Among Several User KR101686833B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020150066004A KR101686833B1 (en) 2015-05-12 2015-05-12 System Providing Conference Image Among Several User

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020150066004A KR101686833B1 (en) 2015-05-12 2015-05-12 System Providing Conference Image Among Several User

Publications (2)

Publication Number Publication Date
KR20160133224A (en) 2016-11-22
KR101686833B1 (en) 2016-12-16

Family

ID=57540295

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020150066004A KR101686833B1 (en) 2015-05-12 2015-05-12 System Providing Conference Image Among Several User

Country Status (1)

Country Link
KR (1) KR101686833B1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10681308B2 (en) 2018-04-17 2020-06-09 Samsung Electronics Co., Ltd. Electronic apparatus and method for controlling thereof

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102284914B1 (en) * 2020-12-23 2021-08-03 디알시스 주식회사 A sound tracking system with preset images
KR102361347B1 (en) * 2021-07-01 2022-02-14 김영훈 User terminal screen sharing system for online collaboration
CN116684735B (en) * 2023-06-14 2024-04-09 广州市远知初电子科技有限公司 Audio and video acquisition system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005341015A (en) 2004-05-25 2005-12-08 Hitachi Hybrid Network Co Ltd Video conference system with minute creation support function
KR101508092B1 (en) 2014-03-13 2015-04-07 재단법인 다차원 스마트 아이티 융합시스템 연구단 Method and system for supporting video conference

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7852369B2 (en) * 2002-06-27 2010-12-14 Microsoft Corp. Integrated design for omni-directional camera and microphone array
KR100602704B1 (en) * 2004-04-01 2006-07-20 주식회사 팬택앤큐리텔 Apparatus and Method for controlling dynamic screen indication of multipoint visual communication

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005341015A (en) 2004-05-25 2005-12-08 Hitachi Hybrid Network Co Ltd Video conference system with minute creation support function
KR101508092B1 (en) 2014-03-13 2015-04-07 재단법인 다차원 스마트 아이티 융합시스템 연구단 Method and system for supporting video conference

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10681308B2 (en) 2018-04-17 2020-06-09 Samsung Electronics Co., Ltd. Electronic apparatus and method for controlling thereof

Also Published As

Publication number Publication date
KR20160133224A (en) 2016-11-22

Similar Documents

Publication Publication Date Title
US10440322B2 (en) Automated configuration of behavior of a telepresence system based on spatial detection of telepresence components
US11128793B2 (en) Speaker tracking in auditoriums
US9712785B2 (en) Method and system for video conferencing units
US9641585B2 (en) Automated video editing based on activity in video conference
US8208002B2 (en) Distance learning via instructor immersion into remote classroom
CN105657329B (en) Video conferencing system, processing unit and video-meeting method
KR101686833B1 (en) System Providing Conference Image Among Several User
US20100118112A1 (en) Group table top videoconferencing device
US20160134838A1 (en) Automatic Switching Between Dynamic and Preset Camera Views in a Video Conference Endpoint
US20080180519A1 (en) Presentation control system
US11601731B1 (en) Computer program product and method for auto-focusing a camera on an in-person attendee who is speaking into a microphone at a hybrid meeting that is being streamed via a videoconferencing system to remote attendees
US20110050840A1 (en) Apparatus, system and method for video call
US20150304559A1 (en) Multiple camera panoramic image capture apparatus
WO2015198964A1 (en) Imaging device provided with audio input/output function and videoconferencing system
CN111163280B (en) Asymmetric video conference system and method thereof
US20080122919A1 (en) Image capture apparatus with indicator
US9832372B1 (en) Dynamic vediotelphony systems and methods of using the same
CN107438169A (en) Alignment system, pre-determined bit method and real-time location method
CN113676693B (en) Picture presentation method, video conference system, and readable storage medium
US20220400244A1 (en) Multi-camera automatic framing
US9706169B2 (en) Remote conference system and method of performing remote conference
JP2009065490A (en) Video conference apparatus
EP4156147B1 (en) System, device, and method for improving visual and/or auditory tracking of a presentation given by a presenter
US20210266455A1 (en) Image capture control system, method and computer program product
US20230421898A1 (en) Autonomous video conferencing system with virtual director assistance

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant