KR20160019279A - Mobile terminal and method for controlling the same - Google Patents

Mobile terminal and method for controlling the same Download PDF

Info

Publication number
KR20160019279A
KR20160019279A KR1020140103849A KR20140103849A KR20160019279A KR 20160019279 A KR20160019279 A KR 20160019279A KR 1020140103849 A KR1020140103849 A KR 1020140103849A KR 20140103849 A KR20140103849 A KR 20140103849A KR 20160019279 A KR20160019279 A KR 20160019279A
Authority
KR
South Korea
Prior art keywords
mobile terminal
speaker
display
camera
visual effect
Prior art date
Application number
KR1020140103849A
Other languages
Korean (ko)
Inventor
지주민
김수진
박지영
최재호
윤성혜
Original Assignee
엘지전자 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 엘지전자 주식회사 filed Critical 엘지전자 주식회사
Priority to KR1020140103849A priority Critical patent/KR20160019279A/en
Publication of KR20160019279A publication Critical patent/KR20160019279A/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B1/00Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/38Transceivers, i.e. devices in which transmitter and receiver form a structural unit and in which at least one part is used for functions of transmitting and receiving
    • H04B1/40Circuits

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • Controls And Circuits For Display Device (AREA)

Abstract

[0001] The present invention relates to a mobile terminal, and more particularly, to a mobile terminal capable of distinguishing a speaker and recording a remark, thereby performing various additional functions, and a control method thereof. A mobile terminal according to an embodiment of the present invention includes a display; MIC; camera; And recognizing at least one speaker by using at least one of the sound input through the microphone and the image photographed through the camera, and recording sound for each of the recognized at least one speaker, The visual effect corresponding to the sound of the current speaker can be displayed through the display.

Description

[0001] MOBILE TERMINAL AND METHOD FOR CONTROLLING THE SAME [0002]

[0001] The present invention relates to a mobile terminal, and more particularly, to a mobile terminal capable of distinguishing a speaker and recording a remark, thereby performing various additional functions, and a control method thereof.

A terminal can be divided into a mobile terminal (mobile / portable terminal) and a stationary terminal according to whether the terminal can be moved. The mobile terminal can be divided into a handheld terminal and a vehicle mount terminal according to whether the user can directly carry the mobile terminal.

Such a terminal has various functions, for example, in the form of a multimedia device having multiple functions such as photographing and photographing of a moving picture, reproduction of a music or video file, reception of a game and broadcasting, etc. .

In order to support and enhance the functionality of such terminals, it may be considered to improve the structural and / or software parts of the terminal.

2. Description of the Related Art In general, when recording a meeting content through a mobile terminal, a user directly inputs a memo content as text or records / records a meeting process. Accordingly, when the user directly inputs text, he or she may be prevented from concentrating on the contents of the conference. Also, in the case of recording / recording, the user has to separate and organize the contents again after the conference ends.

The present invention is intended to provide a mobile terminal and a control method thereof that can record a conference content more conveniently.

In particular, the present invention provides a mobile terminal and a control method thereof, in which a speaker can be distinguished when two or more speakers participate in a conference, thereby recording the utterance contents.

In addition, the present invention provides a mobile terminal and its control method capable of performing various additional functions with speaking contents recorded separately for each speaker.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, unless further departing from the spirit and scope of the invention as defined by the appended claims. It will be possible.

According to an embodiment of the present invention, there is provided a mobile terminal including: a display; MIC; camera; And recognizing at least one speaker by using at least one of the sound input through the microphone and the image photographed through the camera, and recording sound for each of the recognized at least one speaker, The visual effect corresponding to the sound of the current speaker can be displayed through the display.

According to another aspect of the present invention, there is provided a method of controlling a mobile terminal, the method comprising the steps of: recognizing at least one speaker using at least one of a sound input through the microphone and an image captured through a camera ; Recording the sound for each of the recognized speakers; And displaying the visual effect corresponding to the utterance of the current speaker among the recognized at least one speaker through a display.

According to the present invention, the contents of the conference can be recorded more conveniently.

In particular, according to the present invention, when two or more speakers participate in a conference, the speaker can be distinguished by facial recognition or acoustic characteristics, and the utterance contents can be recorded.

In addition, according to the present invention, it is convenient that various additional functions can be performed by converting the utterance contents recorded for each speaker into text and editing or sharing them.

The effects obtained by the present invention are not limited to the above-mentioned effects, and other effects not mentioned can be clearly understood by those skilled in the art from the following description will be.

1A is a block diagram illustrating a mobile terminal according to the present invention.
1B and 1C are conceptual diagrams illustrating an example of a mobile terminal according to the present invention in different directions.
2 is a perspective view showing an example of a glass-type mobile terminal 200 according to another embodiment of the present invention.
3 illustrates an example of a process of recording a conference by distinguishing a speaker in a mobile terminal according to an embodiment of the present invention.
FIG. 4 illustrates an example of a visual effect according to a state of a recognized speaker and recognizing a speaker in a mobile terminal according to an exemplary embodiment of the present invention.
FIG. 5 illustrates an example of a visual effect according to a state of a recognized speaker and recognizing a speaker in a mobile terminal according to an exemplary embodiment of the present invention.
FIG. 6 shows an example of a visual effect applied according to a speaker in a mobile terminal according to an embodiment of the present invention.
FIG. 7 illustrates an example of a visual effect according to a speech recording in a mobile terminal according to an embodiment of the present invention.
FIG. 8 shows an example of an additional function that can be performed in a mobile terminal according to an embodiment of the present invention.
FIG. 9 shows an example of a form in which a recorded speech is converted and output in a mobile terminal according to an embodiment of the present invention.
FIG. 10 shows another example of a form in which a recorded speech is converted and output in a mobile terminal according to an embodiment of the present invention.
FIG. 11 illustrates an example of a process of selecting an additional function according to the end of speech recording in the mobile terminal according to an embodiment of the present invention.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings, wherein like reference numerals are used to designate identical or similar elements, and redundant description thereof will be omitted. The suffix "module" and " part "for the components used in the following description are given or mixed in consideration of ease of specification, and do not have their own meaning or role. In the following description of the embodiments of the present invention, a detailed description of related arts will be omitted when it is determined that the gist of the embodiments disclosed herein may be blurred. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed. , ≪ / RTI > equivalents, and alternatives.

The mobile terminal described in this specification includes a mobile phone, a smart phone, a laptop computer, a digital broadcasting terminal, a personal digital assistant (PDA), a portable multimedia player (PMP), a navigation device, a slate PC A tablet PC, an ultrabook, a wearable device such as a smartwatch, a smart glass, and a head mounted display (HMD). have.

However, it will be appreciated by those skilled in the art that the configuration according to the embodiments described herein may be applied to fixed terminals such as a digital TV, a desktop computer, a digital signage, and the like, will be.

1A to 1C are block diagrams for explaining a mobile terminal according to the present invention, and FIGS. 1B and 1C are conceptual diagrams showing an example of a mobile terminal according to the present invention in different directions.

The mobile terminal 100 includes a wireless communication unit 110, an input unit 120, a sensing unit 140, an output unit 150, an interface unit 160, a memory 170, a control unit 180, and a power supply unit 190 ), And the like. The components shown in FIG. 1A are not essential for implementing a mobile terminal, so that the mobile terminal described herein may have more or fewer components than the components listed above.

The wireless communication unit 110 may be connected between the mobile terminal 100 and the wireless communication system or between the mobile terminal 100 and another mobile terminal 100 or between the mobile terminal 100 and the external server 100. [ Lt; RTI ID = 0.0 > wireless < / RTI > In addition, the wireless communication unit 110 may include one or more modules for connecting the mobile terminal 100 to one or more networks.

The wireless communication unit 110 may include at least one of a broadcast receiving module 111, a mobile communication module 112, a wireless Internet module 113, a short distance communication module 114, and a location information module 115 .

The input unit 120 includes a camera 121 or an image input unit for inputting a video signal, a microphone 122 for inputting an audio signal, an audio input unit, a user input unit 123 for receiving information from a user A touch key, a mechanical key, and the like). The voice data or image data collected by the input unit 120 may be analyzed and processed by a user's control command.

The sensing unit 140 may include at least one sensor for sensing at least one of information in the mobile terminal, surrounding environment information surrounding the mobile terminal, and user information. For example, the sensing unit 140 may include a proximity sensor 141, an illumination sensor 142, a touch sensor, an acceleration sensor, a magnetic sensor, A G-sensor, a gyroscope sensor, a motion sensor, an RGB sensor, an infrared sensor, a finger scan sensor, an ultrasonic sensor, A microphone 226, a battery gauge, an environmental sensor (for example, a barometer, a hygrometer, a thermometer, a radiation detection sensor, A thermal sensor, a gas sensor, etc.), a chemical sensor (e.g., an electronic nose, a healthcare sensor, a biometric sensor, etc.). Meanwhile, the mobile terminal disclosed in the present specification can combine and utilize information sensed by at least two of the sensors.

The output unit 150 includes at least one of a display unit 151, an acoustic output unit 152, a haptic tip module 153, and a light output unit 154 to generate an output related to visual, auditory, can do. The display unit 151 may have a mutual layer structure with the touch sensor or may be integrally formed to realize a touch screen. The touch screen may function as a user input unit 123 that provides an input interface between the mobile terminal 100 and a user and may provide an output interface between the mobile terminal 100 and a user.

The interface unit 160 serves as a path to various types of external devices connected to the mobile terminal 100. The interface unit 160 is connected to a device having a wired / wireless headset port, an external charger port, a wired / wireless data port, a memory card port, And may include at least one of a port, an audio I / O port, a video I / O port, and an earphone port. In the mobile terminal 100, corresponding to the connection of the external device to the interface unit 160, it is possible to perform appropriate control related to the connected external device.

In addition, the memory 170 stores data supporting various functions of the mobile terminal 100. The memory 170 may store a plurality of application programs or applications running on the mobile terminal 100, data for operation of the mobile terminal 100, and commands. At least some of these applications may be downloaded from an external server via wireless communication. Also, at least a part of these application programs may exist on the mobile terminal 100 from the time of shipment for the basic functions (e.g., telephone call receiving function, message receiving function, and calling function) of the mobile terminal 100. Meanwhile, the application program may be stored in the memory 170, installed on the mobile terminal 100, and may be operated by the control unit 180 to perform the operation (or function) of the mobile terminal.

In addition to the operations related to the application program, the control unit 180 typically controls the overall operation of the mobile terminal 100. The control unit 180 may process or process signals, data, information, and the like input or output through the above-mentioned components, or may drive an application program stored in the memory 170 to provide or process appropriate information or functions to the user.

In addition, the controller 180 may control at least some of the components illustrated in FIG. 1A in order to drive an application program stored in the memory 170. FIG. In addition, the controller 180 may operate at least two of the components included in the mobile terminal 100 in combination with each other for driving the application program.

The power supply unit 190 receives external power and internal power under the control of the controller 180 and supplies power to the components included in the mobile terminal 100. The power supply unit 190 includes a battery, which may be an internal battery or a replaceable battery.

At least some of the components may operate in cooperation with one another to implement a method of operation, control, or control of a mobile terminal according to various embodiments described below. In addition, the operation, control, or control method of the mobile terminal may be implemented on the mobile terminal by driving at least one application program stored in the memory 170. [

The broadcast management server may refer to a server for generating and transmitting broadcast signals and / or broadcast related information, or a server for receiving broadcast signals and / or broadcast related information generated by the broadcast management server and transmitting the generated broadcast signals and / or broadcast related information. The broadcast signal may include a TV broadcast signal, a radio broadcast signal, a data broadcast signal, and a broadcast signal in which a data broadcast signal is combined with a TV broadcast signal or a radio broadcast signal.

The broadcasting signal may be encoded according to at least one of technical standards for transmitting and receiving a digital broadcasting signal (or a broadcasting system, for example, ISO, IEC, DVB, ATSC, etc.) It is possible to receive the digital broadcasting signal using a method conforming to the technical standard defined by the technical standards.

The broadcast-related information may be information related to a broadcast channel, a broadcast program, or a broadcast service provider. The broadcast-related information may also be provided through a mobile communication network. In this case, it may be received by the mobile communication module 112.

The broadcast-related information may exist in various forms, for example, an Electronic Program Guide (EPG) of Digital Multimedia Broadcasting (DMB) or an Electronic Service Guide (ESG) of Digital Video Broadcast-Handheld (DVB-H). The broadcast signal and / or the broadcast-related information received through the broadcast receiving module 111 may be stored in the memory 170.

The mobile communication module 112 may be a mobile communication module or a mobile communication module such as a mobile communication module or a mobile communication module that uses technology standards or a communication method (e.g., Global System for Mobile communication (GSM), Code Division Multi Access (CDMA), Code Division Multi Access 2000 (Enhanced Voice-Data Optimized or Enhanced Voice-Data Only), Wideband CDMA (WCDMA), High Speed Downlink Packet Access (HSDPA), High Speed Uplink Packet Access (HSUPA), Long Term Evolution And an external terminal, or a server on a mobile communication network established according to a long term evolution (AR), a long term evolution (AR), or the like.

The wireless signal may include various types of data depending on a voice call signal, a video call signal or a text / multimedia message transmission / reception.

The wireless Internet module 113 is a module for wireless Internet access, and may be built in or externally attached to the mobile terminal 100. The wireless Internet module 113 is configured to transmit and receive a wireless signal in a communication network according to wireless Internet technologies.

Wireless Internet technologies include, for example, wireless LAN (WLAN), wireless fidelity (Wi-Fi), wireless fidelity (Wi-Fi) Direct, DLNA (Digital Living Network Alliance), WiBro Interoperability for Microwave Access, High Speed Downlink Packet Access (HSDPA), High Speed Uplink Packet Access (HSUPA), Long Term Evolution (LTE) and Long Term Evolution-Advanced (LTE-A) 113 transmit and receive data according to at least one wireless Internet technology, including Internet technologies not listed above.

The wireless Internet module 113 for performing a wireless Internet connection through the mobile communication network can be used for wireless Internet access by WiBro, HSDPA, HSUPA, GSM, CDMA, WCDMA, LTE or LTE- May be understood as a kind of the mobile communication module 112.

The short-range communication module 114 is for short-range communication, and includes Bluetooth ™, Radio Frequency Identification (RFID), Infrared Data Association (IrDA), Ultra Wideband (UWB) (Near Field Communication), Wi-Fi (Wireless-Fidelity), Wi-Fi Direct, and Wireless USB (Wireless Universal Serial Bus) technology. The short-range communication module 114 is connected to the mobile terminal 100 and the wireless communication system through the wireless area networks, between the mobile terminal 100 and another mobile terminal 100, or between the mobile terminal 100 ) And the other mobile terminal 100 (or the external server). The short-range wireless communication network may be a short-range wireless personal area network.

Here, the other mobile terminal 100 may be a wearable device (e.g., a smartwatch, a smart glass, etc.) capable of interchanging data with the mobile terminal 100 according to the present invention (smart glass), HMD (head mounted display)). The short range communication module 114 may detect (or recognize) a wearable device capable of communicating with the mobile terminal 100 around the mobile terminal 100. [ If the detected wearable device is a device authenticated to communicate with the mobile terminal 100 according to the present invention, the control unit 180 may transmit at least a part of the data processed by the mobile terminal 100 to the short- 114 to the wearable device. Therefore, the user of the wearable device can use the data processed by the mobile terminal 100 through the wearable device. For example, according to this, when a telephone is received in the mobile terminal 100, the user performs a telephone conversation via the wearable device, or when a message is received in the mobile terminal 100, It is possible to check the message.

The position information module 115 is a module for obtaining the position (or current position) of the mobile terminal, and a representative example thereof is a Global Positioning System (GPS) module or a Wireless Fidelity (WiFi) module. For example, when the mobile terminal utilizes the GPS module, it can acquire the position of the mobile terminal by using a signal transmitted from the GPS satellite. As another example, when the mobile terminal utilizes the Wi-Fi module, it can acquire the position of the mobile terminal based on information of a wireless access point (AP) that transmits or receives the wireless signal with the Wi-Fi module. Optionally, the location information module 115 may perform any of the other functions of the wireless communication unit 110 to obtain data relating to the location of the mobile terminal, in addition or alternatively. The location information module 115 is a module used to obtain the location (or current location) of the mobile terminal, and is not limited to a module that directly calculates or obtains the location of the mobile terminal.

Next, the input unit 120 is for inputting image information (or signal), audio information (or signal), data, or information input from a user. For inputting image information, Or a plurality of cameras 121 may be provided. The camera 121 processes an image frame such as a still image or moving image obtained by the image sensor in the video communication mode or the photographing mode. The processed image frame may be displayed on the display unit 151 or stored in the memory 170. [ A plurality of cameras 121 provided in the mobile terminal 100 may be arranged to have a matrix structure and various angles or foci may be provided to the mobile terminal 100 through the camera 121 having the matrix structure A plurality of pieces of image information can be input. In addition, the plurality of cameras 121 may be arranged in a stereo structure to acquire a left image and a right image for realizing a stereoscopic image.

The microphone 122 processes the external acoustic signal into electrical voice data. The processed voice data can be utilized variously according to a function (or a running application program) being executed in the mobile terminal 100. Meanwhile, the microphone 122 may be implemented with various noise reduction algorithms for eliminating noise generated in receiving an external sound signal.

The user input unit 123 is for receiving information from a user and when the information is inputted through the user input unit 123, the control unit 180 can control the operation of the mobile terminal 100 to correspond to the input information . The user input unit 123 may include a mechanical input means (or a mechanical key such as a button located on the front, rear or side of the mobile terminal 100, a dome switch, a jog wheel, Jog switches, etc.) and touch-type input means. For example, the touch-type input means may comprise a virtual key, a soft key or a visual key displayed on the touch screen through software processing, And a touch key disposed on the touch panel. Meanwhile, the virtual key or the visual key can be displayed on a touch screen having various forms, for example, a graphic, a text, an icon, a video, As shown in FIG.

Meanwhile, the sensing unit 140 senses at least one of information in the mobile terminal, surrounding environment information surrounding the mobile terminal, and user information, and generates a corresponding sensing signal. The control unit 180 may control the driving or operation of the mobile terminal 100 or may perform data processing, function or operation related to the application program installed in the mobile terminal 100 based on the sensing signal. Representative sensors among various sensors that may be included in the sensing unit 140 will be described in more detail.

First, the proximity sensor 141 refers to a sensor that detects the presence of an object approaching a predetermined detection surface, or the presence of an object in the vicinity of the detection surface, without mechanical contact by using electromagnetic force or infrared rays. The proximity sensor 141 may be disposed in the inner area of the mobile terminal or in proximity to the touch screen, which is covered by the touch screen.

Examples of the proximity sensor 141 include a transmission type photoelectric sensor, a direct reflection type photoelectric sensor, a mirror reflection type photoelectric sensor, a high frequency oscillation type proximity sensor, a capacitive proximity sensor, a magnetic proximity sensor, and an infrared proximity sensor. In the case where the touch screen is electrostatic, the proximity sensor 141 can be configured to detect the proximity of the object with a change of the electric field along the proximity of the object having conductivity. In this case, the touch screen (or touch sensor) itself may be classified as a proximity sensor.

On the other hand, for convenience of explanation, the act of recognizing that the object is located on the touch screen in proximity with no object touching the touch screen is referred to as "proximity touch & The act of actually touching an object on the screen is called a "contact touch. &Quot; The position at which the object is closely touched on the touch screen means a position where the object corresponds to the touch screen vertically when the object is touched. The proximity sensor 141 can detect a proximity touch and a proximity touch pattern (e.g., a proximity touch distance, a proximity touch direction, a proximity touch speed, a proximity touch time, a proximity touch position, have. Meanwhile, the control unit 180 processes data (or information) corresponding to the proximity touch operation and the proximity touch pattern sensed through the proximity sensor 141 as described above, and further provides visual information corresponding to the processed data It can be output on the touch screen. Furthermore, the control unit 180 can control the mobile terminal 100 such that different operations or data (or information) are processed according to whether the touch to the same point on the touch screen is a proximity touch or a touch touch .

The touch sensor uses a touch (or touch input) applied to the touch screen (or the display unit 151) by using at least one of various touch methods such as a resistance film type, a capacitive type, an infrared type, an ultrasonic type, Detection.

For example, the touch sensor may be configured to convert a change in a pressure applied to a specific portion of the touch screen or a capacitance generated in a specific portion to an electrical input signal. The touch sensor may be configured to detect a position, an area, a pressure at the time of touch, a capacitance at the time of touch, and the like where a touch object touching the touch screen is touched on the touch sensor. Here, the touch object may be a finger, a touch pen, a stylus pen, a pointer, or the like as an object to which a touch is applied to the touch sensor.

Thus, when there is a touch input to the touch sensor, the corresponding signal (s) is sent to the touch controller. The touch controller processes the signal (s) and transmits the corresponding data to the controller 180. Thus, the control unit 180 can know which area of the display unit 151 is touched or the like. Here, the touch controller may be a separate component from the control unit 180, and may be the control unit 180 itself.

On the other hand, the control unit 180 may perform different controls or perform the same control according to the type of the touch object touching the touch screen (or a touch key provided on the touch screen). Whether to perform different controls or to perform the same control according to the type of the touch object may be determined according to the current state of the mobile terminal 100 or an application program being executed.

On the other hand, the touch sensors and the proximity sensors discussed above can be used independently or in combination to provide a short touch (touch), a long touch, a multi touch, a drag touch ), Flick touch, pinch-in touch, pinch-out touch, swipe touch, hovering touch, and the like. Touch can be sensed.

The ultrasonic sensor can recognize the position information of the object to be sensed by using ultrasonic waves. Meanwhile, the controller 180 can calculate the position of the wave generating source through the information sensed by the optical sensor and the plurality of ultrasonic sensors. The position of the wave source can be calculated using the fact that the light is much faster than the ultrasonic wave, that is, the time when the light reaches the optical sensor is much faster than the time the ultrasonic wave reaches the ultrasonic sensor. More specifically, the position of the wave generating source can be calculated using the time difference with the time when the ultrasonic wave reaches the reference signal.

The camera 121 includes at least one of a camera sensor (for example, a CCD, a CMOS, etc.), a photo sensor (or an image sensor), and a laser sensor.

The camera 121 and the laser sensor may be combined with each other to sense a touch of the sensing object with respect to the three-dimensional stereoscopic image. The photosensor can be laminated to the display element, which is adapted to scan the movement of the object to be detected proximate to the touch screen. More specifically, the photosensor mounts photo diodes and TRs (Transistors) in a row / column and scans the contents loaded on the photosensor using an electrical signal that varies according to the amount of light applied to the photo diode. That is, the photo sensor performs coordinate calculation of the object to be sensed according to the amount of change of light, and position information of the object to be sensed can be obtained through the calculation.

The display unit 151 displays (outputs) information processed by the mobile terminal 100. For example, the display unit 151 may display execution screen information of an application program driven by the mobile terminal 100 or UI (User Interface) and GUI (Graphic User Interface) information according to the execution screen information .

Also, the display unit 151 may be configured as a stereoscopic display unit for displaying a stereoscopic image.

In the stereoscopic display unit, a three-dimensional display system such as a stereoscopic system (glasses system), an autostereoscopic system (no-glasses system), and a projection system (holographic system) can be applied.

The sound output unit 152 may output audio data received from the wireless communication unit 110 or stored in the memory 170 in a call signal reception mode, a call mode or a recording mode, a voice recognition mode, a broadcast reception mode, The sound output unit 152 also outputs sound signals related to functions (e.g., call signal reception sound, message reception sound, etc.) performed in the mobile terminal 100. [ The audio output unit 152 may include a receiver, a speaker, a buzzer, and the like.

The haptic module 153 generates various tactile effects that the user can feel. A typical example of the haptic effect generated by the haptic module 153 may be vibration. The intensity and pattern of the vibration generated in the haptic module 153 can be controlled by the user's selection or the setting of the control unit. For example, the haptic module 153 may synthesize and output different vibrations or sequentially output the vibrations.

In addition to vibration, the haptic module 153 may be configured to perform various functions such as a pin arrangement vertically moving with respect to the contact skin surface, a spraying force or suction force of the air through the injection port or the suction port, a touch on the skin surface, And various tactile effects such as an effect of reproducing a cold sensation using an endothermic or exothermic element can be generated.

The haptic module 153 can transmit the tactile effect through the direct contact, and the tactile effect can be felt by the user through the muscles of the finger or arm. The haptic module 153 may include two or more haptic modules 153 according to the configuration of the mobile terminal 100.

The light output unit 154 outputs a signal for notifying the occurrence of an event using the light of the light source of the mobile terminal 100. Examples of events that occur in the mobile terminal 100 may include message reception, call signal reception, missed call, alarm, schedule notification, email reception, information reception through an application, and the like.

The signal output from the light output unit 154 is implemented as the mobile terminal emits light of a single color or a plurality of colors to the front or rear surface. The signal output may be terminated by the mobile terminal detecting the event confirmation of the user.

The interface unit 160 serves as a path for communication with all external devices connected to the mobile terminal 100. The interface unit 160 receives data from an external device or supplies power to each component in the mobile terminal 100 or transmits data in the mobile terminal 100 to an external device. For example, a port for connecting a device equipped with a wired / wireless headset port, an external charger port, a wired / wireless data port, a memory card port, an audio I / O port, a video I / O port, an earphone port, and the like may be included in the interface unit 160.

The identification module is a chip for storing various information for authenticating the use right of the mobile terminal 100 and includes a user identification module (UIM), a subscriber identity module (SIM) A universal subscriber identity module (USIM), and the like. Devices with identification modules (hereinafter referred to as "identification devices") can be manufactured in a smart card format. Accordingly, the identification device can be connected to the terminal 100 through the interface unit 160. [

The interface unit 160 may be a path through which power from the cradle is supplied to the mobile terminal 100 when the mobile terminal 100 is connected to an external cradle, And various command signals may be transmitted to the mobile terminal 100. The various command signals or the power source input from the cradle may be operated as a signal for recognizing that the mobile terminal 100 is correctly mounted on the cradle.

The memory 170 may store a program for the operation of the controller 180 and temporarily store input / output data (e.g., a phone book, a message, a still image, a moving picture, etc.). The memory 170 may store data related to vibration and sound of various patterns outputted when a touch is input on the touch screen.

The memory 170 may be a flash memory type, a hard disk type, a solid state disk type, an SDD type (Silicon Disk Drive type), a multimedia card micro type ), Card type memory (e.g., SD or XD memory), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read memory, a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, and / or an optical disk. The mobile terminal 100 may operate in association with a web storage that performs the storage function of the memory 170 on the Internet.

Meanwhile, as described above, the control unit 180 controls the operations related to the application program and the general operation of the mobile terminal 100. [ For example, when the state of the mobile terminal meets a set condition, the control unit 180 can execute or release a lock state for restricting input of a user's control command to applications.

In addition, the control unit 180 performs control and processing related to voice communication, data communication, video call, or the like, or performs pattern recognition processing to recognize handwriting input or drawing input performed on the touch screen as characters and images, respectively . Further, the controller 180 may control any one or a plurality of the above-described components in order to implement various embodiments described below on the mobile terminal 100 according to the present invention.

The power supply unit 190 receives external power and internal power under the control of the controller 180 and supplies power necessary for operation of the respective components. The power supply unit 190 includes a battery, the battery may be an internal battery configured to be chargeable, and may be detachably coupled to the terminal body for charging or the like.

In addition, the power supply unit 190 may include a connection port, and the connection port may be configured as an example of an interface 160 through which an external charger for supplying power for charging the battery is electrically connected.

As another example, the power supply unit 190 may be configured to charge the battery in a wireless manner without using the connection port. In this case, the power supply unit 190 may use at least one of an inductive coupling method based on a magnetic induction phenomenon from an external wireless power transmission apparatus and a magnetic resonance coupling method based on an electromagnetic resonance phenomenon Power can be delivered.

In the following, various embodiments may be embodied in a recording medium readable by a computer or similar device using, for example, software, hardware, or a combination thereof.

Referring to FIGS. 1B and 1C, the disclosed mobile terminal 100 includes a bar-shaped terminal body. However, the present invention is not limited thereto and can be applied to various structures such as a folder type, a flip type, a slide type, a swing type, and a swivel type in which a watch type, a clip type, a glass type or two or more bodies are relatively movably coupled . A description of a particular type of mobile terminal, although relevant to a particular type of mobile terminal, is generally applicable to other types of mobile terminals.

Here, the terminal body can be understood as a concept of referring to the mobile terminal 100 as at least one aggregate.

The mobile terminal 100 includes a case (for example, a frame, a housing, a cover, and the like) that forms an appearance. As shown, the mobile terminal 100 may include a front case 101 and a rear case 102. Various electronic components are disposed in the inner space formed by the combination of the front case 101 and the rear case 102. At least one middle case may be additionally disposed between the front case 101 and the rear case 102.

A display unit 151 is disposed on a front surface of the terminal body to output information. The window 151a of the display unit 151 may be mounted on the front case 101 to form a front surface of the terminal body together with the front case 101. [

In some cases, electronic components may also be mounted on the rear case 102. Electronic parts that can be mounted on the rear case 102 include detachable batteries, an identification module, a memory card, and the like. In this case, a rear cover 103 for covering the mounted electronic components can be detachably coupled to the rear case 102. Therefore, when the rear cover 103 is separated from the rear case 102, the electronic parts mounted on the rear case 102 are exposed to the outside.

As shown, when the rear cover 103 is coupled to the rear case 102, a side portion of the rear case 102 can be exposed. In some cases, the rear case 102 may be completely covered by the rear cover 103 during the engagement. Meanwhile, the rear cover 103 may be provided with an opening for exposing the camera 121b and the sound output unit 152b to the outside.

These cases 101, 102, and 103 may be formed by injection molding of synthetic resin or may be formed of metal such as stainless steel (STS), aluminum (Al), titanium (Ti), or the like.

The mobile terminal 100 may be configured such that one case provides the internal space, unlike the above example in which a plurality of cases provide an internal space for accommodating various electronic components. In this case, a unibody mobile terminal 100 in which synthetic resin or metal is connected from the side to the rear side can be realized.

Meanwhile, the mobile terminal 100 may include a waterproof unit (not shown) for preventing water from penetrating into the terminal body. For example, the waterproof portion is provided between the window 151a and the front case 101, between the front case 101 and the rear case 102, or between the rear case 102 and the rear cover 103, And a waterproof member for sealing the inside space of the oven.

The mobile terminal 100 is provided with a display unit 151, first and second sound output units 152a and 152b, a proximity sensor 141, an illuminance sensor 142, a light output unit 154, Cameras 121a and 121b, first and second operation units 123a and 123b, a microphone 122, an interface unit 160, and the like.

1B and 1C, a display unit 151, a first sound output unit 152a, a proximity sensor 141, an illuminance sensor 142, an optical output unit (not shown) A second operation unit 123b, a microphone 122 and an interface unit 160 are disposed on a side surface of the terminal body, And a mobile terminal 100 having a second sound output unit 152b and a second camera 121b disposed on a rear surface thereof.

However, these configurations are not limited to this arrangement. These configurations may be excluded or replaced as needed, or placed on different planes. For example, the first operation unit 123a may not be provided on the front surface of the terminal body, and the second sound output unit 152b may be provided on the side surface of the terminal body rather than the rear surface of the terminal body.

The display unit 151 displays (outputs) information processed by the mobile terminal 100. For example, the display unit 151 may display execution screen information of an application program driven by the mobile terminal 100 or UI (User Interface) and GUI (Graphic User Interface) information according to the execution screen information .

The display unit 151 may be a liquid crystal display (LCD), a thin film transistor-liquid crystal display (TFT LCD), an organic light-emitting diode (OLED), a flexible display display, a 3D display, and an e-ink display.

In addition, the display unit 151 may exist in two or more depending on the embodiment of the mobile terminal 100. In this case, the mobile terminal 100 may be provided with a plurality of display portions spaced apart from each other or disposed integrally with one another, or may be disposed on different surfaces, respectively.

The display unit 151 may include a touch sensor that senses a touch with respect to the display unit 151 so that a control command can be received by a touch method. When a touch is made to the display unit 151, the touch sensor senses the touch, and the control unit 180 generates a control command corresponding to the touch based on the touch. The content input by the touch method may be a letter or a number, an instruction in various modes, a menu item which can be designated, and the like.

The touch sensor may be a film having a touch pattern and disposed between the window 151a and a display (not shown) on the rear surface of the window 151a, or may be a metal wire . Alternatively, the touch sensor may be formed integrally with the display. For example, the touch sensor may be disposed on a substrate of the display or inside the display.

In this way, the display unit 151 can form a touch screen together with the touch sensor. In this case, the touch screen can function as a user input unit 123 (see FIG. 1A). In some cases, the touch screen may replace at least some functions of the first operation unit 123a.

The first sound output unit 152a may be implemented as a receiver for transmitting a call sound to a user's ear and the second sound output unit 152b may be implemented as a loud speaker for outputting various alarm sounds or multimedia playback sounds. ). ≪ / RTI >

The window 151a of the display unit 151 may be provided with an acoustic hole for emitting the sound generated from the first acoustic output unit 152a. However, the present invention is not limited to this, and the sound may be configured to be emitted along an assembly gap (for example, a gap between the window 151a and the front case 101) between the structures. In this case, the appearance of the mobile terminal 100 can be made more simple because the hole formed independently for the apparent acoustic output is hidden or hidden.

The optical output unit 154 is configured to output light for notifying the occurrence of an event. Examples of the event include a message reception, a call signal reception, a missed call, an alarm, a schedule notification, an email reception, and reception of information through an application. The control unit 180 may control the light output unit 154 to terminate the light output when the event confirmation of the user is detected.

The first camera 121a processes an image frame of a still image or a moving image obtained by the image sensor in the photographing mode or the video communication mode. The processed image frame can be displayed on the display unit 151 and can be stored in the memory 170. [

The first and second operation units 123a and 123b may be collectively referred to as a manipulating portion as an example of a user input unit 123 operated to receive a command for controlling the operation of the mobile terminal 100 have. The first and second operation units 123a and 123b can be employed in any manner as long as the user is in a tactile manner such as touch, push, scroll, or the like. In addition, the first and second operation units 123a and 123b may be employed in a manner that the user operates the apparatus without touching the user through a proximity touch, a hovering touch, or the like.

In this figure, the first operation unit 123a is a touch key, but the present invention is not limited thereto. For example, the first operation unit 123a may be a mechanical key, or a combination of a touch key and a touch key.

The contents input by the first and second operation units 123a and 123b can be variously set. For example, the first operation unit 123a receives a command such as a menu, a home key, a cancellation, a search, and the like, and the second operation unit 123b receives a command from the first or second sound output unit 152a or 152b The size of the sound, and the change of the display unit 151 to the touch recognition mode.

On the other hand, a rear input unit (not shown) may be provided on the rear surface of the terminal body as another example of the user input unit 123. The rear input unit is operated to receive a command for controlling the operation of the mobile terminal 100, and input contents may be variously set. For example, commands such as power on / off, start, end, scrolling, and the like, the size adjustment of the sound output from the first and second sound output units 152a and 152b, And the like can be inputted. The rear input unit may be implemented as a touch input, a push input, or a combination thereof.

The rear input unit may be disposed so as to overlap with the front display unit 151 in the thickness direction of the terminal body. For example, the rear input unit may be disposed at the rear upper end of the terminal body such that when the user holds the terminal body with one hand, the rear input unit can be easily operated using the index finger. However, the present invention is not limited thereto, and the position of the rear input unit may be changed.

When a rear input unit is provided on the rear surface of the terminal body, a new type of user interface using the rear input unit can be realized. When the first operation unit 123a is not disposed on the front surface of the terminal body in place of at least a part of the functions of the first operation unit 123a provided on the front surface of the terminal body, The display unit 151 may be configured as a larger screen.

Meanwhile, the mobile terminal 100 may be provided with a fingerprint recognition sensor for recognizing the fingerprint of the user, and the controller 180 may use the fingerprint information sensed through the fingerprint recognition sensor as authentication means. The fingerprint recognition sensor may be embedded in the display unit 151 or the user input unit 123.

The microphone 122 is configured to receive the user's voice, other sounds, and the like. The microphone 122 may be provided at a plurality of locations to receive stereophonic sound.

The interface unit 160 is a path through which the mobile terminal 100 can be connected to an external device. For example, the interface unit 160 may include a connection terminal for connection with another device (for example, an earphone or an external speaker), a port for short-range communication (for example, an infrared port (IrDA Port), a Bluetooth port A wireless LAN port, or the like), or a power supply terminal for supplying power to the mobile terminal 100. The interface unit 160 may be implemented as a socket for receiving an external card such as a SIM (Subscriber Identification Module) or a UIM (User Identity Module) or a memory card for storing information.

And a second camera 121b may be disposed on a rear surface of the terminal body. In this case, the second camera 121b has a photographing direction which is substantially opposite to that of the first camera 121a.

The second camera 121b may include a plurality of lenses arranged along at least one line. The plurality of lenses may be arranged in a matrix form. Such a camera can be named an 'array camera'. When the second camera 121b is configured as an array camera, images can be taken in various ways using a plurality of lenses, and a better quality image can be obtained.

The flash 124 may be disposed adjacent to the second camera 121b. The flash 124 shines light toward the subject when the subject is photographed by the second camera 121b.

And a second sound output unit 152b may be additionally disposed in the terminal body. The second sound output unit 152b may implement a stereo function together with the first sound output unit 152a and may be used for implementing a speakerphone mode in a call.

The terminal body may be provided with at least one antenna for wireless communication. The antenna may be embedded in the terminal body or formed in the case. For example, an antenna constituting a part of the broadcast receiving module 111 (see FIG. 1A) may be configured to be able to be drawn out from the terminal body. Alternatively, the antenna may be formed in a film type and attached to the inner surface of the rear cover 103, or a case including a conductive material may be configured to function as an antenna.

The terminal body is provided with a power supply unit 190 (see FIG. 1A) for supplying power to the mobile terminal 100. The power supply unit 190 may include a battery 191 built in the terminal body or detachable from the outside of the terminal body.

The battery 191 may be configured to receive power through a power cable connected to the interface unit 160. In addition, the battery 191 may be configured to be wirelessly chargeable through a wireless charger. The wireless charging may be implemented by a magnetic induction method or a resonance method (magnetic resonance method).

The rear cover 103 is configured to be coupled to the rear case 102 so as to cover the battery 191 to restrict the release of the battery 191 and to protect the battery 191 from external impact and foreign matter . When the battery 191 is detachably attached to the terminal body, the rear cover 103 may be detachably coupled to the rear case 102.

The mobile terminal 100 may be provided with an accessory that protects the appearance or supports or expands the function of the mobile terminal 100. [ One example of such an accessory is a cover or pouch that covers or accommodates at least one side of the mobile terminal 100. [ The cover or pouch may be configured to interlock with the display unit 151 to expand the function of the mobile terminal 100. Another example of an accessory is a touch pen for supplementing or extending a touch input to the touch screen.

Meanwhile, the mobile terminal can be extended to a wearable device that can be worn on the body beyond the dimension that the user mainly grasps and uses. These wearable devices include smart watch, smart glass, and head mounted display (HMD). Hereinafter, examples of a mobile terminal extended to a wearable device will be described.

The wearable device can be made to be able to exchange (or interlock) data with another mobile terminal 100. The short range communication module 114 can detect (or recognize) a wearable device capable of communicating with the mobile terminal 100. If the detected wearable device is a device authenticated to communicate with the mobile terminal 100, the control unit 180 may transmit at least a part of the data processed by the mobile terminal 100 to the wearable device 100 via the short- Lt; / RTI > Accordingly, the user can use the data processed by the mobile terminal 100 through the wearable device. For example, when a telephone is received in the mobile terminal 100, it is possible to perform a telephone conversation via the wearable device or to confirm the received message via the wearable device when a message is received in the mobile terminal 100 .

2 is a perspective view showing an example of a glass-type mobile terminal 200 according to another embodiment of the present invention.

The glass-type mobile terminal 200 is configured to be worn on the head of a human body, and a frame portion (a case, a housing, etc.) for the mobile terminal 200 may be provided. The frame portion may be formed of a flexible material to facilitate wearing. In this figure, it is illustrated that the frame portion includes a first frame 201 and a second frame 202 of different materials. In general, mobile terminal 200 may include features of mobile terminal 100 of Figures 1A-1C or similar features.

The frame portion is supported on the head portion, and a space for mounting various components is provided. As shown in the figure, electronic parts such as the control module 280, the sound output module 252, and the like may be mounted on the frame part. In addition, a lens 203 covering at least one of the left and right eyes may be detachably mounted on the frame portion.

The control module 280 controls various electronic components included in the mobile terminal 200. The control module 280 can be understood as a configuration corresponding to the control unit 180 described above. In this figure, the control module 280 is provided on the frame portion on one side of the head. However, the position of the control module 280 is not limited thereto.

The display unit 251 may be implemented as a head mounted display (HMD). The HMD type refers to a display method that is mounted on a head and displays an image directly in front of the user's eyes. When the user wears the glass-type mobile terminal 200, the display unit 251 may be arranged to correspond to at least one of the left eye and the right eye so that the user can directly provide an image in front of the user's eyes. In this figure, the display unit 251 is located at a portion corresponding to the right eye so that an image can be output toward the user's right eye.

The display unit 251 can project an image with the user's eyes using a prism. Further, the prism may be formed to be transmissive so that the user can view the projected image and the general view of the front (the range that the user views through the eyes) together.

As described above, the image output through the display unit 251 can be overlapped with the general view. The mobile terminal 200 can provide an Augmented Reality (AR) in which a virtual image is superimposed on a real image or a background and displayed as a single image by using the characteristics of the display.

The camera 221 is disposed adjacent to at least one of the left eye and the right eye, and is configured to photograph a forward image. Since the camera 221 is located adjacent to the eye, the camera 221 can acquire a scene viewed by the user as an image.

Although the camera 221 is provided in the control module 280 in this figure, it is not limited thereto. The camera 221 may be installed in the frame part, or may be provided in a plurality of ways to acquire a stereoscopic image.

The glass-type mobile terminal 200 may include user input units 223a and 223b operated to receive control commands. The user input units 223a and 223b can be employed in any tactile manner, such as a touch or a push, in a tactile manner. This figure illustrates that the frame unit and the control module 280 are provided with user input units 223a and 223b of a push and touch input method, respectively.

In addition, the glass-type mobile terminal 200 may be provided with a microphone (not shown) for receiving sound and processing it as electrical voice data and an acoustic output module 252 for outputting sound. The sound output module 252 may be configured to transmit sound in a general sound output mode or a bone conduction mode. When the sound output module 252 is implemented in a bone conduction manner, when the user wears the mobile terminal 200, the sound output module 252 is brought into close contact with the head and vibrates the skull to transmit sound. In addition, although not shown, a microphone 222 for receiving a user's voice or a surrounding sound may be additionally provided.

For convenience of explanation, it is assumed that the mobile terminal referred to below includes at least one of the components shown in FIG. However, in the embodiments of the present invention, it is preferable that the camera provided in the mobile terminal is oriented in a direction similar to the direction of the user's eyes. Unless otherwise stated, the following mobile terminal is a glass- I suppose. However, in a state where the glass-type mobile terminal 200 is connected to the smart-phone-type mobile terminal 100 and data exchange is possible, information acquired and / or displayed by the glass- 100 and displayed on the touch screen 151 of the mobile terminal. In this case, the glass-type mobile terminal 200 may be controlled according to a command input on the touch screen 151 of the mobile terminal. Of course, if the function is not available only in the glass-type mobile terminal 200, the functions according to the present invention can be performed in the mobile terminal 100 of the smartphone type as it is.

Meeting record by speaker

According to an embodiment of the present invention, there is provided a mobile terminal capable of recognizing one or more speakers including a user at the time of a conference, recording the utterance for each recognized speaker, and performing various additional functions using the same, and a control method thereof do. This will be described with reference to FIG.

3 illustrates an example of a process of recording a conference by distinguishing a speaker in a mobile terminal according to an embodiment of the present invention.

Referring to FIG. 3, the control unit 280 may enter the conference recording mode. Here, the conference recording mode may mean a mode for recognizing one or more speakers including the user according to the present invention, and recording the utterance contents for each recognized speaker. This mode can be implemented in the form of an application, and in this case, the mode can be entered by executing the corresponding application. In addition, when the user input unit on the mobile terminal is operated in a predetermined form, the control unit 280 can enter the present mode. Upon entering the mode, the control unit 280 can activate the microphone 222 and the camera 221 until the mode is terminated.

In the conference recording mode, the control unit 280 can recognize the speaker (S320). Here, the speaker refers to a person who recognizes the speech as an object to which the control unit 280 classifies the speech, rather than simply a person who has spoken since the conference recording mode was activated. When a user of the mobile terminal 200 inputs a specific command when the mouth of a user moves within the shooting range of the camera while making a sound when the user stays within the shooting range of the camera 221, It can be recognized by the control unit 280 when it stays in the range for a predetermined time or the like. Once recognized, the speaker can be distinguished from other speakers at a later time by facial recognition or voice characteristic recognition in the control unit 280. [

The control unit 280 may record the sound of the recognized speaker in the memory 170 (S330). At this time, the sound itself may be recorded, or the sound may be converted to text through a STT (speech to text) process at the same time as or after the recording. In addition, the sound may be recorded in the form of an acoustic file, or may be recorded together with a conference image in the form of a moving picture file. In addition, it is preferable that the control unit 180 records sounds separately for each speaker in the process.

Thereafter, the control unit 280 determines which of the recognized speakers is currently speaking, and displays the visual effect corresponding to the determined current speaker's sound through the display 251 (S340). Here, the visual effect may include a recorded sound length, a size, a speech state, a keyword detected in a speech, a speaker classification information, and the like.

If the user wishes to confirm the details of the recorded sound during the conference recording mode (S350), the control unit 280 may convert the recorded sound into text and display it on the display 251 in a predetermined form (S360). In this process, the user may delete a part of the recorded sound or give specific attributes (for example, prohibition of deletion, etc.).

When a command for ending the conference recording mode is inputted, the conference recording mode may be terminated (370), and whether the additional function is performed or not may be determined by the user (S380). Additional functions that can be performed will be described later in more detail.

Hereinafter, the processes of FIG. 3 will be described in detail with reference to the drawings. In the following drawings including FIG. 3, there is shown a situation in which a user's visual field overlaps with an image output through the display 251 while the user wears the glass-type mobile terminal 200 I suppose. Of course, in the case where the functions according to the embodiments are implemented in a mobile terminal of a smartphone type, the image captured through the camera and the visual effect corresponding to the recorded sound may be displayed on the display 151 together.

FIG. 4 illustrates an example of a visual effect according to a state of a recognized speaker and recognizing a speaker in a mobile terminal according to an exemplary embodiment of the present invention.

Referring to FIG. 4 (a), the user gazes at another person 410 while wearing the glass-type mobile terminal 200. At this time, as the person 410 starts to speak, the mouth 411 moves as shown in FIG. 4 (b). The control unit 280 senses the movement of the mouth through the camera 221, and at the same time, when a voice of a certain size or larger is detected through the microphone 222, the controller 280 can recognize a person who is speaking now as a speaker. The control unit 280 can distinguish the recognized speaker from other speakers through voice or facial recognition.

The visual effect 430 (here a border effect) indicating that the speech is recognized as a speaker and the speech bubble 420 may be output through the display 251 such that the speech 410 appears around the speaker 410 (similar to the AR) . At this time, a predetermined visual effect 421 may be given to the speech balloon 420 corresponding to the length of the utterance. For example, when the speaker 410 speaks continuously, the visual effect 421 'becomes longer as shown in FIG. 4 (c). On the other hand, if the speaker 420 does not have a word or if the voice has a certain size or less, an ellipsis mark 422 may be displayed in the speech bubble. If the mobile terminal 200 simultaneously performs the STT function on the speaker's voice in real time, the visual effect 421 corresponding to the utterance length may be replaced with the text that has undergone the STT process.

FIG. 5 illustrates an example of a visual effect according to a state of a recognized speaker and recognizing a speaker in a mobile terminal according to an exemplary embodiment of the present invention.

Referring to FIG. 5A, a predetermined shape 510 may be displayed at a fixed position of the display 251. It is possible for the user to keep the person 410 who intends to register as a speaker in the graphic form for a predetermined time or longer in the graphic form as shown in FIG. 5B (i.e., gazing while wearing the glass type mobile terminal). In this case, the figure 510 can be changed to a visual effect 510 'indicating that the figure 510 is recognized as a speaker, as shown in FIG. 5C. Accordingly, the speech balloon 420 is displayed around the speaker as shown in (d) of FIG. 5, and the visual effect 421 can be displayed in the inside of the speech balloon 420 as the utterance progresses.

Alternatively, a speaker may be recognized in the control unit 280 by a method of operating the user input unit in a specific pattern when the speaker is located within the shooting range of the camera.

FIG. 6 shows an example of a visual effect applied according to a speaker in a mobile terminal according to an embodiment of the present invention.

Referring to FIG. 6A, the speech balloon 620 is filled with the recognized utterer 610 utterance. At this time, the status bar 630 may be displayed on the speaker, and the conference title / theme and / or the keyword frequently referred to by the speaker may be extracted and displayed as text in the status bar. For example, on the left side of the status bar 630, a keyword commonly referred to by the conference title / topic or speakers entered by the user can be displayed, and the keyword mentioned by the speaker can be displayed on the right side.

Further, in the status bar 630, the identification information 641 of the speaker 641 currently speaking (and recognized by the control unit) can be displayed. If the face identification information stored in the phone book of the memory 170 matches the face identification information previously stored in the memory 170, the name on the phone book may be displayed. Otherwise, the numbering result may be displayed according to a predetermined rule.

If the control unit 280 determines that the user of the mobile terminal is speaking not through the voice recognition of the user, the color given to the status bar 630 'may be changed as shown in FIG. 6B And identification information 642 indicating that the user is a user can be displayed. If additional information is set by the user, an indicator 650 indicating the additional information may be displayed. The additional information set by the user will be described later in more detail with reference to FIG.

FIG. 7 illustrates an example of a visual effect according to a speech recording in a mobile terminal according to an embodiment of the present invention.

7A, the speech balloon 710 displayed around the recognized speaker shows, in addition to the length of the speech, whether or not the user was looking at the speaker at the time of speech, . That is, the controller 180 can determine whether the user is looking at the speaker by determining whether the speaker is recognized in the image photographed through the camera 221 when the speaker is tracked by sound.

Specifically, the inside of the speech balloon is divided into two regions. In the upper part, a visual effect 720 corresponding to the utterance length is displayed when the user watches the utterance. When the user does not see the bottom effect, (730) may be displayed. Here, an indicator 721 indicating that the user is watching is displayed in the visual effect 720 at the top. Further, when the utterance length exceeds the corresponding region, or when the utterance length is uttered plural times over a predetermined time interval, visual effects for each unit utterance can be displayed in a stacked form.

7 (b), the cursor 740 can be displayed through the operation of the user input section 223a / b. The cursor 740 can be displayed in correspondence with the operation direction of the user input section 223a / The utterance corresponding to the visual effect can be deleted or stored. For example, if a command corresponding to the left direction is input to the visual effect 730 corresponding to a specific utterance, the utterance record corresponding to the visual effect can be deleted. If a command corresponding to the right direction is input, Quot; can be stored separately.

When the speech balloon 710 is displayed on the touch screen 151 of the mobile terminal 100, the user can touch the visual effect 730 corresponding to the specific speech in one direction as shown in FIGS. 7C and 7D, - You can delete or save the record of the statement by entering the drag command. Also, in this case, indicators 731 and 732 corresponding to the processing according to the touch-drag direction may be displayed.

FIG. 8 shows an example of an additional function that can be performed in a mobile terminal according to an embodiment of the present invention.

Referring to FIG. 8A, the status bar 810 may be displayed as the user executes the conference recording mode. If the user is speaking, the identification information 811 of the user may be displayed on the status bar. When the mode is being executed, when the user looks at the document 820 on the upper side of the book, as shown in FIG. 8B, the document is also taken in the camera 221. At this time, when a specific mark is detected in the photographed image, the control unit 280 optically reads (OCR) the periphery of the mark to acquire the content described in the document and perform a function corresponding to the mark. For example, when the mark is the check mark 821, the control unit 280 determines that the mark is to-do, and can input the obtained content to a certain application. Accordingly, an indicator 812 indicating that the schedule is added can be displayed on the status bar 810 as shown in FIG. 8C, and a pop-up window 830 including the added contents can be displayed for a predetermined time . Even before a predetermined time elapses, the pop-up window 830 may disappear even if the user lifts his head again (detected by a G sensor or a gyro sensor) or sees a speaker as shown in FIG. 8D.

On the other hand, when the text in which the utterance has been directly converted is not displayed in the speech balloon, the written utterance may be converted into text and displayed in accordance with the user's command input. For example, when a speech balloon is displayed on the display 251, when a command of a specific pattern is input on the user input unit, the controller 280 performs an STT conversion operation on the speech record of the speaker corresponding to the speech balloon , And display the result in a separate area outside the speech balloon or outside the speech balloon. If the recording mode of the conference is performed through the mobile terminal 100 of the smartphone type or the speech balloon information is transmitted to the mobile terminal 100, the STT conversion and output command may be transmitted to the touch screen 151 of the mobile terminal 100 ) By touching the speech balloon.

FIG. 9 shows an example of a form in which a recorded speech is converted and output in a mobile terminal according to an embodiment of the present invention.

9 (a), a speech balloon 920 displayed on the periphery of the speaker 910 recognized through the touch screen 151 during execution of the conference recording mode is selected, or the speech balloon 920 displayed on the user input unit 223a / When a command of a specific pattern is input, the speech record corresponding to the speech balloon can be converted into text through the STT process. Accordingly, as shown in FIG. 9B, the converted text is displayed in one area 940 of the display, and an indicator 930 indicating whether the currently displayed text corresponds to the utterance of the speaker is displayed on the speech balloon Can be displayed. When the size of the area 940 in which the text is displayed is fixed, the converted text can be automatically scrolled as shown in FIG. 9C, and the position of the indicator 930 can be changed accordingly. On the other hand, when the speaker 910 starts speaking again, scrolling is stopped as shown in (d) of FIG. 9, and a message 950 indicating this can be displayed. Of course, scrolling can be resumed when the speech is interrupted.

FIG. 10 shows another example of a form in which a recorded speech is converted and output in a mobile terminal according to an embodiment of the present invention.

In FIG. 10, only the area 1010 corresponding to the area 940 in which the converted text described in FIG. 9 is displayed will be shown for the sake of simplicity.

Referring to FIG. 10 (a), a recorded speech for a specific speaker may be managed and converted for each speech unit, and displayed in a list form in one area 1010 of the display. The speech unit may be divided into recordings until a time point when the utterance is stopped for a predetermined time or more since the utterance of the utterance is started, and information indicating the time at which the utterance is recorded based on the current time may be displayed in each utterance unit item . In addition, each item of the list can be initially displayed in a fixed size, and when an item 1013 in which the entire text can not be displayed within a fixed size is selected, And can be displayed (1013 ').

On the other hand, if the specific item 1014 is dragged to the left as shown in (c) of FIG. 10, an icon 1021 indicating that the item is deleted can be displayed. If the drag distance is a certain distance or more, Be deleted. 10 (d), if an item 1013 is dragged to the right, an icon 1022 indicating that the item is stored can be displayed. If the drag distance is a certain distance or more, the corresponding statement can be stored separately have.

Next, an additional function that can be performed after the conference recording is completed will be described with reference to FIG.

FIG. 11 illustrates an example of a process of selecting an additional function according to the end of speech recording in the mobile terminal according to an embodiment of the present invention.

As described above, the conference recording mode may be terminated through a method such that the user looks down the desk with his / her head and operates the user input unit. The control unit 280 can stop the speaker recognition and recording through the microphone 222 and the camera 221 as the conference recording mode is terminated. In addition, the control unit 280 may display icons 1111 to 1114 corresponding to the respective speakers on the display 251 as shown in FIG. 11 (a). Each icon may be provided with an indicator corresponding to the amount of the written statement and / or the identification information of the speaker. If the transmission icon 115 is selected in this state, the control unit 180 can transmit the recorded speech record to the remaining speakers except the user. At this time, the type of the transmitted data may be acoustic data or text converted from acoustic data. Of course, the two may be transmitted together, or the whole speech record of the other speakers may be transmitted to each speaker, or only the speech record of one's own may be transmitted to each speaker. The control unit 180 can use the pre-stored contacts when the contacts of the respective speakers are confirmed through the facial recognition in the specification of the transmission destination (for example, the telephone number or the mail address, the messenger ID, the network address, Otherwise, the transmission destination may be input from the user.

11 (b), when the user calls the cursor 1120 and places it on the icon corresponding to a specific speaker, the statement list 1130 of the speaker can be displayed. If there is a schedule set by the user, the control unit 180 may display the set schedule information 1140 if the cursor is positioned on the icon corresponding to the user.

The present invention described above can be implemented as computer readable codes on a medium on which a program is recorded. The computer readable medium includes all kinds of recording devices in which data that can be read by a computer system is stored. Examples of the computer readable medium include a hard disk drive (HDD), a solid state disk (SSD), a silicon disk drive (SDD), a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, , And may also be implemented in the form of a carrier wave (e.g., transmission over the Internet). In addition, the computer may include a control unit 180 of the terminal. Accordingly, the above description should not be construed in a limiting sense in all respects and should be considered illustrative. The scope of the present invention should be determined by rational interpretation of the appended claims, and all changes within the scope of equivalents of the present invention are included in the scope of the present invention.

Claims (20)

display;
MIC;
camera; And
Recognizing at least one speaker by using at least one of the sound inputted through the microphone and the image photographed through the camera, and recording sound for each of the recognized at least one speaker, And a control unit for controlling the visual effect corresponding to the sound of the current speaker to be displayed through the display.
The method according to claim 1,
Wherein,
And recognizes the at least one speaker using at least one of a position and a mouth motion in an image photographed through the camera.
3. The method of claim 2,
Wherein,
Wherein the current speaker is determined through facial recognition performed on the sound input through the microphone and the image captured through the camera.
The method according to claim 1,
Wherein,
The first visual effect corresponding to whether or not the current speaker speaks and the utterance length is displayed.
5. The method of claim 4,
Wherein the mobile terminal is a glass type mobile terminal,
If the display is a light transmissive head-mounted display (HMD)
Wherein,
Determines the position of the current speaker through the camera and controls the display to display the first visual effect adjacent to the current speaker at a user's viewpoint using the determined position of the speaker.
5. The method of claim 4,
Wherein,
Wherein the control unit controls the keyword so that the keyword and the identification information of the current speaker are displayed on the first area of the display.
5. The method of claim 4,
Wherein,
And controls the second visual effect to be further displayed in the first visual effect depending on whether or not the current speaker is being photographed by the camera during a speech.
5. The method of claim 4,
Wherein,
If the first visual effect is selected through user input,
Converts the recorded sound of the current speaker corresponding to the first visual effect into text and controls the display to be displayed in the second area of the display.
9. The method of claim 8,
Wherein,
And displays the converted text in units of speech according to a predetermined criterion.
The method according to claim 1,
Wherein,
Performing an optical reading (OCR) on an image around the specific landmark if a specific landmark is detected in the image photographed through the camera, and storing the result of the optical landing in association with the application corresponding to the specific landmark , A mobile terminal.
Recognizing at least one speaker using at least one of a sound input through a microphone and an image captured through a camera;
Recording the sound for each of the recognized speakers; And
And displaying the visual effect corresponding to the utterance of the current speaker among the recognized at least one speaker through a display.
12. The method of claim 11,
Wherein the recognizing comprises:
And recognizing the at least one speaker using at least one of a position and a mouth motion in an image photographed through the camera.
13. The method of claim 12,
Further comprising the step of determining the current speaker through facial recognition performed on the sound input through the microphone and the image captured through the camera.
12. The method of claim 11,
Further comprising the step of displaying a first visual effect corresponding to whether or not the current speaker speaks and the utterance length.
15. The method of claim 14,
Wherein the mobile terminal is a glass type mobile terminal,
If the display is a light transmissive head-mounted display (HMD)
Wherein the step of displaying the first visual effect comprises:
Determining a position of the current speaker through the camera; And
And displaying the first visual effect adjacent to the current speaker at a user's point of view using the determined position of the speaker.
15. The method of claim 14,
Further comprising the step of displaying, in a first area of the display, a keyword detected more than a predetermined number of times in the sound of the current speaker and the identification information of the current speaker.
15. The method of claim 14,
Further comprising displaying the second visual effect on the first visual effect in accordance with whether or not the current speaker is being photographed by the camera during the speech.
15. The method of claim 14,
Selecting the first visual effect via user input;
Converting the recorded sound of the current speaker corresponding to the first visual effect into text; And
And displaying the converted text in a second area of the display.
19. The method of claim 18,
Wherein the converted text displayed in the second area is a text area,
Wherein said display unit is divided into speech units according to a predetermined criterion.
12. The method of claim 11,
Wherein,
Performing an optical reading (OCR) on an image around the specific landmark if a specific landmark is detected in the image photographed through the camera, and storing the result of the optical landing in association with the application corresponding to the specific landmark , And a control method of the mobile terminal.
KR1020140103849A 2014-08-11 2014-08-11 Mobile terminal and method for controlling the same KR20160019279A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020140103849A KR20160019279A (en) 2014-08-11 2014-08-11 Mobile terminal and method for controlling the same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020140103849A KR20160019279A (en) 2014-08-11 2014-08-11 Mobile terminal and method for controlling the same

Publications (1)

Publication Number Publication Date
KR20160019279A true KR20160019279A (en) 2016-02-19

Family

ID=55448727

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020140103849A KR20160019279A (en) 2014-08-11 2014-08-11 Mobile terminal and method for controlling the same

Country Status (1)

Country Link
KR (1) KR20160019279A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20170126667A (en) * 2016-05-10 2017-11-20 삼성에스디에스 주식회사 Method for generating conference record automatically and apparatus thereof
KR20190016683A (en) * 2017-08-09 2019-02-19 (주)에스엠인스트루먼트 Apparatus for automatic conference notetaking using mems microphone array
KR20210054157A (en) * 2019-11-05 2021-05-13 두산중공업 주식회사 Apparatus and method for producing conference record
WO2021206415A1 (en) * 2020-04-10 2021-10-14 Samsung Electronics Co., Ltd. Electronic device for communicating in augmented reality and method thereof

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20170126667A (en) * 2016-05-10 2017-11-20 삼성에스디에스 주식회사 Method for generating conference record automatically and apparatus thereof
KR20190016683A (en) * 2017-08-09 2019-02-19 (주)에스엠인스트루먼트 Apparatus for automatic conference notetaking using mems microphone array
KR20210054157A (en) * 2019-11-05 2021-05-13 두산중공업 주식회사 Apparatus and method for producing conference record
WO2021206415A1 (en) * 2020-04-10 2021-10-14 Samsung Electronics Co., Ltd. Electronic device for communicating in augmented reality and method thereof
US11481965B2 (en) 2020-04-10 2022-10-25 Samsung Electronics Co., Ltd. Electronic device for communicating in augmented reality and method thereof

Similar Documents

Publication Publication Date Title
KR20170010494A (en) Mobile terminal and method for controlling the same
KR20170128820A (en) Mobile terminal and method for controlling the same
KR20160092363A (en) Mobile terminal and method for controlling the same
KR20170131101A (en) Mobile terminal and method for controlling the same
KR20170025177A (en) Mobile terminal and method for controlling the same
KR20170013083A (en) Mobile terminal and method for controlling the same
KR20170014356A (en) Mobile terminal and method of controlling the same
KR20160061156A (en) Mobile terminal and method for controlling the same
KR20170001329A (en) Mobile terminal and method for controlling the same
KR101510704B1 (en) Mobile terminal and control method for the mobile terminal
KR20160019279A (en) Mobile terminal and method for controlling the same
KR20170115863A (en) Mobile terminal and method for controlling the same
KR20170021514A (en) Display apparatus and controlling method thereof
KR20160006518A (en) Mobile terminal
KR20170075579A (en) Mobile terminal and method for controlling the same
KR20160031336A (en) Mobile terminal and method for controlling the same
KR20150093519A (en) Mobile terminal and method for controlling the same
KR20170090072A (en) Mobile terminal and method for controlling the same
KR20170014193A (en) Mobile terminal and method for controlling the same
KR20170020070A (en) Mobile terminal and method for controlling the same
KR20160089782A (en) Mobile terminal and method for controlling the same
KR20170017577A (en) Mobile terminal and method for controlling the same
KR20160067696A (en) Mobile terminal and method for controlling the same
KR101680809B1 (en) Mobile terminal and method for controlling the same
KR20160087307A (en) Mobile terminal and the control method thereof

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
AMND Amendment