KR20230117877A

KR20230117877A - Electronic device for generating composed audio video (av) file in which audio and video are synchronized

Info

Publication number: KR20230117877A
Application number: KR1020220014159A
Authority: KR
Inventors: 김봉곤; 엄지혜; 김훈; 이경일; 최종찬; 최진호; 허영근; 김한상; 김현술; 이정원
Original assignee: 삼성전자주식회사
Priority date: 2022-02-03
Filing date: 2022-02-03
Publication date: 2023-08-10

Abstract

다양한 실시예에서 전자 장치는, 카메라들; 마이크; 상기 카메라들과 상기 마이크에 작동적으로 연결된 프로세서; 및 상기 프로세서에 작동적으로 연결된 메모리를 포함할 수 있다. 상기 메모리는, 실행될 때, 상기 프로세서가: 상기 카메라들 중 제1카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제1 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제1 오디오 파일을 생성하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 제1 AV 파일로 결합하여 상기 메모리에 저장하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 생성하는 동일한 시간대에, 상기 카메라들 중 제2카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제2 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제2 오디오 파일을 생성하고, 상기 제2 비디오 파일과 상기 제2 오디오 파일을 제2 AV 파일로 결합하여 상기 메모리에 저장하고, 상기 제1 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 비디오 타임스탬프, 상기 제1 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 오디오 타임스탬프, 상기 제2 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 비디오 타임스탬프, 및 상기 제2 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 오디오 타임스탬프를 상기 메모리에 저장하고, 상기 메모리에 동 시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제1 타임스탬프에 기초하여 비동기화 비디오 프레임 구간을 결정하고, 상기 AV 파일들에 포함된 오디오 타임스탬프들 중에서 또는 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제2 타임스탬프에 기초하여 비동기화 오디오 프레임 구간을 결정하고, 상기 AV 파일들마다 비동기화 비디오 프레임 구간에 속하지 않는 비디오 프레임들을 디코딩함으로써 복수의 비디오 신호들을 생성하고, 상기 복수의 비디오 신호들을 하나의 비디오 신호로 합성하고, 하나로 합성된 상기 비디오 신호를 인코딩함으로써 합성 비디오 파일을 생성하고, 상기 AV 파일들마다 비동기화 오디오 프레임 구간에 속하지 않은 오디오 프레임들을 디코딩함으로써 복수의 오디오 신호들을 생성하고, 상기 복수의 오디오 신호들을 하나의 오디오 신호로 합성하고, 하나로 합성된 상기 오디오 신호를 인코딩함으로써 합성 오디오 파일을 생성하고, 상기 합성 비디오 파일과 상기 합성 오디오 파일을 합성 AV 파일로 결합하여 상기 메모리에 저장하도록 하는 인스트럭션들을 저장할 수 있다. 그 외에도, 다양한 실시예들이 가능하다.In various embodiments, an electronic device may include cameras; mike; a processor operatively connected to the cameras and the microphone; and a memory operatively coupled to the processor. The memory, when executed, causes the processor to: generate a first video file by encoding a video signal received from a first one of the cameras frame by frame, and encode an audio signal received from the microphone frame by frame A first audio file is generated, the first video file and the first audio file are combined into a first AV file, stored in the memory, and the first video file and the first audio file are generated at the same time. , A second video file is generated by encoding a video signal received from a second camera of the cameras in units of frames, and a second audio file is generated by encoding an audio signal received from the microphone in units of frames, 2 video files and the second audio file are combined into a second AV file and stored in the memory, and a first video timestamp indicating a time point at which a first frame was generated in the first video file, a first video timestamp in the first audio file A first audio timestamp indicating when a frame was created, a second video timestamp indicating when a first frame was created in the second video file, and a second audio timestamp indicating when a first frame was created in the second audio file. An audio timestamp is stored in the memory, and a relatively slow point in time among video timestamps included in AV files or among video timestamps and audio timestamps included in AV files stored in the memory at the same time period Determines an unsynchronized video frame period based on a first timestamp indicating a relatively slow among audio timestamps included in the AV files or among video timestamps and audio timestamps included in the AV files. An asynchronous audio frame interval is determined based on a second timestamp indicating a point of view, a plurality of video signals are generated by decoding video frames not belonging to the asynchronous video frame interval for each of the AV files, and the plurality of video signals are generated. synthesized into one video signal, generating a synthesized video file by encoding the synthesized video signal, and generating a plurality of audio signals by decoding audio frames that do not belong to an asynchronous audio frame interval for each AV file, , synthesizing the plurality of audio signals into one audio signal, generating a synthesized audio file by encoding the synthesized audio signal, combining the synthesized video file and the synthesized audio file into a synthesized AV file, and storing the synthesized audio signal in the memory. You can store instructions that cause it to be saved. Besides that, various embodiments are possible.

Description

Electronic device for generating a composite audio video (AV) file in which audio and video are synchronized

다양한 실시예는 복수의 오디오 비디오(이하, AV(audio video)) 파일들을 하나의 AV 파일로 합성하고 AV 파일을 재생하는 전자 장치에 관한 것이다. Various embodiments relate to an electronic device that synthesizes a plurality of audio video (hereinafter referred to as AV (audio video)) files into one AV file and reproduces the AV file.

전자 장치(예: 스마트 폰)은 복수의 카메라들을 구비할 수 있다. 예컨대, 전자 장치는 전자 장치의 전면에 배치된 전면 카메라와 후면에 배치된 하나 이상의 후면 카메라를 포함할 수 있다. An electronic device (eg, a smart phone) may include a plurality of cameras. For example, the electronic device may include a front camera disposed on the front side of the electronic device and one or more rear cameras disposed on the rear side of the electronic device.

전자 장치는 복수의 레코더들을 구비할 수 있다. 예를 들어, 제1 레코더는 제1 카메라(예: 전면 카메라)로부터 수신되는 비디오 신호를 인코딩함으로써 제1 비디오 파일을 생성하고 마이크로부터 수신된 오디오 신호를 인코딩함으로써 제1 비디오 파일에 동기화된 제2 오디오 파일을 생성할 수 있다. 제1 레코더는 제1 비디오 파일과 제1 오디오 파일을 하나의 제1 AV 파일로 결합하여 컨테이너(예: MP4)에 포함(또는, 저장)시킬 수 있다. 제2 레코더는 제2 카메라(예: 후면 카메라)로부터 수신되는 비디오 신호를 인코딩함으로써 제2 비디오 파일을 생성하고 마이크로부터 수신된 오디오 신호를 인코딩함으로써 제2 비디오 파일에 동기화된 제2 오디오 파일을 생성할 수 있다. 제2 레코더는 제2 비디오 파일과 제2 오디오 파일을 제2 AV 파일로 결합하여 컨테이너에 포함시킬 수 있다.An electronic device may include a plurality of recorders. For example, the first recorder generates a first video file by encoding a video signal received from a first camera (eg, a front camera) and generates a second video file synchronized to the first video file by encoding an audio signal received from a microphone. You can create audio files. The first recorder may combine the first video file and the first audio file into one first AV file and include (or store) the first AV file in a container (eg, MP4). The second recorder generates a second video file by encoding a video signal received from a second camera (eg, a rear camera) and generates a second audio file synchronized with the second video file by encoding an audio signal received from a microphone. can do. The second recorder may combine the second video file and the second audio file into the second AV file and include them in the container.

전자 장치는 복수의 AV 파일들을 하나의 AV 파일로 합성하는 편집기(editor)를 포함할 수 있다. 예컨대, 편집기는 제1 AV 파일을 디코딩함으로써 제1 비디오 신호와 제1 오디오 신호를 획득하고 제2 AV 파일을 디코딩함으로써 제2 비디오 신호와 제2 오디오 신호를 획득할 수 있다. 편집기는 제1 비디오 신호와 제2 비디오 신호를 하나의 비디오 신호로 합성하고, 합성된 비디오 신호를 인코딩함으로써 제3 비디오 파일을 생성할 수 있다. 편집기는 제1 오디오 신호와 제2 오디오 신호를 하나의 오디오 신호로 합성하고, 합성된 오디오 신호를 인코딩함으로써 제3 오디오 파일을 생성할 수 있다. 편집기는 제3 비디오 파일과 제3 오디오 파일을 제3 AV 파일로 결합하여 컨테이너에 포함시킬 수 있다. 전자 장치는 편집 결과물로서 얻은 제3 AV 파일을 스피커와 디스플레이를 통해 재생할 수 있다.The electronic device may include an editor for synthesizing a plurality of AV files into one AV file. For example, the editor may obtain the first video signal and the first audio signal by decoding the first AV file, and obtain the second video signal and the second audio signal by decoding the second AV file. The editor may create a third video file by synthesizing the first video signal and the second video signal into one video signal and encoding the synthesized video signal. The editor may create a third audio file by synthesizing the first audio signal and the second audio signal into one audio signal and encoding the synthesized audio signal. The editor may combine the third video file and the third audio file into a third AV file and include it in the container. The electronic device may reproduce the third AV file obtained as an editing result through a speaker and a display.

전자 장치는 입력 장치(예: 터치스크린)를 통해 사용자로부터 녹화 명령을 수신할 수 있다. 녹화 명령이 복수 카메라들에 동시에 주어지더라도 피사체 촬영을 시작한 시점은 복수 카메라들마다 다를 수 있다. 예를 들어, 제1 카메라는 녹화 명령에 빠르게 반응하여 촬영을 시작하고 촬영을 시작한 시점부터 비디오 신호를 프레임 단위로 생성하여 제1 레코더로 전달할 수 있다. 제2 카메라는 녹화 명령에 상대적으로 느리게 반응하여 촬영을 시작하고 촬영을 시작한 시점부터 비디오 신호를 프레임 단위로 생성하여 제2 레코더로 전달할 수 있다. 이에 따라 제2 카메라에서 제2 레코더로 전달되는 비디오 신호에는 두 레코더 간의 반응 속도의 차이만큼의 장면이 포함되지 않을 수 있다. 이러한 반응 속도의 차이는 편집 결과물인 제3 AV 파일이 재생될 때 오디오와 비디오의 비동기화(일명, AV 싱크(sync) 불일치)를 야기할 수 있다. 결과적으로 전자 장치는, 제3 AV 파일을 재생하는 동안, AV 싱크 불일치로 인한 불쾌감을 사용자에게 줄 수 있다. 예를 들면, 피사체들 중에 어느 한 피사체의 동작과 소리가 시간적으로 어긋날 수 있다. 제1 카메라에서 촬영된 피사체의 동작과 제2 카메라에서 촬영된 피사체의 동작이 시간적으로 어긋날 수도 있다.An electronic device may receive a recording command from a user through an input device (eg, a touch screen). Even if a recording command is given to a plurality of cameras at the same time, a point in time at which a subject is started to be photographed may be different for each of the plurality of cameras. For example, the first camera may start recording in response to a recording command, generate a video signal in units of frames from the start of recording, and transmit the video signal to the first recorder. The second camera responds relatively slowly to the recording command and starts recording, and generates a video signal in units of frames from the start of recording and transmits the video signal to the second recorder. Accordingly, a video signal transferred from the second camera to the second recorder may not include as many scenes as the difference in reaction speed between the two recorders. Such a difference in response speed may cause audio and video to become out of synch (aka AV sync mismatch) when the third AV file, which is the editing result, is reproduced. As a result, the electronic device may give the user discomfort due to the AV sync mismatch while reproducing the third AV file. For example, an operation and sound of one of the subjects may be temporally out of sync. A motion of a subject photographed by the first camera and a motion of a subject photographed by the second camera may be temporally different.

카메라들이 촬영을 시작하는 순서와 촬영 시작 시점들 간의 시차가 여러 번의 테스트들을 통해 획득될 수 있다. AV 파일들을 합성 시 여러 번의 실험을 통해 획득된 촬영 개시 순서와 시차가 고려될 수 있다. 예를 들어, 제1 카메라가 제2 카메라보다 먼저 촬영을 시작한다는 점 그리고 촬영 시점들 간의 시차를 나타내는 평균 지연 값이 획득될 수 있다. 편집기는 제1 카메라를 이용하여 생성된 제1 비디오 파일에서 상기 시차에 해당되는 앞부분을 제외한 나머지와 제2 비디오 파일을 합성할 수 있다. 하지만, 촬영 시작 순서 및/또는 촬영 시작 시점들 간의 시차가 촬영 때마다 매번 동일하지 않을 수 있다. 따라서, 합성 결과물인 AV 파일에서 싱크 일치가 합성 때마다 매번 이루어질 것이라는 보장을 못할 수도 있다.An order in which cameras start shooting and a time difference between shooting start points may be obtained through several tests. When synthesizing AV files, the shooting start order and time difference obtained through several experiments may be considered. For example, an average delay value indicating that the first camera starts shooting before the second camera and a time difference between shooting points may be obtained. The editor may synthesize a second video file with the rest of the first video file generated by using the first camera except for the front part corresponding to the parallax. However, the shooting start sequence and/or the time difference between shooting start points may not be the same every time shooting. Accordingly, it may not be possible to guarantee that synchronization in the AV file, which is the result of synthesis, will be made every time it is synthesized.

다양한 실시예에서 전자 장치는 오디오와 비디오의 동기화가 이루어진 합성 파일을 생성하도록 함으로써 AV 싱크 불일치로 인한 사용자의 불편함을 해소할 수 있다. In various embodiments, the electronic device can solve the user's inconvenience caused by the AV sync mismatch by generating a synthesized file in which audio and video are synchronized.

본 개시에서 이루고자 하는 기술적 과제는 이상에서 언급한 기술적 과제로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.The technical problem to be achieved in the present disclosure is not limited to the above-mentioned technical problem, and other technical problems not mentioned can be clearly understood by those skilled in the art from the description below. There will be.

다양한 실시예에서 전자 장치는, 카메라들; 마이크; 상기 카메라들과 상기 마이크에 작동적으로 연결된 프로세서; 및 상기 프로세서에 작동적으로 연결된 메모리를 포함할 수 있다. 상기 메모리는, 실행될 때, 상기 프로세서가: 상기 카메라들 중 제1카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제1 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제1 오디오 파일을 생성하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 제1 AV 파일로 결합하여 상기 메모리에 저장하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 생성하는 동일한 시간대에, 상기 카메라들 중 제2카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제2 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제2 오디오 파일을 생성하고, 상기 제2 비디오 파일과 상기 제2 오디오 파일을 제2 AV 파일로 결합하여 상기 메모리에 저장하고, 상기 제1 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 비디오 타임스탬프, 상기 제1 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 오디오 타임스탬프, 상기 제2 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 비디오 타임스탬프, 및 상기 제2 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 오디오 타임스탬프를 상기 메모리에 저장하고, 상기 메모리에 동 시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제1 타임스탬프에 기초하여 비동기화 비디오 프레임 구간을 결정하고, 상기 AV 파일들에 포함된 오디오 타임스탬프들 중에서 또는 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제2 타임스탬프에 기초하여 비동기화 오디오 프레임 구간을 결정하고, 상기 AV 파일들마다 비동기화 비디오 프레임 구간에 속하지 않는 비디오 프레임들을 디코딩함으로써 복수의 비디오 신호들을 생성하고, 상기 복수의 비디오 신호들을 하나의 비디오 신호로 합성하고, 하나로 합성된 상기 비디오 신호를 인코딩함으로써 합성 비디오 파일을 생성하고, 상기 AV 파일들마다 비동기화 오디오 프레임 구간에 속하지 않은 오디오 프레임들을 디코딩함으로써 복수의 오디오 신호들을 생성하고, 상기 복수의 오디오 신호들을 하나의 오디오 신호로 합성하고, 하나로 합성된 상기 오디오 신호를 인코딩함으로써 합성 오디오 파일을 생성하고, 상기 합성 비디오 파일과 상기 합성 오디오 파일을 합성 AV 파일로 결합하여 상기 메모리에 저장하도록 하는 인스트럭션들을 저장할 수 있다.In various embodiments, an electronic device may include cameras; mike; a processor operatively connected to the cameras and the microphone; and a memory operatively coupled to the processor. The memory, when executed, causes the processor to: generate a first video file by encoding a video signal received from a first one of the cameras frame by frame, and encode an audio signal received from the microphone frame by frame A first audio file is generated, the first video file and the first audio file are combined into a first AV file, stored in the memory, and the first video file and the first audio file are generated at the same time. , A second video file is generated by encoding a video signal received from a second camera of the cameras in units of frames, and a second audio file is generated by encoding an audio signal received from the microphone in units of frames, 2 video files and the second audio file are combined into a second AV file and stored in the memory, and a first video timestamp indicating a time point at which a first frame was generated in the first video file, a first video timestamp in the first audio file A first audio timestamp indicating when a frame was created, a second video timestamp indicating when a first frame was created in the second video file, and a second audio timestamp indicating when a first frame was created in the second audio file. An audio timestamp is stored in the memory, and a relatively slow point in time among video timestamps included in AV files or among video timestamps and audio timestamps included in AV files stored in the memory at the same time period Determines an unsynchronized video frame period based on a first timestamp indicating a relatively slow among audio timestamps included in the AV files or among video timestamps and audio timestamps included in the AV files. An asynchronous audio frame interval is determined based on a second timestamp indicating a point of view, a plurality of video signals are generated by decoding video frames not belonging to the asynchronous video frame interval for each of the AV files, and the plurality of video signals are generated. synthesized into one video signal, generating a synthesized video file by encoding the synthesized video signal, and generating a plurality of audio signals by decoding audio frames that do not belong to an asynchronous audio frame interval for each AV file, , synthesizing the plurality of audio signals into one audio signal, generating a synthesized audio file by encoding the synthesized audio signal, combining the synthesized video file and the synthesized audio file into a synthesized AV file, and storing the synthesized audio signal in the memory. You can store instructions that cause it to be saved.

다양한 실시예에서 전자 장치를 동작하는 방법은, 상기 전자 장치에 구비된 카메라들 중 제1카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제1 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제1 오디오 파일을 생성하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 제1 AV 파일로 결합하여 상기 전자 장치의 메모리에 저장하는 동작; 상기 제1 비디오 파일과 상기 제1 오디오 파일을 생성하는 동일한 시간대에, 상기 카메라들 중 제2카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제2 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제2 오디오 파일을 생성하고, 상기 제2 비디오 파일과 상기 제2 오디오 파일을 제2 AV 파일로 결합하여 상기 메모리에 저장하는 동작; 상기 제1 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 비디오 타임스탬프, 상기 제1 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 오디오 타임스탬프, 상기 제2 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 비디오 타임스탬프, 및 상기 제2 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 오디오 타임스탬프를 상기 메모리에 저장하는 동작; 상기 메모리에 동 시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제1 타임스탬프에 기초하여 비동기화 비디오 프레임 구간을 결정하고, 상기 AV 파일들에 포함된 오디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제2 타임스탬프에 기초하여 비동기화 오디오 프레임 구간을 결정하는 동작; 상기 AV 파일들마다 비동기화 비디오 프레임 구간에 속하지 않는 비디오 프레임들을 디코딩함으로써 복수의 비디오 신호들을 생성하고, 상기 복수의 비디오 신호들을 하나의 비디오 신호로 합성하고, 하나로 합성된 상기 비디오 신호를 인코딩함으로써 합성 비디오 파일을 생성하는 동작; 상기 AV 파일들마다 비동기화 오디오 프레임 구간에 속하지 않은 오디오 프레임들을 디코딩함으로써 복수의 오디오 신호들을 생성하고, 상기 복수의 오디오 신호들을 하나의 오디오 신호로 합성하고, 하나로 합성된 상기 오디오 신호를 인코딩함으로써 합성 오디오 파일을 생성하는 동작; 및 상기 합성 비디오 파일과 상기 합성 오디오 파일을 합성 AV 파일로 결합하여 상기 메모리에 저장하는 동작을 포함할 수 있다.In various embodiments, a method for operating an electronic device includes generating a first video file by encoding a video signal received from a first camera among cameras included in the electronic device in units of frames, and generating an audio signal received from the microphone. generating a first audio file by encoding in units of frames, combining the first video file and the first audio file into a first AV file, and storing the combined first AV file in a memory of the electronic device; A second video file is generated by encoding a video signal received from a second camera among the cameras frame by frame during the same time period in which the first video file and the first audio file are generated, and the audio received from the microphone is generated. generating a second audio file by encoding a signal frame by frame, combining the second video file and the second audio file into a second AV file, and storing the second audio file in the memory; A first video timestamp indicating when the first frame was created in the first video file, a first audio timestamp indicating when the first frame was created in the first audio file, and the first frame in the second video file storing, in the memory, a second video timestamp indicating when a first frame was created and a second audio timestamp indicating when a first frame was generated in the second audio file; Based on a first timestamp indicating a relatively slow time point among video timestamps included in AV files stored in the memory at the same time, or among video timestamps and audio timestamps included in the AV files, the ratio Determines a synchronization video frame interval, based on a second timestamp indicating a relatively slow time point among audio timestamps included in the AV files or among video timestamps and audio timestamps included in the AV files determining an unsynchronized audio frame section by performing the step; For each of the AV files, a plurality of video signals are generated by decoding video frames that do not belong to an asynchronous video frame period, the plurality of video signals are synthesized into one video signal, and the video signal synthesized into one is synthesized by encoding. creating a video file; For each of the AV files, a plurality of audio signals are generated by decoding audio frames that do not belong to the asynchronous audio frame period, the plurality of audio signals are synthesized into one audio signal, and the synthesized audio signal is synthesized by encoding. creating an audio file; and combining the composite video file and the composite audio file into a composite AV file and storing the composite AV file in the memory.

본 발명의 다양한 실시예는, 오디오와 비디오의 동기화가 이루어진 합성 파일을 생성하도록 함으로써 AV 싱크 불일치로 인한 사용자의 불편함을 해소할 수 있도록 한 전자 장치를 제공할 수 있다. 이 외에, 본 문서를 통해 직접적 또는 간접적으로 파악되는 다양한 효과들이 제공될 수 있다.Various embodiments of the present invention may provide an electronic device capable of solving user's inconvenience due to AV sync mismatch by generating a synthesized file in which audio and video are synchronized. In addition to this, various effects identified directly or indirectly through this document may be provided.

도 1 은, 다양한 실시예에 따른, 네트워크 환경 내의 전자 장치의 블록도이다.
도 2 은, 다양한 실시에 따른, 오디오 모듈의 블록도이다.
도 3 는, 다양한 실시예들에 따른, 카메라 모듈을 예시하는 블럭도이다.
도 4a는 바(bar) 타입의 하우징 구조를 갖는 휴대 전자 장치 전면의 사시도이다.
도 4b는 도 4a의 전자 장치 후면의 사시도이다.
도 5는, 일 실시예에 따른, AV 싱크 일치된 합성 AV 파일을 생성하도록 구성된 전자 장치의 블록도이다.
도 6은 도 5의 기록 모듈이 타임스탬프(timestamp)가 표기되어 있는 원본 AV 파일들을 생성하는 동작을 예시한다.
도 7 및 도 8은 비동기화 프레임 구간을 결정하고 비동기화 프레임 구간을 이용하여 합성 AV 파일을 생성하는 동작을 예시한다.
도 9는, 일 실시예에 따른, 비동기화 프레임 구간을 결정하기 위한 동작들을 설명하기 위한 흐름도이다.
도 10은, 일 실시예에 따른, AV 싱크 일치된 합성 AV 파일을 생성하기 위한 동작들을 설명하기 위한 흐름도이다.1 is a block diagram of an electronic device in a network environment, according to various embodiments.
2 is a block diagram of an audio module, in accordance with various implementations.
3 is a block diagram illustrating a camera module, in accordance with various embodiments.
4A is a perspective view of a front side of a portable electronic device having a bar-type housing structure.
4B is a perspective view of the back of the electronic device of FIG. 4A.
5 is a block diagram of an electronic device configured to generate an AV synchronized composite AV file, according to one embodiment.
FIG. 6 illustrates an operation in which the recording module of FIG. 5 creates original AV files marked with timestamps.
7 and 8 illustrate an operation of determining an asynchronous frame period and generating a synthesized AV file using the asynchronous frame period.
9 is a flowchart illustrating operations for determining an unsynchronized frame period, according to an exemplary embodiment.
10 is a flowchart illustrating operations for generating an AV sync-matched composite AV file, according to an embodiment.

도 1은, 다양한 실시예들에 따른, 네트워크 환경(100) 내의 전자 장치(101)의 블록도이다. 도 1을 참조하면, 네트워크 환경(100)에서 전자 장치(101)는 제1 네트워크(198)(예: 근거리 무선 통신 네트워크)를 통하여 전자 장치(102)와 통신하거나, 또는 제2 네트워크(199)(예: 원거리 무선 통신 네트워크)를 통하여 전자 장치(104) 또는 서버(108) 중 적어도 하나와 통신할 수 있다. 일 실시예에 따르면, 전자 장치(101)는 서버(108)를 통하여 전자 장치(104)와 통신할 수 있다. 일 실시예에 따르면, 전자 장치(101)는 프로세서(120), 메모리(130), 입력 모듈(150), 음향 출력 모듈(155), 디스플레이 모듈(160), 오디오 모듈(170), 센서 모듈(176), 인터페이스(177), 연결 단자(178), 햅틱 모듈(179), 카메라 모듈(180), 전력 관리 모듈(188), 배터리(189), 통신 모듈(190), 가입자 식별 모듈(196), 또는 안테나 모듈(197)을 포함할 수 있다. 어떤 실시예에서는, 전자 장치(101)에는, 이 구성요소들 중 적어도 하나(예: 연결 단자(178))가 생략되거나, 하나 이상의 다른 구성요소가 추가될 수 있다. 어떤 실시예에서는, 이 구성요소들 중 일부들(예: 센서 모듈(176), 카메라 모듈(180), 또는 안테나 모듈(197))은 하나의 구성요소(예: 디스플레이 모듈(160))로 통합될 수 있다.1 is a block diagram of an electronic device 101 within a network environment 100, according to various embodiments. Referring to FIG. 1 , in a network environment 100, an electronic device 101 communicates with an electronic device 102 through a first network 198 (eg, a short-range wireless communication network) or through a second network 199. It may communicate with at least one of the electronic device 104 or the server 108 through (eg, a long-distance wireless communication network). According to an embodiment, the electronic device 101 may communicate with the electronic device 104 through the server 108 . According to an embodiment, the electronic device 101 includes a processor 120, a memory 130, an input module 150, an audio output module 155, a display module 160, an audio module 170, a sensor module ( 176), interface 177, connection terminal 178, haptic module 179, camera module 180, power management module 188, battery 189, communication module 190, subscriber identification module 196 , or the antenna module 197 may be included. In some embodiments, in the electronic device 101, at least one of these components (eg, the connection terminal 178) may be omitted or one or more other components may be added. In some embodiments, some of these components (eg, sensor module 176, camera module 180, or antenna module 197) are integrated into a single component (eg, display module 160). It can be.

프로세서(120)는, 예를 들면, 소프트웨어(예: 프로그램(140))를 실행하여 프로세서(120)에 연결된 전자 장치(101)의 적어도 하나의 다른 구성요소(예: 하드웨어 또는 소프트웨어 구성요소)를 제어할 수 있고, 다양한 데이터 처리 또는 연산을 수행할 수 있다. 일 실시예에 따르면, 데이터 처리 또는 연산의 적어도 일부로서, 프로세서(120)는 다른 구성요소(예: 센서 모듈(176) 또는 통신 모듈(190))로부터 수신된 명령 또는 데이터를 휘발성 메모리(132)에 저장하고, 휘발성 메모리(132)에 저장된 명령 또는 데이터를 처리하고, 결과 데이터를 비휘발성 메모리(134)에 저장할 수 있다. 일 실시예에 따르면, 프로세서(120)는 메인 프로세서(121)(예: 중앙 처리 장치 또는 어플리케이션 프로세서) 또는 이와는 독립적으로 또는 함께 운영 가능한 보조 프로세서(123)(예: 그래픽 처리 장치, 신경망 처리 장치(NPU: neural processing unit), 이미지 시그널 프로세서, 센서 허브 프로세서, 또는 커뮤니케이션 프로세서)를 포함할 수 있다. 예를 들어, 전자 장치(101)가 메인 프로세서(121) 및 보조 프로세서(123)를 포함하는 경우, 보조 프로세서(123)는 메인 프로세서(121)보다 저전력을 사용하거나, 지정된 기능에 특화되도록 설정될 수 있다. 보조 프로세서(123)는 메인 프로세서(121)와 별개로, 또는 그 일부로서 구현될 수 있다.The processor 120, for example, executes software (eg, the program 140) to cause at least one other component (eg, hardware or software component) of the electronic device 101 connected to the processor 120. It can control and perform various data processing or calculations. According to one embodiment, as at least part of data processing or operation, processor 120 transfers instructions or data received from other components (eg, sensor module 176 or communication module 190) to volatile memory 132. , processing commands or data stored in the volatile memory 132 , and storing resultant data in the non-volatile memory 134 . According to an embodiment, the processor 120 may include a main processor 121 (eg, a central processing unit or an application processor) or a secondary processor 123 (eg, a graphic processing unit, a neural network processing unit ( NPU: neural processing unit (NPU), image signal processor, sensor hub processor, or communication processor). For example, when the electronic device 101 includes the main processor 121 and the auxiliary processor 123, the auxiliary processor 123 may use less power than the main processor 121 or be set to be specialized for a designated function. can The secondary processor 123 may be implemented separately from or as part of the main processor 121 .

보조 프로세서(123)는, 예를 들면, 메인 프로세서(121)가 인액티브(예: 슬립) 상태에 있는 동안 메인 프로세서(121)를 대신하여, 또는 메인 프로세서(121)가 액티브(예: 어플리케이션 실행) 상태에 있는 동안 메인 프로세서(121)와 함께, 전자 장치(101)의 구성요소들 중 적어도 하나의 구성요소(예: 디스플레이 모듈(160), 센서 모듈(176), 또는 통신 모듈(190))와 관련된 기능 또는 상태들의 적어도 일부를 제어할 수 있다. 일 실시예에 따르면, 보조 프로세서(123)(예: 이미지 시그널 프로세서 또는 커뮤니케이션 프로세서)는 기능적으로 관련 있는 다른 구성요소(예: 카메라 모듈(180) 또는 통신 모듈(190))의 일부로서 구현될 수 있다. 일 실시예에 따르면, 보조 프로세서(123)(예: 신경망 처리 장치)는 인공지능 모델의 처리에 특화된 하드웨어 구조를 포함할 수 있다. 인공지능 모델은 기계 학습을 통해 생성될 수 있다. 이러한 학습은, 예를 들어, 인공지능 모델이 수행되는 전자 장치(101) 자체에서 수행될 수 있고, 별도의 서버(예: 서버(108))를 통해 수행될 수도 있다. 학습 알고리즘은, 예를 들어, 지도형 학습(supervised learning), 비지도형 학습(unsupervised learning), 준지도형 학습(semi-supervised learning) 또는 강화 학습(reinforcement learning)을 포함할 수 있으나, 전술한 예에 한정되지 않는다. 인공지능 모델은, 복수의 인공 신경망 레이어들을 포함할 수 있다. 인공 신경망은 심층 신경망(DNN: deep neural network), CNN(convolutional neural network), RNN(recurrent neural network), RBM(restricted boltzmann machine), DBN(deep belief network), BRDNN(bidirectional recurrent deep neural network), 심층 Q-네트워크(deep Q-networks) 또는 상기 중 둘 이상의 조합 중 하나일 수 있으나, 전술한 예에 한정되지 않는다. 인공지능 모델은 하드웨어 구조 이외에, 추가적으로 또는 대체적으로, 소프트웨어 구조를 포함할 수 있다.The secondary processor 123 may, for example, take the place of the main processor 121 while the main processor 121 is in an inactive (eg, sleep) state, or the main processor 121 is active (eg, running an application). ) state, together with the main processor 121, at least one of the components of the electronic device 101 (eg, the display module 160, the sensor module 176, or the communication module 190) It is possible to control at least some of the related functions or states. According to one embodiment, the auxiliary processor 123 (eg, an image signal processor or a communication processor) may be implemented as part of other functionally related components (eg, the camera module 180 or the communication module 190). there is. According to an embodiment, the auxiliary processor 123 (eg, a neural network processing device) may include a hardware structure specialized for processing an artificial intelligence model. AI models can be created through machine learning. Such learning may be performed, for example, in the electronic device 101 itself where the artificial intelligence model is performed, or may be performed through a separate server (eg, the server 108). The learning algorithm may include, for example, supervised learning, unsupervised learning, semi-supervised learning, or reinforcement learning, but in the above example Not limited. The artificial intelligence model may include a plurality of artificial neural network layers. Artificial neural networks include deep neural networks (DNNs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), restricted boltzmann machines (RBMs), deep belief networks (DBNs), bidirectional recurrent deep neural networks (BRDNNs), It may be one of deep Q-networks or a combination of two or more of the foregoing, but is not limited to the foregoing examples. The artificial intelligence model may include, in addition or alternatively, a software structure in addition to a hardware structure.

메모리(130)는, 전자 장치(101)의 적어도 하나의 구성요소(예: 프로세서(120) 또는 센서 모듈(176))에 의해 사용되는 다양한 데이터를 저장할 수 있다. 데이터는, 예를 들어, 소프트웨어(예: 프로그램(140)) 및, 이와 관련된 명령에 대한 입력 데이터 또는 출력 데이터를 포함할 수 있다. 메모리(130)는, 휘발성 메모리(132) 또는 비휘발성 메모리(134)를 포함할 수 있다. The memory 130 may store various data used by at least one component (eg, the processor 120 or the sensor module 176) of the electronic device 101 . The data may include, for example, input data or output data for software (eg, program 140) and commands related thereto. The memory 130 may include volatile memory 132 or non-volatile memory 134 .

프로그램(140)은 메모리(130)에 소프트웨어로서 저장될 수 있으며, 예를 들면, 운영 체제(142), 미들 웨어(144) 또는 어플리케이션(146)을 포함할 수 있다. The program 140 may be stored as software in the memory 130 and may include, for example, an operating system 142 , middleware 144 , or an application 146 .

입력 모듈(150)은, 전자 장치(101)의 구성요소(예: 프로세서(120))에 사용될 명령 또는 데이터를 전자 장치(101)의 외부(예: 사용자)로부터 수신할 수 있다. 입력 모듈(150)은, 예를 들면, 마이크, 마우스, 키보드, 키(예: 버튼), 또는 디지털 펜(예: 스타일러스 펜)을 포함할 수 있다. The input module 150 may receive a command or data to be used for a component (eg, the processor 120) of the electronic device 101 from an outside of the electronic device 101 (eg, a user). The input module 150 may include, for example, a microphone, a mouse, a keyboard, a key (eg, a button), or a digital pen (eg, a stylus pen).

음향 출력 모듈(155)은 음향 신호를 전자 장치(101)의 외부로 출력할 수 있다. 음향 출력 모듈(155)은, 예를 들면, 스피커 또는 리시버를 포함할 수 있다. 스피커는 멀티미디어 재생 또는 녹음 재생과 같이 일반적인 용도로 사용될 수 있다. 리시버는 착신 전화를 수신하기 위해 사용될 수 있다. 일 실시예에 따르면, 리시버는 스피커와 별개로, 또는 그 일부로서 구현될 수 있다.The sound output module 155 may output sound signals to the outside of the electronic device 101 . The sound output module 155 may include, for example, a speaker or a receiver. The speaker can be used for general purposes such as multimedia playback or recording playback. A receiver may be used to receive an incoming call. According to one embodiment, the receiver may be implemented separately from the speaker or as part of it.

디스플레이 모듈(160)은 전자 장치(101)의 외부(예: 사용자)로 정보를 시각적으로 제공할 수 있다. 디스플레이 모듈(160)은, 예를 들면, 디스플레이, 홀로그램 장치, 또는 프로젝터 및 해당 장치를 제어하기 위한 제어 회로를 포함할 수 있다. 일 실시예에 따르면, 디스플레이 모듈(160)은 터치를 감지하도록 설정된 터치 센서, 또는 상기 터치에 의해 발생되는 힘의 세기를 측정하도록 설정된 압력 센서를 포함할 수 있다. The display module 160 can visually provide information to the outside of the electronic device 101 (eg, a user). The display module 160 may include, for example, a display, a hologram device, or a projector and a control circuit for controlling the device. According to an embodiment, the display module 160 may include a touch sensor configured to detect a touch or a pressure sensor configured to measure the intensity of force generated by the touch.

오디오 모듈(170)은 소리를 전기 신호로 변환시키거나, 반대로 전기 신호를 소리로 변환시킬 수 있다. 일 실시예에 따르면, 오디오 모듈(170)은, 입력 모듈(150)을 통해 소리를 획득하거나, 음향 출력 모듈(155), 또는 전자 장치(101)와 직접 또는 무선으로 연결된 외부 전자 장치(예: 전자 장치(102))(예: 스피커 또는 헤드폰)를 통해 소리를 출력할 수 있다.The audio module 170 may convert sound into an electrical signal or vice versa. According to one embodiment, the audio module 170 acquires sound through the input module 150, the sound output module 155, or an external electronic device connected directly or wirelessly to the electronic device 101 (eg: Sound may be output through the electronic device 102 (eg, a speaker or a headphone).

센서 모듈(176)은 전자 장치(101)의 작동 상태(예: 전력 또는 온도), 또는 외부의 환경 상태(예: 사용자 상태)를 감지하고, 감지된 상태에 대응하는 전기 신호 또는 데이터 값을 생성할 수 있다. 일 실시예에 따르면, 센서 모듈(176)은, 예를 들면, 제스처 센서, 자이로 센서, 기압 센서, 마그네틱 센서, 가속도 센서, 그립 센서, 근접 센서, 컬러 센서, IR(infrared) 센서, 생체 센서, 온도 센서, 습도 센서, 또는 조도 센서를 포함할 수 있다. The sensor module 176 detects an operating state (eg, power or temperature) of the electronic device 101 or an external environmental state (eg, a user state), and generates an electrical signal or data value corresponding to the detected state. can do. According to one embodiment, the sensor module 176 may include, for example, a gesture sensor, a gyro sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a grip sensor, a proximity sensor, a color sensor, an IR (infrared) sensor, a bio sensor, It may include a temperature sensor, humidity sensor, or light sensor.

인터페이스(177)는 전자 장치(101)가 외부 전자 장치(예: 전자 장치(102))와 직접 또는 무선으로 연결되기 위해 사용될 수 있는 하나 이상의 지정된 프로토콜들을 지원할 수 있다. 일 실시예에 따르면, 인터페이스(177)는, 예를 들면, HDMI(high definition multimedia interface), USB(universal serial bus) 인터페이스, SD카드 인터페이스, 또는 오디오 인터페이스를 포함할 수 있다.The interface 177 may support one or more specified protocols that may be used to directly or wirelessly connect the electronic device 101 to an external electronic device (eg, the electronic device 102). According to one embodiment, the interface 177 may include, for example, a high definition multimedia interface (HDMI), a universal serial bus (USB) interface, an SD card interface, or an audio interface.

연결 단자(178)는, 그를 통해서 전자 장치(101)가 외부 전자 장치(예: 전자 장치(102))와 물리적으로 연결될 수 있는 커넥터를 포함할 수 있다. 일 실시예에 따르면, 연결 단자(178)는, 예를 들면, HDMI 커넥터, USB 커넥터, SD 카드 커넥터, 또는 오디오 커넥터(예: 헤드폰 커넥터)를 포함할 수 있다.The connection terminal 178 may include a connector through which the electronic device 101 may be physically connected to an external electronic device (eg, the electronic device 102). According to one embodiment, the connection terminal 178 may include, for example, an HDMI connector, a USB connector, an SD card connector, or an audio connector (eg, a headphone connector).

햅틱 모듈(179)은 전기적 신호를 사용자가 촉각 또는 운동 감각을 통해서 인지할 수 있는 기계적인 자극(예: 진동 또는 움직임) 또는 전기적인 자극으로 변환할 수 있다. 일 실시예에 따르면, 햅틱 모듈(179)은, 예를 들면, 모터, 압전 소자, 또는 전기 자극 장치를 포함할 수 있다.The haptic module 179 may convert electrical signals into mechanical stimuli (eg, vibration or movement) or electrical stimuli that a user may perceive through tactile or kinesthetic senses. According to one embodiment, the haptic module 179 may include, for example, a motor, a piezoelectric element, or an electrical stimulation device.

카메라 모듈(180)은 정지 영상 및 동영상을 촬영할 수 있다. 일 실시예에 따르면, 카메라 모듈(180)은 하나 이상의 렌즈들, 이미지 센서들, 이미지 시그널 프로세서들, 또는 플래시들을 포함할 수 있다.The camera module 180 may capture still images and moving images. According to one embodiment, the camera module 180 may include one or more lenses, image sensors, image signal processors, or flashes.

전력 관리 모듈(188)은 전자 장치(101)에 공급되는 전력을 관리할 수 있다. 일 실시예에 따르면, 전력 관리 모듈(188)은, 예를 들면, PMIC(power management integrated circuit)의 적어도 일부로서 구현될 수 있다.The power management module 188 may manage power supplied to the electronic device 101 . According to one embodiment, the power management module 188 may be implemented as at least part of a power management integrated circuit (PMIC), for example.

배터리(189)는 전자 장치(101)의 적어도 하나의 구성요소에 전력을 공급할 수 있다. 일 실시예에 따르면, 배터리(189)는, 예를 들면, 재충전 불가능한 1차 전지, 재충전 가능한 2차 전지 또는 연료 전지를 포함할 수 있다.The battery 189 may supply power to at least one component of the electronic device 101 . According to one embodiment, the battery 189 may include, for example, a non-rechargeable primary cell, a rechargeable secondary cell, or a fuel cell.

통신 모듈(190)은 전자 장치(101)와 외부 전자 장치(예: 전자 장치(102), 전자 장치(104), 또는 서버(108)) 간의 직접(예: 유선) 통신 채널 또는 무선 통신 채널의 수립, 및 수립된 통신 채널을 통한 통신 수행을 지원할 수 있다. 통신 모듈(190)은 프로세서(120)(예: 어플리케이션 프로세서)와 독립적으로 운영되고, 직접(예: 유선) 통신 또는 무선 통신을 지원하는 하나 이상의 커뮤니케이션 프로세서를 포함할 수 있다. 일 실시예에 따르면, 통신 모듈(190)은 무선 통신 모듈(192)(예: 셀룰러 통신 모듈, 근거리 무선 통신 모듈, 또는 GNSS(global navigation satellite system) 통신 모듈) 또는 유선 통신 모듈(194)(예: LAN(local area network) 통신 모듈, 또는 전력선 통신 모듈)을 포함할 수 있다. 이들 통신 모듈 중 해당하는 통신 모듈은 제1 네트워크(198)(예: 블루투스, WiFi(wireless fidelity) direct 또는 IrDA(infrared data association)와 같은 근거리 통신 네트워크) 또는 제2 네트워크(199)(예: 레거시 셀룰러 네트워크, 5G 네트워크, 차세대 통신 네트워크, 인터넷, 또는 컴퓨터 네트워크(예: LAN 또는 WAN)와 같은 원거리 통신 네트워크)를 통하여 외부의 전자 장치(104)와 통신할 수 있다. 이런 여러 종류의 통신 모듈들은 하나의 구성요소(예: 단일 칩)로 통합되거나, 또는 서로 별도의 복수의 구성요소들(예: 복수 칩들)로 구현될 수 있다. 무선 통신 모듈(192)은 가입자 식별 모듈(196)에 저장된 가입자 정보(예: 국제 모바일 가입자 식별자(IMSI))를 이용하여 제1 네트워크(198) 또는 제2 네트워크(199)와 같은 통신 네트워크 내에서 전자 장치(101)를 확인 또는 인증할 수 있다. The communication module 190 is a direct (eg, wired) communication channel or a wireless communication channel between the electronic device 101 and an external electronic device (eg, the electronic device 102, the electronic device 104, or the server 108). Establishment and communication through the established communication channel may be supported. The communication module 190 may include one or more communication processors that operate independently of the processor 120 (eg, an application processor) and support direct (eg, wired) communication or wireless communication. According to one embodiment, the communication module 190 may be a wireless communication module 192 (eg, a cellular communication module, a short-range wireless communication module, or a global navigation satellite system (GNSS) communication module) or a wired communication module 194 (eg, a : a local area network (LAN) communication module or a power line communication module). Among these communication modules, a corresponding communication module is a first network 198 (eg, a short-range communication network such as Bluetooth, wireless fidelity (WiFi) direct, or infrared data association (IrDA)) or a second network 199 (eg, a legacy communication module). It may communicate with the external electronic device 104 through a cellular network, a 5G network, a next-generation communication network, the Internet, or a telecommunications network such as a computer network (eg, a LAN or a WAN). These various types of communication modules may be integrated as one component (eg, a single chip) or implemented as a plurality of separate components (eg, multiple chips). The wireless communication module 192 uses subscriber information (eg, International Mobile Subscriber Identifier (IMSI)) stored in the subscriber identification module 196 within a communication network such as the first network 198 or the second network 199. The electronic device 101 may be identified or authenticated.

무선 통신 모듈(192)은 4G 네트워크 이후의 5G 네트워크 및 차세대 통신 기술, 예를 들어, NR 접속 기술(new radio access technology)을 지원할 수 있다. NR 접속 기술은 고용량 데이터의 고속 전송(eMBB(enhanced mobile broadband)), 단말 전력 최소화와 다수 단말의 접속(mMTC(massive machine type communications)), 또는 고신뢰도와 저지연(URLLC(ultra-reliable and low-latency communications))을 지원할 수 있다. 무선 통신 모듈(192)은, 예를 들어, 높은 데이터 전송률 달성을 위해, 고주파 대역(예: mmWave 대역)을 지원할 수 있다. 무선 통신 모듈(192)은 고주파 대역에서의 성능 확보를 위한 다양한 기술들, 예를 들어, 빔포밍(beamforming), 거대 배열 다중 입출력(massive MIMO(multiple-input and multiple-output)), 전차원 다중입출력(FD-MIMO: full dimensional MIMO), 어레이 안테나(array antenna), 아날로그 빔형성(analog beam-forming), 또는 대규모 안테나(large scale antenna)와 같은 기술들을 지원할 수 있다. 무선 통신 모듈(192)은 전자 장치(101), 외부 전자 장치(예: 전자 장치(104)) 또는 네트워크 시스템(예: 제2 네트워크(199))에 규정되는 다양한 요구사항을 지원할 수 있다. 일 실시예에 따르면, 무선 통신 모듈(192)은 eMBB 실현을 위한 Peak data rate(예: 20Gbps 이상), mMTC 실현을 위한 손실 Coverage(예: 164dB 이하), 또는 URLLC 실현을 위한 U-plane latency(예: 다운링크(DL) 및 업링크(UL) 각각 0.5ms 이하, 또는 라운드 트립 1ms 이하)를 지원할 수 있다.The wireless communication module 192 may support a 5G network after a 4G network and a next-generation communication technology, for example, NR access technology (new radio access technology). NR access technologies include high-speed transmission of high-capacity data (enhanced mobile broadband (eMBB)), minimization of terminal power and access of multiple terminals (massive machine type communications (mMTC)), or high reliability and low latency (ultra-reliable and low latency (URLLC)). -latency communications)) can be supported. The wireless communication module 192 may support a high frequency band (eg, mmWave band) to achieve a high data rate, for example. The wireless communication module 192 uses various technologies for securing performance in a high frequency band, such as beamforming, massive multiple-input and multiple-output (MIMO), and full-dimensional multiplexing. Technologies such as input/output (FD-MIMO: full dimensional MIMO), array antenna, analog beam-forming, or large scale antenna may be supported. The wireless communication module 192 may support various requirements defined for the electronic device 101, an external electronic device (eg, the electronic device 104), or a network system (eg, the second network 199). According to one embodiment, the wireless communication module 192 may be used to realize peak data rate (eg, 20 Gbps or more) for realizing eMBB, loss coverage (eg, 164 dB or less) for realizing mMTC, or U-plane latency (for realizing URLLC). Example: downlink (DL) and uplink (UL) each of 0.5 ms or less, or round trip 1 ms or less) may be supported.

안테나 모듈(197)은 신호 또는 전력을 외부(예: 외부의 전자 장치)로 송신하거나 외부로부터 수신할 수 있다. 일 실시예에 따르면, 안테나 모듈(197)은 서브스트레이트(예: PCB) 위에 형성된 도전체 또는 도전성 패턴으로 이루어진 방사체를 포함하는 안테나를 포함할 수 있다. 일 실시예에 따르면, 안테나 모듈(197)은 복수의 안테나들(예: 어레이 안테나)을 포함할 수 있다. 이런 경우, 제1 네트워크(198) 또는 제2 네트워크(199)와 같은 통신 네트워크에서 사용되는 통신 방식에 적합한 적어도 하나의 안테나가, 예를 들면, 통신 모듈(190)에 의하여 상기 복수의 안테나들로부터 선택될 수 있다. 신호 또는 전력은 상기 선택된 적어도 하나의 안테나를 통하여 통신 모듈(190)과 외부의 전자 장치 간에 송신되거나 수신될 수 있다. 어떤 실시예에 따르면, 방사체 이외에 다른 부품(예: RFIC(radio frequency integrated circuit))이 추가로 안테나 모듈(197)의 일부로 형성될 수 있다. The antenna module 197 may transmit or receive signals or power to the outside (eg, an external electronic device). According to an embodiment, the antenna module 197 may include an antenna including a radiator formed of a conductor or a conductive pattern formed on a substrate (eg, PCB). According to one embodiment, the antenna module 197 may include a plurality of antennas (eg, an array antenna). In this case, at least one antenna suitable for a communication method used in a communication network such as the first network 198 or the second network 199 is selected from the plurality of antennas by the communication module 190, for example. can be chosen A signal or power may be transmitted or received between the communication module 190 and an external electronic device through the selected at least one antenna. According to some embodiments, other components (eg, a radio frequency integrated circuit (RFIC)) may be additionally formed as a part of the antenna module 197 in addition to the radiator.

다양한 실시예에 따르면, 안테나 모듈(197)은 mmWave 안테나 모듈을 형성할 수 있다. 일 실시예에 따르면, mmWave 안테나 모듈은 인쇄 회로 기판, 상기 인쇄 회로 기판의 제1 면(예: 아래 면)에 또는 그에 인접하여 배치되고 지정된 고주파 대역(예: mmWave 대역)을 지원할 수 있는 RFIC, 및 상기 인쇄 회로 기판의 제2 면(예: 윗 면 또는 측 면)에 또는 그에 인접하여 배치되고 상기 지정된 고주파 대역의 신호를 송신 또는 수신할 수 있는 복수의 안테나들(예: 어레이 안테나)을 포함할 수 있다.According to various embodiments, the antenna module 197 may form a mmWave antenna module. According to one embodiment, the mmWave antenna module includes a printed circuit board, an RFIC disposed on or adjacent to a first surface (eg, a bottom surface) of the printed circuit board and capable of supporting a designated high frequency band (eg, mmWave band); and a plurality of antennas (eg, array antennas) disposed on or adjacent to a second surface (eg, a top surface or a side surface) of the printed circuit board and capable of transmitting or receiving signals of the designated high frequency band. can do.

상기 구성요소들 중 적어도 일부는 주변 기기들간 통신 방식(예: 버스, GPIO(general purpose input and output), SPI(serial peripheral interface), 또는 MIPI(mobile industry processor interface))을 통해 서로 연결되고 신호(예: 명령 또는 데이터)를 상호간에 교환할 수 있다.At least some of the components are connected to each other through a communication method between peripheral devices (eg, a bus, general purpose input and output (GPIO), serial peripheral interface (SPI), or mobile industry processor interface (MIPI)) and signal ( e.g. commands or data) can be exchanged with each other.

일 실시예에 따르면, 명령 또는 데이터는 제2 네트워크(199)에 연결된 서버(108)를 통해서 전자 장치(101)와 외부의 전자 장치(104)간에 송신 또는 수신될 수 있다. 외부의 전자 장치(102, 또는 104) 각각은 전자 장치(101)와 동일한 또는 다른 종류의 장치일 수 있다. 일 실시예에 따르면, 전자 장치(101)에서 실행되는 동작들의 전부 또는 일부는 외부의 전자 장치들(102, 104, 또는 108) 중 하나 이상의 외부의 전자 장치들에서 실행될 수 있다. 예를 들면, 전자 장치(101)가 어떤 기능이나 서비스를 자동으로, 또는 사용자 또는 다른 장치로부터의 요청에 반응하여 수행해야 할 경우에, 전자 장치(101)는 기능 또는 서비스를 자체적으로 실행시키는 대신에 또는 추가적으로, 하나 이상의 외부의 전자 장치들에게 그 기능 또는 그 서비스의 적어도 일부를 수행하라고 요청할 수 있다. 상기 요청을 수신한 하나 이상의 외부의 전자 장치들은 요청된 기능 또는 서비스의 적어도 일부, 또는 상기 요청과 관련된 추가 기능 또는 서비스를 실행하고, 그 실행의 결과를 전자 장치(101)로 전달할 수 있다. 전자 장치(101)는 상기 결과를, 그대로 또는 추가적으로 처리하여, 상기 요청에 대한 응답의 적어도 일부로서 제공할 수 있다. 이를 위하여, 예를 들면, 클라우드 컴퓨팅, 분산 컴퓨팅, 모바일 에지 컴퓨팅(MEC: mobile edge computing), 또는 클라이언트-서버 컴퓨팅 기술이 이용될 수 있다. 전자 장치(101)는, 예를 들어, 분산 컴퓨팅 또는 모바일 에지 컴퓨팅을 이용하여 초저지연 서비스를 제공할 수 있다. 다른 실시예에 있어서, 외부의 전자 장치(104)는 IoT(internet of things) 기기를 포함할 수 있다. 서버(108)는 기계 학습 및/또는 신경망을 이용한 지능형 서버일 수 있다. 일 실시예에 따르면, 외부의 전자 장치(104) 또는 서버(108)는 제2 네트워크(199) 내에 포함될 수 있다. 전자 장치(101)는 5G 통신 기술 및 IoT 관련 기술을 기반으로 지능형 서비스(예: 스마트 홈, 스마트 시티, 스마트 카, 또는 헬스 케어)에 적용될 수 있다.According to an embodiment, commands or data may be transmitted or received between the electronic device 101 and the external electronic device 104 through the server 108 connected to the second network 199 . Each of the external electronic devices 102 or 104 may be the same as or different from the electronic device 101 . According to an embodiment, all or part of operations executed in the electronic device 101 may be executed in one or more external electronic devices among the external electronic devices 102 , 104 , or 108 . For example, when the electronic device 101 needs to perform a certain function or service automatically or in response to a request from a user or another device, the electronic device 101 instead of executing the function or service by itself. Alternatively or additionally, one or more external electronic devices may be requested to perform the function or at least part of the service. One or more external electronic devices receiving the request may execute at least a part of the requested function or service or an additional function or service related to the request, and deliver the execution result to the electronic device 101 . The electronic device 101 may provide the result as at least part of a response to the request as it is or additionally processed. To this end, for example, cloud computing, distributed computing, mobile edge computing (MEC), or client-server computing technology may be used. The electronic device 101 may provide an ultra-low latency service using, for example, distributed computing or mobile edge computing. In another embodiment, the external electronic device 104 may include an internet of things (IoT) device. Server 108 may be an intelligent server using machine learning and/or neural networks. According to one embodiment, the external electronic device 104 or server 108 may be included in the second network 199 . The electronic device 101 may be applied to intelligent services (eg, smart home, smart city, smart car, or health care) based on 5G communication technology and IoT-related technology.

도 2은, 다양한 실시에 따른, 오디오 모듈(170)의 블록도(200)이다. 도 2를 참조하면, 오디오 모듈(170)은, 예를 들면, 오디오 입력 인터페이스(210), 오디오 입력 믹서(220), ADC(analog to digital converter)(230), 오디오 신호 처리기(240), DAC(digital to analog converter)(250), 오디오 출력 믹서(260), 및/또는 오디오 출력 인터페이스(270)를 포함할 수 있다. 2 is a block diagram 200 of an audio module 170, in accordance with various implementations. Referring to FIG. 2 , the audio module 170 includes, for example, an audio input interface 210, an audio input mixer 220, an analog to digital converter (ADC) 230, an audio signal processor 240, and a DAC. (digital to analog converter) 250, an audio output mixer 260, and/or an audio output interface 270 may be included.

오디오 입력 인터페이스(210)는 입력 모듈(150)의 일부로서 또는 전자 장치(101)와 별도로 구성된 마이크(예: 다이나믹 마이크, 콘덴서 마이크, 또는 피에조 마이크)를 통하여 전자 장치(101)의 외부로부터 획득한 소리에 대응하는 오디오 신호를 수신할 수 있다. 예를 들어, 오디오 신호가 외부의 전자 장치(102)(예: 헤드셋 또는 마이크)로부터 획득되는 경우, 오디오 입력 인터페이스(210)는 상기 외부의 전자 장치(102)와 연결 단자(예: 도 1의 연결 단자(178))를 통해 직접, 또는 무선 통신 모듈(예: 도 1의 무선 통신 모듈(192))을 통하여 무선으로(예: Bluetooth 통신) 연결되어 오디오 신호를 수신할 수 있다. 일실시예에 따르면, 오디오 입력 인터페이스(210)는 상기 외부의 전자 장치(102)로부터 획득(또는, 수신)되는 오디오 신호와 관련된 제어 신호(예: 입력 버튼을 통해 수신된 볼륨 조정 신호)를 수신할 수 있다. 오디오 입력 인터페이스(210)는 복수의 오디오 입력 채널들을 포함하고, 상기 복수의 오디오 입력 채널들 중 대응하는 오디오 입력 채널 별로 다른 오디오 신호를 수신할 수 있다. 일실시예에 따르면, 추가적으로 또는 대체적으로, 오디오 입력 인터페이스(210)는 전자 장치(101)의 다른 구성 요소(예: 도 1의 프로세서(120) 또는 메모리(130))로부터 오디오 신호를 입력 받을 수 있다.The audio input interface 210 is a part of the input module 150 or through a microphone configured separately from the electronic device 101 (eg, a dynamic microphone, a condenser microphone, or a piezo microphone), obtained from the outside of the electronic device 101. An audio signal corresponding to sound may be received. For example, when an audio signal is acquired from an external electronic device 102 (eg, a headset or a microphone), the audio input interface 210 connects the external electronic device 102 to a connection terminal (eg, as shown in FIG. 1 ). An audio signal may be received by being connected directly through the connection terminal 178 or wirelessly (eg, Bluetooth communication) through a wireless communication module (eg, the wireless communication module 192 of FIG. 1 ). According to one embodiment, the audio input interface 210 receives a control signal related to an audio signal obtained (or received) from the external electronic device 102 (eg, a volume control signal received through an input button). can do. The audio input interface 210 includes a plurality of audio input channels, and can receive different audio signals for each corresponding audio input channel among the plurality of audio input channels. According to an embodiment, additionally or alternatively, the audio input interface 210 may receive audio signals from other components (eg, the processor 120 or the memory 130 of FIG. 1 ) of the electronic device 101 . there is.

오디오 입력 믹서(220)는 입력된 복수의 오디오 신호들을 적어도 하나의 오디오 신호로 합성할 수 있다. 예를 들어, 일실시예에 따르면, 오디오 입력 믹서(220)는, 오디오 입력 인터페이스(210)를 통해 입력된 복수의 아날로그 오디오 신호들을 적어도 하나의 아날로그 오디오 신호로 합성할 수 있다.The audio input mixer 220 may synthesize a plurality of input audio signals into at least one audio signal. For example, according to one embodiment, the audio input mixer 220 may synthesize a plurality of analog audio signals input through the audio input interface 210 into at least one analog audio signal.

ADC(230)는 아날로그 오디오 신호를 디지털 오디오 신호로 변환할 수 있다. 예를 들어, 일실시예에 따르면, ADC(230)는 오디오 입력 인터페이스(210)을 통해 수신된 아날로그 오디오 신호, 또는 추가적으로 또는 대체적으로 오디오 입력 믹서(220)를 통해 합성된 아날로그 오디오 신호를 디지털 오디오 신호로 변환할 수 있다.The ADC 230 may convert an analog audio signal into a digital audio signal. For example, according to one embodiment, ADC 230 converts an analog audio signal received via audio input interface 210 or an analog audio signal synthesized via audio input mixer 220 additionally or alternatively to digital audio. can be converted into signals.

오디오 신호 처리기(240)는 ADC(230)를 통해 입력받은 디지털 오디오 신호, 또는 전자 장치(101)의 다른 구성 요소로부터 수신된 디지털 오디오 신호에 대하여 다양한 처리를 수행할 수 있다. 예를 들어, 일실시예에 따르면, 오디오 신호 처리기(240)는 하나 이상의 디지털 오디오 신호들에 대해 샘플링 비율 변경, 하나 이상의 필터 적용, 보간(interpolation) 처리, 전체 또는 일부 주파수 대역의 증폭 또는 감쇄, 노이즈 처리(예: 노이즈 또는 에코 감쇄), 채널 변경(예: 모노 및 스테레오간 전환), 합성(mixing), 또는 지정된 신호 추출을 수행할 수 있다. 일실시예에 따르면, 오디오 신호 처리기(240)의 하나 이상의 기능들은 이퀄라이저(equalizer)의 형태로 구현될 수 있다.The audio signal processor 240 may perform various processes on the digital audio signal received through the ADC 230 or the digital audio signal received from other components of the electronic device 101 . For example, according to one embodiment, the audio signal processor 240 changes the sampling rate of one or more digital audio signals, applies one or more filters, performs interpolation processing, amplifies or attenuates all or some frequency bands, It can perform noise processing (eg, noise or echo reduction), channel change (eg, switching between mono and stereo), mixing, or specified signal extraction. According to one embodiment, one or more functions of the audio signal processor 240 may be implemented in the form of an equalizer.

DAC(250)는 디지털 오디오 신호를 아날로그 오디오 신호로 변환할 수 있다. 예를 들어, 일실시예에 따르면, DAC(250)는 오디오 신호 처리기(240)에 의해 처리된 디지털 오디오 신호, 또는 전자 장치(101)의 다른 구성 요소(예: 프로세서(120) 또는 메모리(130))로부터 획득한 디지털 오디오 신호를 아날로그 오디오 신호로 변환할 수 있다.The DAC 250 may convert a digital audio signal into an analog audio signal. For example, according to one embodiment, the DAC 250 is a digital audio signal processed by the audio signal processor 240, or other components of the electronic device 101 (eg, the processor 120 or the memory 130). )) to convert the digital audio signal obtained from the analog audio signal.

오디오 출력 믹서(260)는 출력할 복수의 오디오 신호들을 적어도 하나의 오디오 신호로 합성할 수 있다. 예를 들어, 일실시예에 따르면, 오디오 출력 믹서(260)는 DAC(250)를 통해 아날로그로 전환된 오디오 신호 및 다른 아날로그 오디오 신호(예: 오디오 입력 인터페이스(210)을 통해 수신한 아날로그 오디오 신호)를 적어도 하나의 아날로그 오디오 신호로 합성할 수 있다. The audio output mixer 260 may synthesize a plurality of audio signals to be output into at least one audio signal. For example, according to one embodiment, the audio output mixer 260 includes an audio signal converted to analog through the DAC 250 and another analog audio signal (eg, an analog audio signal received through the audio input interface 210). ) into at least one analog audio signal.

오디오 출력 인터페이스(270)는 DAC(250)를 통해 변환된 아날로그 오디오 신호, 또는 추가적으로 또는 대체적으로 오디오 출력 믹서(260)에 의해 합성된 아날로그 오디오 신호를 음향 출력 모듈(155) 를 통해 전자 장치(101)의 외부로 출력할 수 있다. 음향 출력 모듈(155)는, 예를 들어, dynamic driver 또는 balanced armature driver 같은 스피커, 또는 리시버를 포함할 수 있다. 일실시예에 따르면, 음향 출력 모듈(155)는 복수의 스피커들을 포함할 수 있다. 이런 경우, 오디오 출력 인터페이스(270)는 상기 복수의 스피커들 중 적어도 일부 스피커들을 통하여 서로 다른 복수의 채널들(예: 스테레오, 또는 5.1채널)을 갖는 오디오 신호를 출력할 수 있다. 일실시예에 따르면, 오디오 출력 인터페이스(270)는 외부의 전자 장치(102)(예: 외부 스피커 또는 헤드셋)와 연결 단자(178)를 통해 직접, 또는 무선 통신 모듈(192)을 통하여 무선으로 연결되어 오디오 신호를 출력할 수 있다. The audio output interface 270 transmits the analog audio signal converted through the DAC 250 or the analog audio signal synthesized by the audio output mixer 260 additionally or alternatively to the electronic device 101 through the sound output module 155. ) can be output to the outside. The sound output module 155 may include, for example, a speaker or receiver such as a dynamic driver or a balanced armature driver. According to one embodiment, the sound output module 155 may include a plurality of speakers. In this case, the audio output interface 270 may output an audio signal having a plurality of different channels (eg, stereo or 5.1 channels) through at least some of the plurality of speakers. According to one embodiment, the audio output interface 270 is directly connected to the external electronic device 102 (eg, an external speaker or headset) through a connection terminal 178 or wirelessly through a wireless communication module 192. and output an audio signal.

일실시예에 따르면, 오디오 모듈(170)은 오디오 입력 믹서(220) 또는 오디오 출력 믹서(260)를 별도로 구비하지 않고, 오디오 신호 처리기(240)의 적어도 하나의 기능을 이용하여 복수의 디지털 오디오 신호들을 합성하여 적어도 하나의 디지털 오디오 신호를 생성할 수 있다.According to one embodiment, the audio module 170 does not separately include the audio input mixer 220 or the audio output mixer 260, and uses at least one function of the audio signal processor 240 to generate a plurality of digital audio signals. At least one digital audio signal may be generated by combining them.

일실시예에 따르면, 오디오 모듈(170)은 오디오 입력 인터페이스(210)를 통해 입력된 아날로그 오디오 신호, 또는 오디오 출력 인터페이스(270)를 통해 출력될 오디오 신호를 증폭할 수 있는 오디오 증폭기(미도시)(예: 스피커 증폭 회로)를 포함할 수 있다. 일실시예에 따르면, 상기 오디오 증폭기는 오디오 모듈(170)과 별도의 모듈로 구성될 수 있다.According to one embodiment, the audio module 170 is an audio amplifier (not shown) capable of amplifying an analog audio signal input through the audio input interface 210 or an audio signal to be output through the audio output interface 270. (e.g. speaker amplification circuit). According to one embodiment, the audio amplifier may be configured as a separate module from the audio module 170.

도 3는, 다양한 실시예들에 따른, 카메라 모듈(180)을 예시하는 블럭도(300)이다. 도 3를 참조하면, 카메라 모듈(180)은 렌즈 어셈블리(310), 플래쉬(320), 이미지 센서(330), 이미지 스태빌라이저(340), 메모리(350)(예: 버퍼 메모리), 또는 이미지 시그널 프로세서(360)를 포함할 수 있다. 렌즈 어셈블리(310)는 이미지 촬영의 대상인 피사체로부터 방출되는 빛을 수집할 수 있다. 렌즈 어셈블리(310)는 하나 또는 그 이상의 렌즈들을 포함할 수 있다. 일실시예에 따르면, 카메라 모듈(180)은 복수의 렌즈 어셈블리(310)들을 포함할 수 있다. 이런 경우, 카메라 모듈(180)은, 예를 들면, 듀얼 카메라, 360도 카메라, 또는 구형 카메라(spherical camera)를 형성할 수 있다. 복수의 렌즈 어셈블리(310)들 중 일부는 동일한 렌즈 속성(예: 화각, 초점 거리, 자동 초점, f 넘버(f number), 또는 광학 줌)을 갖거나, 또는 적어도 하나의 렌즈 어셈블리는 다른 렌즈 어셈블리의 렌즈 속성들과 다른 하나 이상의 렌즈 속성들을 가질 수 있다. 렌즈 어셈블리(310)는, 예를 들면, 광각 렌즈 또는 망원 렌즈를 포함할 수 있다. 3 is a block diagram 300 illustrating a camera module 180, in accordance with various embodiments. Referring to FIG. 3 , the camera module 180 includes a lens assembly 310, a flash 320, an image sensor 330, an image stabilizer 340, a memory 350 (eg, a buffer memory), or an image signal processor. (360). The lens assembly 310 may collect light emitted from a subject that is an image capturing target. Lens assembly 310 may include one or more lenses. According to one embodiment, the camera module 180 may include a plurality of lens assemblies 310 . In this case, the camera module 180 may form, for example, a dual camera, a 360-degree camera, or a spherical camera. Some of the plurality of lens assemblies 310 may have the same lens properties (eg, angle of view, focal length, auto focus, f number, or optical zoom), or at least one lens assembly may have the same lens properties as other lens assemblies. may have one or more lens properties different from the lens properties of . The lens assembly 310 may include, for example, a wide-angle lens or a telephoto lens.

플래쉬(320)는 피사체로부터 방출 또는 반사되는 빛을 강화하기 위하여 사용되는 빛을 방출할 수 있다. 일실시예에 따르면, 플래쉬(320)는 하나 이상의 발광 다이오드들(예: RGB(red-green-blue) LED, white LED, infrared LED, 또는 ultraviolet LED), 또는 xenon lamp를 포함할 수 있다. 이미지 센서(330)는 피사체로부터 방출 또는 반사되어 렌즈 어셈블리(310)를 통해 전달된 빛을 전기적인 신호로 변환함으로써, 상기 피사체에 대응하는 이미지를 획득할 수 있다. 일실시예에 따르면, 이미지 센서(330)는, 예를 들면, RGB 센서, BW(black and white) 센서, IR 센서, 또는 UV 센서와 같이 속성이 다른 이미지 센서들 중 선택된 하나의 이미지 센서, 동일한 속성을 갖는 복수의 이미지 센서들, 또는 다른 속성을 갖는 복수의 이미지 센서들을 포함할 수 있다. 이미지 센서(330)에 포함된 각각의 이미지 센서는, 예를 들면, CCD(charged coupled device) 센서 또는 CMOS(complementary metal oxide semiconductor) 센서를 이용하여 구현될 수 있다.The flash 320 may emit light used to enhance light emitted or reflected from a subject. According to one embodiment, the flash 320 may include one or more light emitting diodes (eg, a red-green-blue (RGB) LED, a white LED, an infrared LED, or an ultraviolet LED), or a xenon lamp. The image sensor 330 may obtain an image corresponding to the subject by converting light emitted or reflected from the subject and transmitted through the lens assembly 310 into an electrical signal. According to one embodiment, the image sensor 330 is, for example, an image sensor selected from among image sensors having different properties, such as an RGB sensor, a black and white (BW) sensor, an IR sensor, or a UV sensor, It may include a plurality of image sensors having attributes, or a plurality of image sensors having other attributes. Each image sensor included in the image sensor 330 may be implemented using, for example, a charged coupled device (CCD) sensor or a complementary metal oxide semiconductor (CMOS) sensor.

이미지 스태빌라이저(340)는 카메라 모듈(180) 또는 이를 포함하는 전자 장치(101)의 움직임에 반응하여, 렌즈 어셈블리(310)에 포함된 적어도 하나의 렌즈 또는 이미지 센서(330)를 특정한 방향으로 움직이거나 이미지 센서(330)의 동작 특성을 제어(예: 리드 아웃(read-out) 타이밍을 조정 등)할 수 있다. 이는 촬영되는 이미지에 대한 상기 움직임에 의한 부정적인 영향의 적어도 일부를 보상하게 해 준다. 일실시예에 따르면, 이미지 스태빌라이저(340)는 카메라 모듈(180)의 내부 또는 외부에 배치된 자이로 센서(미도시) 또는 가속도 센서(미도시)를 이용하여 카메라 모듈(180) 또는 전자 장치(101)의 움직임을 감지할 수 있다. 일실시예에 따르면, 이미지 스태빌라이저(340)는, 예를 들면, 광학식 이미지 스태빌라이저로 구현될 수 있다. 메모리(350)는 이미지 센서(330)을 통하여 획득된 이미지의 적어도 일부를 다음 이미지 처리 작업을 위하여 적어도 일시 저장할 수 있다. 예를 들어, 셔터에 따른 이미지 획득이 지연되거나, 또는 복수의 이미지들이 고속으로 획득되는 경우, 획득된 원본 이미지(예: Bayer-patterned 이미지 또는 높은 해상도의 이미지)는 메모리(350)에 저장이 되고, 그에 대응하는 사본 이미지(예: 낮은 해상도의 이미지)는 디스플레이 모듈(160)을 통하여 프리뷰될 수 있다. 이후, 지정된 조건이 만족되면(예: 사용자 입력 또는 시스템 명령) 메모리(350)에 저장되었던 원본 이미지의 적어도 일부가, 예를 들면, 이미지 시그널 프로세서(360)에 의해 획득되어 처리될 수 있다. 일실시예에 따르면, 메모리(350)는 메모리(130)의 적어도 일부로, 또는 이와는 독립적으로 운영되는 별도의 메모리로 구성될 수 있다.The image stabilizer 340 moves at least one lens or image sensor 330 included in the lens assembly 310 in a specific direction in response to movement of the camera module 180 or the electronic device 101 including the same. Operation characteristics of the image sensor 330 may be controlled (eg, read-out timing is adjusted, etc.). This makes it possible to compensate at least part of the negative effect of the movement on the image being taken. According to an embodiment, the image stabilizer 340 uses a gyro sensor (not shown) or an acceleration sensor (not shown) disposed inside or outside the camera module 180 to control the camera module 180 or the electronic device 101 . ) can be detected. According to one embodiment, the image stabilizer 340 may be implemented as, for example, an optical image stabilizer. The memory 350 may at least temporarily store at least a portion of an image acquired through the image sensor 330 for a next image processing task. For example, when image acquisition is delayed according to the shutter, or a plurality of images are acquired at high speed, the acquired original image (eg, a Bayer-patterned image or a high-resolution image) is stored in the memory 350 and , a copy image (eg, a low resolution image) corresponding thereto may be previewed through the display module 160 . Thereafter, when a specified condition is satisfied (eg, a user input or a system command), at least a part of the original image stored in the memory 350 may be acquired and processed by, for example, the image signal processor 360 . According to one embodiment, the memory 350 may be configured as at least a part of the memory 130 or as a separate memory operated independently of the memory 130 .

이미지 시그널 프로세서(360)는 이미지 센서(330)을 통하여 획득된 이미지 또는 메모리(350)에 저장된 이미지에 대하여 하나 이상의 이미지 처리들을 수행할 수 있다. 상기 하나 이상의 이미지 처리들은, 예를 들면, 깊이 지도(depth map) 생성, 3차원 모델링, 파노라마 생성, 특징점 추출, 이미지 합성, 또는 이미지 보상(예: 노이즈 감소, 해상도 조정, 밝기 조정, 블러링(blurring), 샤프닝(sharpening), 또는 소프트닝(softening))을 포함할 수 있다. 추가적으로 또는 대체적으로, 이미지 시그널 프로세서(360)는 카메라 모듈(180)에 포함된 구성 요소들 중 적어도 하나(예: 이미지 센서(330))에 대한 제어(예: 노출 시간 제어, 또는 리드 아웃 타이밍 제어 등)를 수행할 수 있다. 이미지 시그널 프로세서(360)에 의해 처리된 이미지는 추가 처리를 위하여 메모리(350)에 다시 저장되거나 카메라 모듈(180)의 외부 구성 요소(예: 메모리(130), 디스플레이 모듈(160), 전자 장치(102), 전자 장치(104), 또는 서버(108))로 제공될 수 있다. 일실시예에 따르면, 이미지 시그널 프로세서(360)는 프로세서(120)의 적어도 일부로 구성되거나, 프로세서(120)와 독립적으로 운영되는 별도의 프로세서로 구성될 수 있다. 이미지 시그널 프로세서(360)이 프로세서(120)과 별도의 프로세서로 구성된 경우, 이미지 시그널 프로세서(360)에 의해 처리된 적어도 하나의 이미지는 프로세서(120)에 의하여 그대로 또는 추가의 이미지 처리를 거친 후 디스플레이 모듈(160)를 통해 표시될 수 있다.The image signal processor 360 may perform one or more image processes on an image acquired through the image sensor 330 or an image stored in the memory 350 . The one or more image processes, for example, depth map generation, 3D modeling, panorama generation, feature point extraction, image synthesis, or image compensation (eg, noise reduction, resolution adjustment, brightness adjustment, blurring ( blurring, sharpening, or softening). Additionally or alternatively, the image signal processor 360 controls (eg, exposure time control or read-out timing control) for at least one of the components included in the camera module 180 (eg, the image sensor 330). etc.) can be performed. Images processed by the image signal processor 360 are stored again in the memory 350 for further processing or external components of the camera module 180 (eg, memory 130, display module 160, electronic device ( 102), the electronic device 104, or the server 108). According to one embodiment, the image signal processor 360 may be configured as at least a part of the processor 120 or as a separate processor operated independently of the processor 120 . When the image signal processor 360 is configured as a separate processor from the processor 120, at least one image processed by the image signal processor 360 is displayed by the processor 120 as it is or after additional image processing. It can be displayed via module 160 .

일실시예에 따르면, 전자 장치(101)는 각각 다른 속성 또는 기능을 가진 복수의 카메라 모듈(180)들을 포함할 수 있다. 이런 경우, 예를 들면, 상기 복수의 카메라 모듈(180)들 중 적어도 하나는 광각 카메라이고, 적어도 다른 하나는 망원 카메라일 수 있다. 유사하게, 상기 복수의 카메라 모듈(180)들 중 적어도 하나는 전면 카메라이고, 적어도 다른 하나는 후면 카메라일 수 있다.According to an embodiment, the electronic device 101 may include a plurality of camera modules 180 each having different properties or functions. In this case, for example, at least one of the plurality of camera modules 180 may be a wide-angle camera, and at least the other may be a telephoto camera. Similarly, at least one of the plurality of camera modules 180 may be a front camera, and at least another one may be a rear camera.

본 문서에 개시된 다양한 실시예들에 따른 전자 장치는 다양한 형태의 장치가 될 수 있다. 전자 장치는, 예를 들면, 휴대용 통신 장치(예: 스마트폰), 컴퓨터 장치, 휴대용 멀티미디어 장치, 휴대용 의료 기기, 카메라, 웨어러블 장치, 또는 가전 장치를 포함할 수 있다. 본 문서의 실시예에 따른 전자 장치는 전술한 기기들에 한정되지 않는다.Electronic devices according to various embodiments disclosed in this document may be devices of various types. The electronic device may include, for example, a portable communication device (eg, a smart phone), a computer device, a portable multimedia device, a portable medical device, a camera, a wearable device, or a home appliance. An electronic device according to an embodiment of the present document is not limited to the aforementioned devices.

본 문서의 다양한 실시예들 및 이에 사용된 용어들은 본 문서에 기재된 기술적 특징들을 특정한 실시예들로 한정하려는 것이 아니며, 해당 실시예의 다양한 변경, 균등물, 또는 대체물을 포함하는 것으로 이해되어야 한다. 도면의 설명과 관련하여, 유사한 또는 관련된 구성요소에 대해서는 유사한 참조 부호가 사용될 수 있다. 아이템에 대응하는 명사의 단수 형은 관련된 문맥상 명백하게 다르게 지시하지 않는 한, 상기 아이템 한 개 또는 복수 개를 포함할 수 있다. 본 문서에서, "A 또는 B", "A 및 B 중 적어도 하나", "A 또는 B 중 적어도 하나", "A, B 또는 C", "A, B 및 C 중 적어도 하나", 및 "A, B, 또는 C 중 적어도 하나"와 같은 문구들 각각은 그 문구들 중 해당하는 문구에 함께 나열된 항목들 중 어느 하나, 또는 그들의 모든 가능한 조합을 포함할 수 있다. "제1", "제2", 또는 "첫째" 또는 "둘째"와 같은 용어들은 단순히 해당 구성요소를 다른 해당 구성요소와 구분하기 위해 사용될 수 있으며, 해당 구성요소들을 다른 측면(예: 중요성 또는 순서)에서 한정하지 않는다. 어떤(예: 제1) 구성요소가 다른(예: 제2) 구성요소에, "기능적으로" 또는 "통신적으로"라는 용어와 함께 또는 이런 용어 없이, "커플드" 또는 "커넥티드"라고 언급된 경우, 그것은 상기 어떤 구성요소가 상기 다른 구성요소에 직접적으로(예: 유선으로), 무선으로, 또는 제3 구성요소를 통하여 연결될 수 있다는 것을 의미한다.Various embodiments of this document and terms used therein are not intended to limit the technical features described in this document to specific embodiments, but should be understood to include various modifications, equivalents, or substitutes of the embodiments. In connection with the description of the drawings, like reference numerals may be used for like or related elements. The singular form of a noun corresponding to an item may include one item or a plurality of items, unless the relevant context clearly dictates otherwise. In this document, "A or B", "at least one of A and B", "at least one of A or B", "A, B or C", "at least one of A, B and C", and "A Each of the phrases such as "at least one of , B, or C" may include any one of the items listed together in that phrase, or all possible combinations thereof. Terms such as "first", "second", or "first" or "secondary" may simply be used to distinguish that component from other corresponding components, and may refer to that component in other respects (eg, importance or order) is not limited. A (eg, first) component is said to be "coupled" or "connected" to another (eg, second) component, with or without the terms "functionally" or "communicatively." When mentioned, it means that the certain component may be connected to the other component directly (eg by wire), wirelessly, or through a third component.

본 문서의 다양한 실시예들에서 사용된 용어 "모듈"은 하드웨어, 소프트웨어 또는 펌웨어로 구현된 유닛을 포함할 수 있으며, 예를 들면, 로직, 논리 블록, 부품, 또는 회로와 같은 용어와 상호 호환적으로 사용될 수 있다. 모듈은, 일체로 구성된 부품 또는 하나 또는 그 이상의 기능을 수행하는, 상기 부품의 최소 단위 또는 그 일부가 될 수 있다. 일 실시예에 따르면, 모듈은 ASIC(application-specific integrated circuit)의 형태로 구현될 수 있다. The term "module" used in various embodiments of this document may include a unit implemented in hardware, software, or firmware, and is interchangeably interchangeable with terms such as, for example, logic, logical blocks, parts, or circuits. can be used as A module may be an integral part or the smallest unit of a part or part thereof that performs one or more functions. According to one embodiment, the module may be implemented in the form of an application-specific integrated circuit (ASIC).

본 문서의 다양한 실시예들은 기기(machine)(예: 전자 장치(101)) 의해 읽을 수 있는 저장 매체(storage medium)(예: 내장 메모리(136) 또는 외장 메모리(138))에 저장된 하나 이상의 명령어들을 포함하는 소프트웨어(예: 프로그램(140))로서 구현될 수 있다. 예를 들면, 기기(예: 전자 장치(101))의 프로세서(예: 프로세서(120))는, 저장 매체로부터 저장된 하나 이상의 명령어들 중 적어도 하나의 명령을 호출하고, 그것을 실행할 수 있다. 이것은 기기가 상기 호출된 적어도 하나의 명령어에 따라 적어도 하나의 기능을 수행하도록 운영되는 것을 가능하게 한다. 상기 하나 이상의 명령어들은 컴파일러에 의해 생성된 코드 또는 인터프리터에 의해 실행될 수 있는 코드를 포함할 수 있다. 기기로 읽을 수 있는 저장 매체는, 비일시적(non-transitory) 저장 매체의 형태로 제공될 수 있다. 여기서, ‘비일시적’은 저장 매체가 실재(tangible)하는 장치이고, 신호(signal)(예: 전자기파)를 포함하지 않는다는 것을 의미할 뿐이며, 이 용어는 데이터가 저장 매체에 반영구적으로 저장되는 경우와 임시적으로 저장되는 경우를 구분하지 않는다.Various embodiments of this document provide one or more instructions stored in a storage medium (eg, internal memory 136 or external memory 138) readable by a machine (eg, electronic device 101). It may be implemented as software (eg, the program 140) including them. For example, a processor (eg, the processor 120 ) of a device (eg, the electronic device 101 ) may call at least one command among one or more instructions stored from a storage medium and execute it. This enables the device to be operated to perform at least one function according to the at least one command invoked. The one or more instructions may include code generated by a compiler or code executable by an interpreter. The device-readable storage medium may be provided in the form of a non-transitory storage medium. Here, 'non-temporary' only means that the storage medium is a tangible device and does not contain a signal (e.g. electromagnetic wave), and this term refers to the case where data is stored semi-permanently in the storage medium. It does not discriminate when it is temporarily stored.

일 실시예에 따르면, 본 문서에 개시된 다양한 실시예들에 따른 방법은 컴퓨터 프로그램 제품(computer program product)에 포함되어 제공될 수 있다. 컴퓨터 프로그램 제품은 상품으로서 판매자 및 구매자 간에 거래될 수 있다. 컴퓨터 프로그램 제품은 기기로 읽을 수 있는 저장 매체(예: compact disc read only memory(CD-ROM))의 형태로 배포되거나, 또는 어플리케이션 스토어(예: 플레이 스토어^TM)를 통해 또는 두 개의 사용자 장치들(예: 스마트 폰들) 간에 직접, 온라인으로 배포(예: 다운로드 또는 업로드)될 수 있다. 온라인 배포의 경우에, 컴퓨터 프로그램 제품의 적어도 일부는 제조사의 서버, 어플리케이션 스토어의 서버, 또는 중계 서버의 메모리와 같은 기기로 읽을 수 있는 저장 매체에 적어도 일시 저장되거나, 임시적으로 생성될 수 있다.According to one embodiment, the method according to various embodiments disclosed in this document may be provided by being included in a computer program product. Computer program products may be traded between sellers and buyers as commodities. A computer program product is distributed in the form of a device-readable storage medium (eg compact disc read only memory (CD-ROM)), or through an application store (eg Play Store ^TM ) or on two user devices ( It can be distributed (eg downloaded or uploaded) online, directly between smart phones. In the case of online distribution, at least part of the computer program product may be temporarily stored or temporarily created in a device-readable storage medium such as a manufacturer's server, an application store server, or a relay server's memory.

다양한 실시예들에 따르면, 상기 기술한 구성요소들의 각각의 구성요소(예: 모듈 또는 프로그램)는 단수 또는 복수의 개체를 포함할 수 있으며, 복수의 개체 중 일부는 다른 구성요소에 분리 배치될 수도 있다. 다양한 실시예들에 따르면, 전술한 해당 구성요소들 중 하나 이상의 구성요소들 또는 동작들이 생략되거나, 또는 하나 이상의 다른 구성요소들 또는 동작들이 추가될 수 있다. 대체적으로 또는 추가적으로, 복수의 구성요소들(예: 모듈 또는 프로그램)은 하나의 구성요소로 통합될 수 있다. 이런 경우, 통합된 구성요소는 상기 복수의 구성요소들 각각의 구성요소의 하나 이상의 기능들을 상기 통합 이전에 상기 복수의 구성요소들 중 해당 구성요소에 의해 수행되는 것과 동일 또는 유사하게 수행할 수 있다. 다양한 실시예들에 따르면, 모듈, 프로그램 또는 다른 구성요소에 의해 수행되는 동작들은 순차적으로, 병렬적으로, 반복적으로, 또는 휴리스틱하게 실행되거나, 상기 동작들 중 하나 이상이 다른 순서로 실행되거나, 생략되거나, 또는 하나 이상의 다른 동작들이 추가될 수 있다.According to various embodiments, each component (eg, module or program) of the above-described components may include a single object or a plurality of entities, and some of the plurality of entities may be separately disposed in other components. there is. According to various embodiments, one or more components or operations among the aforementioned corresponding components may be omitted, or one or more other components or operations may be added. Alternatively or additionally, a plurality of components (eg modules or programs) may be integrated into a single component. In this case, the integrated component may perform one or more functions of each of the plurality of components identically or similarly to those performed by a corresponding component among the plurality of components prior to the integration. . According to various embodiments, the actions performed by a module, program, or other component are executed sequentially, in parallel, iteratively, or heuristically, or one or more of the actions are executed in a different order, or omitted. or one or more other actions may be added.

다양한 하우징 구조가 휴대 전자 장치(예: 도 1의 전자 장치(101))에 적용될 수 있다. 예를 들어, 휴대 전자 장치는 이른 바 바(bar) 타입의 하우징 구조, 폴더블(folable) 하우징 구조, 또는 슬라이더블(slidable)(또는, 롤러블(rollable)) 하우징 구조를 가질 수 있다. 일 실시예에서, 바 타입의 하우징 구조는 휴대 전자 장치는 휴대 전자 장치의 전면을 형성하는 제1 커버, 휴대 전자 장치의 후면을 형성하는 제2 커버, 및 휴대 전자 장치의 측면을 형성하는 측면 베젤 구조를 포함할 수 있다. 일 실시예에서, 폴더블 하우징 구조는 제1 하우징, 제2 하우징, 및 두 하우징들이 회동(rotatable) 가능하도록 하는 힌지 조립체를 포함할 수 있다. 폴더블 하우징 구조에서 제1 하우징에 디스플레이(예: 플랙서블(flexible) 디스플레이)의 제1 부분이 배치되고 제2 하우징에 디스플레이의 제2 부분이 배치될 수 있다. 폴더블 하우징 구조는 휴대 전자 장치가 접힐 때 제1부분과 제2부분이 서로 마주하는 인 폴딩(in folding) 방식으로 구현될 수 있다. 또는, 폴더블 하우징 구조는 휴대 전자 장치가 접힐 때 제1부분과 제2부분이 서로 반대로 향하는 아웃 폴딩(out folding) 방식으로 구현될 수도 있다. 일 실시예에서, 슬라이더블 하우징 구조는 하우징, 슬라이더부, 및 슬라이더부의 일부가 하우징 내부로 인입되거나 하우징으로부터 인출되도록 하는 롤러를 포함할 수 있다. 슬라이더가 하우징 내부로 인입됨에 따라 디스플레이(예: 플랙서블 디스플레이)의 일부가 하우징 내부로 들어갈 수 있다. Various housing structures may be applied to a portable electronic device (eg, the electronic device 101 of FIG. 1 ). For example, the portable electronic device may have a so-called bar-type housing structure, a foldable housing structure, or a slidable (or rollable) housing structure. In one embodiment, the bar-type housing structure includes a first cover forming a front surface of the portable electronic device, a second cover forming a rear surface of the portable electronic device, and a side bezel forming a side surface of the portable electronic device. structure may be included. In one embodiment, the foldable housing structure may include a first housing, a second housing, and a hinge assembly that makes the two housings rotatable. In the foldable housing structure, a first part of a display (eg, a flexible display) may be disposed in a first housing, and a second part of the display may be disposed in a second housing. The foldable housing structure may be implemented in an in-folding method in which a first part and a second part face each other when the portable electronic device is folded. Alternatively, the foldable housing structure may be implemented in an out-folding manner in which the first part and the second part face each other in opposite directions when the portable electronic device is folded. In one embodiment, the slideable housing structure may include a housing, a slider portion, and a roller through which a portion of the slider portion is drawn into or drawn out of the housing. As the slider is drawn into the housing, a portion of the display (eg, the flexible display) may enter the housing.

이하에서, 설명의 편의 상, 디스플레이(예: 플랙서블 디스플레이)가 사용자에게 시각적으로 노출되는 면을 휴대 전자 장치의 전면(또는, 제1 면)으로 지칭될 수 있다. 그리고, 전면의 반대 면을 휴대 전자 장치의 후면(또는, 제2 면)으로 지칭될 수 있다. 또한, 전면과 후면 사이의 공간을 둘러싸는 면을 휴대 전자 장치의 측면으로 지칭될 수 있다. 일 실시예에서, 휴대 전자 장치는 후면을 통해 시각적으로 노출되는 별도의 서브 디스플레이를 더 포함할 수도 있다.Hereinafter, for convenience of description, a surface on which a display (eg, a flexible display) is visually exposed to a user may be referred to as a front surface (or first surface) of the portable electronic device. Also, a surface opposite to the front surface may be referred to as a rear surface (or a second surface) of the portable electronic device. Also, a surface surrounding a space between the front and rear surfaces may be referred to as a side surface of the portable electronic device. In one embodiment, the portable electronic device may further include a separate sub-display visually exposed through the rear surface.

도 4a는 바(bar) 타입의 하우징 구조를 갖는 휴대 전자 장치(400) 전면의 사시도이다. 도 4b는 도 4a의 전자 장치(400) 후면의 사시도이다.4A is a perspective view of the front of a portable electronic device 400 having a bar-type housing structure. 4B is a perspective view of the back of the electronic device 400 of FIG. 4A.

도 4a 및 도 4b를 참조하면, 휴대 전자 장치(400)(예: 도 1의 전자 장치(101))의 하우징(410)은, 제1 면(또는 전면)(410A), 제2 면(또는 후면)(410B), 및 제1 면(410A)과 제2 면(410B) 사이의 공간을 둘러싸는 측면(410C)을 포함할 수 있다. 일 실시예에 따르면, 제1 면(410A)은 적어도 일부분이 실질적으로 투명한 전면 플레이트(예: 다양한 코팅 레이어들을 포함하는 글라스 플레이트, 또는 폴리머 플레이트)로 구성될 수 있다. 제2 면(410B)은 실질적으로 불투명한 후면 플레이트로 구성될 수 있다. 측면(410C)은, 전면 플레이트 및 후면 플레이트와 결합하며, 금속 및/또는 폴리머를 포함하는 측면 베젤 구조(또는 "측면 부재")로 구성될 수 있다. Referring to FIGS. 4A and 4B , a housing 410 of a portable electronic device 400 (eg, the electronic device 101 of FIG. 1 ) has a first surface (or front surface) 410A, and a second surface (or electronic device 101). It may include a rear surface) 410B, and a side surface 410C surrounding a space between the first surface 410A and the second surface 410B. According to one embodiment, the first surface 410A may be composed of a front plate (eg, a glass plate or a polymer plate including various coating layers) that is substantially transparent at least in part. The second face 410B may be composed of a substantially opaque back plate. The side surface 410C is combined with the front plate and the back plate and may be composed of a side bezel structure (or “side member”) including metal and/or polymer.

일 실시예에 따르면, 휴대 전자 장치(400)는, 디스플레이(401), 마이크 홀(403), 스피커 홀(407, 414), 센서 모듈(404, 419), 카메라 모듈(405, 412, 413), 키 입력 장치(417), 및 커넥터(408) 중 적어도 하나 이상을 포함할 수 있다. 어떤 실시예에서는, 휴대 전자 장치(400)는, 구성 요소들 중 적어도 하나(예: 키 입력 장치(417))를 생략하거나 다른 구성 요소를 추가적으로 포함할 수 있다.According to an embodiment, the portable electronic device 400 includes a display 401, a microphone hole 403, speaker holes 407 and 414, sensor modules 404 and 419, and camera modules 405, 412 and 413. , a key input device 417 , and a connector 408 . In some embodiments, the portable electronic device 400 may omit at least one of the components (eg, the key input device 417) or may additionally include other components.

디스플레이(401)(예: 도 1의 디스플레이 모듈(160))는 제1 면(410A)을 통해 노출될 수 있다. 디스플레이(401)는, 터치 감지 회로, 터치의 세기(압력)를 측정할 수 있는 압력 센서, 및/또는 자기장 방식의 스타일러스 펜을 검출하는 디지타이저와 결합되거나 인접하여 배치될 수 있다. The display 401 (eg, the display module 160 of FIG. 1 ) may be exposed through the first surface 410A. The display 401 may be combined with or disposed adjacent to a touch sensing circuit, a pressure sensor capable of measuring the intensity (pressure) of a touch, and/or a digitizer that detects a magnetic field type stylus pen.

센서 모듈(404, 419)(예: 도 1의 센서 모듈(176))은, 휴대 전자 장치(400)의 내부의 작동 상태, 또는 외부의 환경 상태에 반응하여 전기 신호 또는 데이터를 생성할 수 있다. 일 실시예에서, 센서 모듈(404, 419)은 제1 면(410A)에 배치된 제1 센서 모듈(404)(예: 근접 센서 및/또는 지문 센서) 및/또는 제2 면(410B)에 배치된 제2 센서 모듈(419)(예: HRM 센서)을 포함할 수 있다. 제1센서 모듈(404)은, 제1 면(410A) 위에서 디스플레이(401)를 바라볼 때, 디스플레이(401) 아래에 배치될 수 있다. The sensor modules 404 and 419 (eg, the sensor module 176 of FIG. 1 ) may generate electrical signals or data in response to an internal operating state of the portable electronic device 400 or an external environmental state. . In one embodiment, the sensor modules 404 and 419 are coupled to a first sensor module 404 (eg, a proximity sensor and/or a fingerprint sensor) disposed on the first side 410A and/or on the second side 410B. A disposed second sensor module 419 (eg, an HRM sensor) may be included. The first sensor module 404 may be disposed under the display 401 when viewing the display 401 from the first surface 410A.

카메라 모듈(405, 412, 413)(예: 도 1의 카메라 모듈(180))은 제1 면(410A)에 배치된 전면 카메라(405), 제2 면(410B)에 배치된 하나 이상의 후면 카메라(412) 및 플래시(413)를 포함할 수 있다. 카메라들(405, 412)은, 렌즈 어셈블리(하나 또는 복수의 렌즈들을 포함), 이미지 센서, 및/또는 이미지 시그널 프로세서를 포함할 수 있다. 플래시(413)는, 예를 들어, 발광 다이오드 또는 제논 램프(xenon lamp)를 포함할 수 있다. 어떤 실시예에서는, 2개 이상의 렌즈들(예: 광각 렌즈, 초광각 렌즈 또는 망원 렌즈) 및 이미지 센서들이 전자 장치(400)의 한 면에 배치될 수 있다. 일 실시예에서, 전면 카메라(405)는, 제1 면(410A) 위에서 디스플레이(401)를 바라볼 때, 디스플레이(401) 아래에 배치됨에 따라 디스플레이(401)를 통해 빛을 받아들이는UDC(under display camera)일 수 있다. 디스플레이(401)에 있어서 전면 카메라(405)와 마주하는 부위는 홀(예: punch hole)(또는, 오프닝(opening))이 형성될 수 있다. 예를 들어, 디스플레이(401)는 여러 개의 층들(예: 편광 필름, 디스플레이 패널, 부자재층(예: 디스플레이 패널에서 생성된 빛 또는 외부로부터 디스플레이 패널로 입사되는 빛을 차단하기 위한 차광층, 방열 시트, 스폰지))로 이루어질 수 있는데, 디스플레이(401)에 있어서 적어도 하나의 층(예: 디스플레이 패널)을 제외한 나머지 층에 관통 홀이 형성될 수 있다. 다른 예로, 모든 층들에 관통 홀(예: punch hole)이 형성될 수도 있다. 전면 카메라(405)의 적어도 일부(예: 렌즈)는 디스플레이(401)에 천공된 홀의 내부 공간 상에 배치될 수도 있다. 도시하지는 않지만, 어떠한 실시예에서는 복수의 전면 카메라들이 디스플레이(401) 아래에 배치될 수도 있다. 전자 장치(예: 도 1의 전자 장치(101))는 후면 디스플레이를 포함할 수도 있다. 이에 따라, 후면 디스플레이 아래에도 후면 디스플레이를 통해 빛을 받아들이는 UDC가 추가 배치될 수 있다. 후면 카메라(412)는 후면에 배열된(예: 도시된 바와 같이 일직선으로 나란히 배열된) 복수의 렌즈들을 갖는 어레이(array) 카메라를 포함할 수 있다.The camera modules 405, 412, and 413 (eg, the camera module 180 of FIG. 1) include a front camera 405 disposed on the first surface 410A and one or more rear cameras disposed on the second surface 410B. (412) and flash (413). The cameras 405 and 412 may include a lens assembly (including one or a plurality of lenses), an image sensor, and/or an image signal processor. The flash 413 may include, for example, a light emitting diode or a xenon lamp. In some embodiments, two or more lenses (eg, a wide-angle lens, an ultra-wide-angle lens, or a telephoto lens) and image sensors may be disposed on one side of the electronic device 400 . In one embodiment, the front camera 405, when viewing the display 401 from above the first side 410A, is positioned under the display 401 to receive light through the display 401 (UDC). display camera). A hole (eg, a punch hole) (or opening) may be formed in a portion of the display 401 facing the front camera 405 . For example, the display 401 includes several layers (e.g., a polarizing film, a display panel, a subsidiary material layer (e.g., a light blocking layer for blocking light generated from the display panel or light incident on the display panel from the outside), and a heat dissipation sheet. , Sponge)), and in the display 401, through-holes may be formed in the remaining layers except for at least one layer (eg, the display panel). As another example, through holes (eg, punch holes) may be formed in all layers. At least a part (eg, lens) of the front camera 405 may be disposed on an inner space of a hole drilled in the display 401 . Although not shown, in some embodiments, a plurality of front cameras may be disposed below the display 401 . An electronic device (eg, the electronic device 101 of FIG. 1 ) may include a rear display. Accordingly, a UDC that receives light through the rear display may be additionally disposed under the rear display. The rear camera 412 may include an array camera having a plurality of lenses arranged on the rear surface (eg, arranged side by side in a straight line as shown).

도시하지는 않지만, 폴더블 하우징 구조 또는 슬라이더블 하우징 구조를 갖는 휴대 전자 장치에 전면 카메라 및/또는 후면 카메라가 구성될 수도 있다. Although not shown, a front camera and/or a rear camera may be configured in a portable electronic device having a foldable housing structure or a slideable housing structure.

도 5는, 일 실시예에 따른, AV 싱크 일치된 합성 AV 파일을 생성하도록 구성된 전자 장치(500)의 블록도이다. 도 6은 도 5의 기록 모듈(503)이 타임스탬프(timestamp)가 표기되어 있는 원본 AV 파일들(630, 640)을 생성하는 동작을 예시한다. 본 문서의 다양한 실시예에서 타임스탬프는 해당 원본 AV 파일에서 첫번째 오디오 프레임이 생성된 시점을 나타내는 오디오 타임스탬프와 첫번째 비디오 프레임이 생성된 시점을 나타내는 비디오 타임스탬프를 포함할 수 있다. 본 문서의 다양한 실시예에서 첫번째 프레임이란, 해당 원본 AV 파일에서 시간 상 가장 먼저 생성된 프레임으로 정의될 수 있다. 예컨대, 첫번째 오디오 프레임은 해당 레코더가 녹음을 시작하여 처음으로 생성한 오디오 프레임일 수 있다. 첫번째 비디오 프레임은 해당 레코더가 녹화를 시작하여 처음으로 생성한 비디오 프레임일 수 있다. 도 7 및 도 8은 도 5의 비동기화 구간 결정 모듈(506)이 타임스탬프를 이용하여 원본 AV 파일들(630, 640)에 있어서 합성 시 오디오와 비디오의 동기화에 악영향을 줄 수 있는 비동기화 프레임 구간을 결정하는 동작을 예시한다. 또한, 도 7과 도 8은 도 5의 편집 모듈(505)이 비동기화 프레임 구간(section)을 제외한 나머지 프레임 구간을 이용하여 하나의 제1 합성 AV 파일(750)을 생성하는 동작을 예시한다. 또한, 도 7과 도 8은, 비동기화 프레임 구간을 더 포함하되, 오디오와 비디오의 동기화가 유지된 제2 합성 AV 파일(760)을 생성하는 동작을 예시한다.5 is a block diagram of an electronic device 500 configured to generate an AV synchronized composite AV file, according to one embodiment. FIG. 6 illustrates an operation in which the recording module 503 of FIG. 5 creates original AV files 630 and 640 marked with timestamps. In various embodiments of the present document, the timestamp may include an audio timestamp indicating when the first audio frame was created in the original AV file and a video timestamp indicating when the first video frame was created. In various embodiments of the present document, the first frame may be defined as a frame generated first in time in the corresponding original AV file. For example, the first audio frame may be an audio frame first generated when the corresponding recorder starts recording. The first video frame may be a video frame first created when the corresponding recorder starts recording. 7 and 8 show asynchronous frames that may adversely affect the synchronization of audio and video when the asynchronous section determination module 506 of FIG. 5 synthesizes original AV files 630 and 640 using timestamps. The operation of determining the interval is exemplified. Also, FIGS. 7 and 8 illustrate an operation of the editing module 505 of FIG. 5 generating one first composite AV file 750 by using the remaining frame sections except for the unsynchronized frame section. In addition, FIGS. 7 and 8 illustrate an operation of generating a second composite AV file 760 that further includes an asynchronous frame section and maintains synchronization between audio and video.

도 5를 참조하면, 전자 장치(500)는 카메라 모듈(501), 오디오 모듈(502), 기록 모듈(503), 시간 제공 모듈(504), 편집 모듈(505), 비동기화 구간 결정 모듈(506), 재생 모듈(507), 디스플레이(577), 메모리(588), 및 프로세서(599)를 포함할 수 있다. 다양한 실시예에서, 오디오와 비디오의 동기화가 이루어진 합성 AV 파일을 생성하기 위한 전자 장치(500)의 상기 구성 요소들은 서로 작동적으로 또는 전기적으로 연결될 수 있다. 일 실시예에 따르면, 모듈들(503-507)은 프로세서(599)(예: 도 1의 프로세서(120))에서 실행되는 프로그램 모듈일 수 있다. 예컨대, 모듈들(503-507) 중에서 적어도 하나는 메모리(588)(예: 도 1의 메모리(130))에 인스트럭션들(instructions)로 저장되고, 프로세서(599)(예: 도 1의 프로세서(120))에 의해 실행될 수 있다.Referring to FIG. 5 , the electronic device 500 includes a camera module 501, an audio module 502, a recording module 503, a time provision module 504, an editing module 505, and an asynchronous section determination module 506. ), a playback module 507, a display 577, a memory 588, and a processor 599. In various embodiments, the components of the electronic device 500 for generating a synthesized AV file in which audio and video are synchronized may be operatively or electrically connected to each other. According to one embodiment, modules 503 - 507 may be program modules executed on processor 599 (eg, processor 120 of FIG. 1 ). For example, at least one of the modules 503 to 507 is stored as instructions in a memory 588 (eg, the memory 130 of FIG. 1), and a processor 599 (eg, the processor of FIG. 1 ( 120)).

카메라 모듈(501)(예: 도 1의 카메라 모듈(180))은 복수의 카메라들(예: 도 4의 카메라들(405, 412))을 포함할 수 있다.The camera module 501 (eg, the camera module 180 of FIG. 1 ) may include a plurality of cameras (eg, the cameras 405 and 412 of FIG. 4 ).

오디오 모듈(502)(예: 도 1의 오디오 모듈(170))은 마이크(또는, 무선 통신 회로(예: 도 1의 무선 통신 모듈(192))를 통해 외부 장치(예: 무선 헤드셋))로부터 수신된 아날로그 오디오 신호를 프레임 단위의 디지털 오디오 신호로 변환하여 다른 구성 요소(예: 기록 모듈(503))로 출력할 수 있다. 오디오 모듈(502)은 다른 구성 요소(예: 프로세서(599))로부터 수신된 디지털 오디오 신호를 아날로그 오디오 신호로 변환하여 스피커(또는, 무선 통신 회로(예: 도 1의 무선 통신 모듈(192))를 통해 외부 장치)로 출력할 수 있다.The audio module 502 (eg, the audio module 170 of FIG. 1) receives a microphone (or an external device (eg, a wireless headset) through a wireless communication circuit (eg, the wireless communication module 192 of FIG. 1)). The received analog audio signal may be converted into a frame-unit digital audio signal and output to another component (eg, the recording module 503). The audio module 502 converts a digital audio signal received from another component (eg, the processor 599) into an analog audio signal, and converts a speaker (or a wireless communication circuit (eg, the wireless communication module 192 of FIG. 1)) into an analog audio signal. can be output to an external device).

기록 모듈(503)은 복수의 레코더들을 포함할 수 있다. 레코더들은 카메라의 개수만큼 기록 모듈(503)에 구비되어 카메라들에 각각 지정(예: 작동적으로 연결)될 수 있다. 레코더는 자신에게 지정된 카메라로부터 수신된 프레임 단위의 비디오 신호를 지정된 비디오 코덱을 이용하여 인코딩(예: 컨테이너(590)에 포함(또는, 저장)되는 비디오 파일의 압축 포맷(예: H.264, AV1)으로 인코딩)함으로써 비디오 파일을 생성할 수 있다. 또한, 레코더는 오디오 모듈(502)을 통해 마이크(510)로부터 수신된 프레임 단위의 오디오 신호를 지정된 오디오 코덱을 이용하여 인코딩(예: 컨테이너(590)에 포함되는 오디오 파일의 압축 포맷(예: AAC, MP3)으로 인코딩)함으로써 오디오 파일을 생성할 수 있다. 레코더는 생성된 오디오 파일과 비디오 파일을 하나의 AV 파일(또는, 컨테이너 파일)로 결합하여 컨테이너(590)에 포함시킬 수 있다. 컨테이너(590)에 저장되는 컨테이너 파일의 포맷은 예컨대 MP4일 수 있다. 다만, 이 포맷으로 한정되는 것은 아니고 AVI, MP4, MKV, MOV, 및 ASF 등 다양하다.The recording module 503 may include a plurality of recorders. Recorders may be provided in the recording module 503 as many as the number of cameras and may be assigned (eg, operatively connected) to each of the cameras. The recorder encodes the video signal in units of frames received from the camera designated for itself using the designated video codec (eg, the compression format of the video file included (or stored) in the container 590 (eg, H.264, AV1) ) to generate a video file. In addition, the recorder encodes the frame-unit audio signal received from the microphone 510 through the audio module 502 using a designated audio codec (eg, a compression format of an audio file included in the container 590 (eg, AAC) , MP3 encoding) to create an audio file. The recorder may combine the generated audio file and video file into one AV file (or container file) and include them in the container 590 . The format of the container file stored in the container 590 may be, for example, MP4. However, it is not limited to this format and various formats such as AVI, MP4, MKV, MOV, and ASF.

도 6을 참조하면, 카메라 모듈(501)은 제1 카메라(611)(예: 도 4a의 전면 카메라(405)) 및 제2 카메라(612(예: 도 4b의 후면 카메라(412))를 포함할 수 있다. 기록 모듈(503)은 제1 카메라(611)에서 촬영된 비디오를 기록하도록 지정된 제1 레코더(621) 및 제2 카메라(612)에서 촬영된 비디오를 기록하도록 지정된 제2 레코더(622)를 포함할 수 있다. 제1 레코더(621)와 제2 레코더(622)는 녹화 명령에 반응하여 오디오 타임스탬프 및 비디오 타임스탬프를 포함하는 AV 파일을 생성하고 생성된 AV 파일을 컨테이너(590)에 포함시킬 수 있다. 예컨대, 프로세서(599)는 제1 카메라(611)로부터 수신된 원본 이미지를 제1 프리뷰 이미지로 가공하여 디스플레이(577)에 표시할 수 있다. 프로세서(599)는 제2 카메라(612)로부터 수신된 원본 이미지를 제2 프리뷰 이미지로 가공하여 디스플레이(577)에 표시할 수 있다. 프로세서(599)는 디스플레이(577)의 화면을 분할하여 제1 프리뷰 이미지와 제2 프리뷰 이미지를 표시할 수 있다(화면 분할 방식). 또는, 프로세서(599)는 제1 프리뷰 이미지와 제2 프리뷰 이미지 중 하나를 다른 하나의 이미지 속에 표시할 수 있다 (PIP(picture in picture) 방식). 제1 프리뷰 이미지와 제2 프리뷰 이미지가 표시되는 동안, 사용자의 녹화 명령(예: 터치에 감응하는 디스플레이(577)에 표시된 녹화 버튼에 대한 터치 입력)이 레코더들(621, 622)에 전달될 수 있다.Referring to FIG. 6 , the camera module 501 includes a first camera 611 (eg, the front camera 405 of FIG. 4A ) and a second camera 612 (eg, the rear camera 412 of FIG. 4B ). The recording module 503 includes a first recorder 621 designated to record video taken by the first camera 611 and a second recorder 622 designated to record video taken by the second camera 612. ) The first recorder 621 and the second recorder 622 generate an AV file including an audio timestamp and a video timestamp in response to a recording command and store the created AV file in a container 590. For example, the processor 599 may process the original image received from the first camera 611 into a first preview image and display it on the display 577. The processor 599 may process the original image received from the first camera 611 into a first preview image and display it on the display 577. The original image received from 612 may be processed into a second preview image and displayed on the display 577. The processor 599 divides the screen of the display 577 to obtain a first preview image and a second preview image. (screen division method) Alternatively, the processor 599 can display one of the first preview image and the second preview image within the other image (picture in picture (PIP) method). While the preview image and the second preview image are displayed, a user's recording command (eg, a touch input to a record button displayed on the touch-sensitive display 577) may be transmitted to the recorders 621 and 622.

일 실시예에 따르면, 제1 레코더(621)는 제1 카메라(611)로부터 수신된 프레임 단위의 비디오 신호를 인코딩함으로써 비디오 프레임들(#0, #1, #2, …)(631)을 생성할 수 있다. 제1 레코더(621)는 제1 카메라(611)로부터 비디오 신호의 수신이 시작된 이후 오디오 모듈(502)로부터 수신된 프레임 단위의 오디오 신호를 인코딩함으로써 오디오 프레임들(#0, #1, #2, …)(632)을 생성할 수 있다. 제1 카메라(611)는 시점(t0)을 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제1 비디오 타임스탬프(633)를 생성할 수 있다. 제1 레코더(621)는 첫번째 비디오 프레임(#0)을 생성할 때 시점(t0)을 제1 카메라(611)로부터 전달받거나 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제1 비디오 타임스탬프(633)를 생성할 수 있다. 오디오 모듈(502)은 시점(t1)을 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제1 오디오 타임스탬프(634)를 생성할 수 있다. 제1 레코더(621)는 첫번째 오디오 프레임(#0)을 생성할 때 시점(t1)을 오디오 모듈(502)로부터 전달받거나 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제1 오디오 타임스탬프(634)를 생성할 수 있다. 제1 레코더(621)는 데이터(631-634)를 포함하는 제1 AV 파일(630)을 생성하여 컨테이너(590)에 저장할 수 있다. 제2 레코더(622)는 제2 카메라(612)로부터 수신된 프레임 단위의 비디오 신호를 인코딩함으로써 비디오 프레임들(#0, #1, #2, …)(641)을 생성할 수 있다. 제2 레코더(622)는 제2 카메라(612)로부터 비디오 신호의 수신이 시작된 이후 오디오 모듈(502)로부터 수신된 프레임 단위의 오디오 신호를 인코딩함으로써 오디오 프레임들(#0, #1, #2, …)(642)을 생성할 수 있다. 제2 카메라(612)는 시점(t2)을 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제2 비디오 타임스탬프(643)를 생성할 수 있다. 제2 레코더(622)는 첫번째 비디오 프레임(#0)을 생성할 때 시점(t2)을 제2 카메라(612)로부터 전달받거나 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제2 비디오 타임스탬프(643)를 생성할 수 있다. 오디오 모듈(502)은 시점(t3)을 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제2 오디오 타임스탬프(644)를 생성할 수 있다. 제2 레코더(622)는 첫번째 오디오 프레임(#0)을 생성할 때 시점(t3)을 오디오 모듈(502)로부터 전달받거나 시간 정보 제공 모듈(504)로부터 확인하고 확인된 시점을 나타내는 제2 오디오 타임스탬프(644)를 생성할 수 있다. 제2 레코더(622)는 데이터(641-644)를 포함하는 제2 AV 파일(640)을 생성하여 컨테이너(590)에 저장할 수 있다.According to an embodiment, the first recorder 621 generates video frames (#0, #1, #2, ...) 631 by encoding the video signal in units of frames received from the first camera 611. can do. The first recorder 621 encodes the audio signal in units of frames received from the audio module 502 after the video signal reception from the first camera 611 starts, thereby recording the audio frames #0, #1, #2, …) 632 can be created. The first camera 611 may check the time point t0 from the time information providing module 504 and generate a first video timestamp 633 indicating the checked time point. When the first video frame #0 is generated, the first recorder 621 receives the point of view t0 from the first camera 611 or checks it from the time information providing module 504 and identifies the first video frame #0. Timestamp 633 can be generated. The audio module 502 may check the time point t1 from the time information providing module 504 and generate a first audio timestamp 634 indicating the checked time point. When the first audio frame #0 is generated, the first recorder 621 receives the time point t1 from the audio module 502 or checks it from the time information providing module 504, and the first audio time indicating the checked time point. A stamp 634 may be created. The first recorder 621 may generate and store the first AV file 630 including the data 631 to 634 in the container 590 . The second recorder 622 may generate video frames (#0, #1, #2, ...) 641 by encoding the video signal received from the second camera 612 in units of frames. The second recorder 622 encodes the frame-by-frame audio signal received from the audio module 502 after receiving the video signal from the second camera 612 so that the audio frames #0, #1, #2, …) 642 can be created. The second camera 612 may check the time point t2 from the time information providing module 504 and generate a second video timestamp 643 indicating the checked time point. When the first video frame #0 is generated, the second recorder 622 receives the viewpoint t2 from the second camera 612 or checks it from the time information providing module 504 and confirms the second video frame #0. Timestamp 643 can be generated. The audio module 502 may check the time point t3 from the time information providing module 504 and generate a second audio timestamp 644 indicating the checked time point. When the first audio frame #0 is generated, the second recorder 622 receives the time point t3 from the audio module 502 or checks it from the time information providing module 504, and the second audio time indicating the checked time point. A stamp 644 may be created. The second recorder 622 may generate and store the second AV file 640 including the data 641 to 644 in the container 590 .

시간 정보 제공 모듈(504)은 기준 시각(예: 녹화 명령이 발생된 시각)으로부터 경과한 시간을 나타내는 수치를, 전자 장치(500)의 컴퓨팅 시스템(예: 프로세서(599))에서 클럭(clock) 신호를 발생하는 타이밍에 맞춰, 발생할 수 있다. 기록 모듈(503)의 레코더들은 첫번째 프레임을 생성하는 동안 시간 정보 제공 모듈(504)로부터 수신된 수치를 첫번째 프레임이 생성된 시점을 나타내는 타임스탬프로 설정할 수 있다. The time information providing module 504 converts a numerical value representing time elapsed from a reference time (eg, the time at which a recording command is issued) to a clock in a computing system (eg, the processor 599) of the electronic device 500. It can occur according to the timing of generating the signal. The recorders of the recording module 503 may set the number received from the time information providing module 504 while generating the first frame as a timestamp representing the time when the first frame was created.

컨테이너(590)에 포함되는 AV 파일(또는, 컨테이너 파일)은 도 6에 도시된 바와 같이, 헤더 영역(또는, 헤더 필드(field)), 데이터 영역(또는, 데이터 필드), 및 예비(reserved) 영역(또는, 예비 필드)으로 분류될 수 있다. 레코더에 의해 생성된 오디오 프레임과 비디오 프레임이 데이터 영역에 포함될 수 있다. 레코더에 의해 생성된 오디오/비디오 타임스탬프가 예비 영역에 포함될 수 있다. 헤더 영역에는 데이터 영역 내 프레임들과 관련된 메타 데이터를 포함할 수 있다. 예를 들어, 오디오 메타 데이터는 오디오 신호의 암호화 및/또는 복호화에 이용되는 코덱에 대한 코덱 정보 및 오디오 신호를 구성하는 포맷과 관련된 포맷 정보를 포함할 수 있다. 예를 들어, 오디오 포맷 정보는 비트 레이트(bit rate), 샘플링 레이트(sampling rate), 비트 뎁스(bit depth), 또는 채널 수를 포함할 수 있다. 비트 레이트는 단위 시간(예: 1초)마다 처리되는 비트의 수로 정의되며 그 단위는 bps(bit per second)가 사용될 수 있다. 비트 레이트가 클수록 해당 파일은 높은 품질(예: 충실도)을 갖는 것으로 이해될 수 있다. 비트 뎁스는, 이미지의 해상도에 대응하는 것으로서, 오디오 신호의 진폭을 얼마나 자세하게 표현할 수 있는지를 나타내는 지표일 수 있고 그 단위는 비트(bit)일 수 있다. 비트 뎁스가 클수록 해당 오디오 파일은 고해상도로 이해될 수 있다. 채널 수는 예컨대, 1(모노), 2(스테레오), 5.1, 또는 7.1일 수 있다. 샘플링 레이트는 원음(예: 마이크에서 출력된 아날로그 오디오 신호)에서 단위 시간(예: 1초) 당 추출된 샘플(sample)(본 문서에서 프레임으로 바꿔 표현될 수 있음)의 수로 정의될 수 있고 그 단위는 헤르츠(Hz)일 수 있다. 예컨대, 샘플링 레이트(본 문서에서 프레임 레이트로 바꿔 표현될 수 있음)가 48kHz 인 경우, 해당 오디오 파일에는 초당 48000개의 오디오 프레임이 존재할 수 있다. 비디오 메타 데이터는 비트 레이트 및 프레임 레이트를 포함할 수 있다. 비디오의 비트 레이트는 앞서 정의된 바와 같으며, 프레임 레이트는 단위 시간(예: 1초) 당 화면을 통해 보여지는 프레임의 수를 나타낼 수 있다. 예컨대, 프레임 레이트가 60Hz인 경우, 해당 비디오 파일에는 초당 60개의 비디오 프레임이 존재할 수 있다.As shown in FIG. 6, the AV file (or container file) included in the container 590 includes a header area (or header field), a data area (or data field), and a reserved It can be classified into areas (or preliminary fields). An audio frame and a video frame generated by the recorder may be included in the data area. An audio/video timestamp generated by the recorder may be included in the reserve area. The header area may include meta data related to frames in the data area. For example, the audio meta data may include codec information about a codec used to encrypt and/or decode an audio signal and format information related to a format constituting the audio signal. For example, the audio format information may include bit rate, sampling rate, bit depth, or number of channels. The bit rate is defined as the number of bits processed per unit time (eg, 1 second), and the unit may be bps (bits per second). It can be understood that the higher the bit rate, the higher the quality (eg, fidelity) of the corresponding file. The bit depth corresponds to the resolution of an image, and may be an index indicating how detailed the amplitude of an audio signal can be expressed, and its unit may be bits. As the bit depth increases, the corresponding audio file can be understood with higher resolution. The number of channels may be, for example, 1 (mono), 2 (stereo), 5.1, or 7.1. The sampling rate can be defined as the number of samples (which can be expressed as frames in this document) extracted per unit time (eg, 1 second) from the original sound (eg, an analog audio signal output from a microphone), and The unit may be hertz (Hz). For example, if the sampling rate (which may be expressed as a frame rate in this document) is 48 kHz, there may be 48000 audio frames per second in the audio file. Video meta data may include bit rate and frame rate. The bit rate of video is as defined above, and the frame rate may indicate the number of frames displayed on the screen per unit time (eg, 1 second). For example, when the frame rate is 60 Hz, 60 video frames per second may exist in the corresponding video file.

도 6에 도시된 바는 카메라 및 레코더가 2개씩이지만 그 이상의 카메라와 레코더가 존재할 수도 있다. 예를 들어, 전자 장치(500)에 제3 카메라와 제3 카메라에서 촬영된 비디오를 기록하도록 지정된 제3 레코더가 더 포함될 수 있다. 제3 레코더는 녹화 명령에 반응하여, 제1 레코더(621) 및 제2 레코더(622)가 수행하는 바와 동일하게, 오디오 타임스탬프 및 비디오 타임스탬프를 포함하는 AV 파일을 생성하여 컨테이너(590)에 저장할 수 있다.6 shows two cameras and two recorders, but more cameras and recorders may exist. For example, the electronic device 500 may further include a third camera and a third recorder designated to record video captured by the third camera. In response to the recording command, the third recorder generates an AV file including an audio timestamp and a video timestamp and stores it in the container 590, in the same way as the first recorder 621 and the second recorder 622 do. can be saved

도시되지는 않지만, AV 파일은 다른 AV 파일과 연관성을 나타내는 정보를 포함할 수 있다. 예를 들어, 프로세서(599)는 제1 AV 파일(630)과 제2 AV 파일(640) 간의 관계가 동일한 녹화 명령에 대한 레코딩 결과로서 생성된 것임을 나타내는 연관성 정보(예: 상대방 파일 명, 기록 일시, 녹화 모드(예: director’s view))를 각각의 AV 파일(예: 헤더 영역이나 예비 영역)에 포함하여 컨테이너(590)에 저장할 수 있다.Although not shown, an AV file may include information indicating association with other AV files. For example, the processor 599 may provide association information indicating that the relationship between the first AV file 630 and the second AV file 640 is generated as a recording result for the same recording command (eg, the other party's file name, recording date and time). , a recording mode (eg, director's view)) may be included in each AV file (eg, a header area or a spare area) and stored in the container 590 .

동일한 녹화 명령에 대한 레코딩 결과로서 생성된 원본 AV 파일들(예: 제1 AV 파일(630)과 제2 AV 파일(640))은 AV 싱크 일치된 하나의 AV 파일로 합성될(composed) 수 있다. Original AV files (e.g., the first AV file 630 and the second AV file 640) generated as a result of recording for the same recording command may be composed into one AV file synchronized with AV. .

도 7을 참조하면, 편집 모듈(505)은 컨테이너(590)에 포함된 AV 파일들 중에서 합성할 AV 파일들을 선택할 수 있다. 예를 들어, 편집 모듈(505)은 컨테이너(590)를 통해 레코더들(621, 622)로부터 수신된 제1 AV 파일(630)과 제2 AV 파일(640)을 합성 대상 파일로 결정할 수 있다. 다른 예로, 편집 모듈(505)은 컨테이너(590)에 포함된 각 AV 파일에 기록된 연관성 정보에 기초하여, 제1 AV 파일(630)과 제2 AV 파일(640)이 동일한 녹화 명령에 대한 레코딩 결과로서 생성된 것임을 인지하고 이에 따라 제1 AV 파일(630)과 제2 AV 파일(640)을 합성 대상 파일로 결정할 수 있다. 비동기화 구간 결정 모듈(506)은 합성 대상 파일들에서 오디오/비디오 타임스탬프를 확인하고, 확인 결과에 기반하여, 각 합성 대상 파일의 오디오/비디오 프레임들 중에서 AV 싱크 불일치를 야기할 것으로 예상되는 오디오/비디오 프레임(비동기화 오디오/비디오 프레임)을 결정할 수 있다. 편집 모듈(505)는 비동기화 프레임을 제외한 나머지 프레임들을 이용하여, 비디오 프레임들(#0, #1, #2, …)(751)과 오디오 프레임들(#0, #1, #2, …)(752)을 포함하는 제1 합성 AV 파일(750)을 생성하고 컨테이너(590)에 포함시킬 수 있다.Referring to FIG. 7 , the editing module 505 may select AV files to be synthesized from among AV files included in a container 590 . For example, the editing module 505 may determine the first AV file 630 and the second AV file 640 received from the recorders 621 and 622 through the container 590 as files to be synthesized. As another example, the editing module 505 records the first AV file 630 and the second AV file 640 for the same recording command based on the association information recorded in each AV file included in the container 590. It is recognized that they are generated as a result, and accordingly, the first AV file 630 and the second AV file 640 may be determined as files to be synthesized. The non-synchronization section determination module 506 checks audio/video timestamps in the synthesis target files, and based on the check result, the audio expected to cause AV sync mismatch among the audio/video frames of each synthesis target file. /Can determine video frames (non-synchronized audio/video frames). The editing module 505 uses the remaining frames except for the unsynchronized frames to create video frames (#0, #1, #2, ...) 751 and audio frames (#0, #1, #2, ...). ) 752 may be created and included in the container 590 .

비동기화 구간 결정 모듈(506)은 합성 대상 파일들에서 비디오 타임스탬프들을 확인하고, 상대적으로 값이 큰(예를 들어, 시간적으로 가장 늦게 생성된) 비디오 타임스탬프를 비동기화 비디오 프레임을 거르기 위한 제1 기준 시점으로 결정할 수 있다. 도 8에 예시된 바에 따르면, 제1 비디오 타임스탬프(t0)와 제2 비디오 타임스탬프(t2) 중에서 상대적으로 값이 큰 제2 비디오 타임스탬프(t2)가 제1 기준 시점으로 결정될 수 있다. 비동기화 구간 결정 모듈(506)은 합성 대상 파일들에서 오디오 타임스탬프들을 확인하고, 상대적으로 값이 큰(예를 들어, 시간적으로 가장 늦게 생성된) 오디오 타임스탬프를 비동기화 오디오 프레임을 거르기 위한 제2 기준 시점으로 결정할 수 있다. 도 8에 예시된 바에 따르면, 제1 오디오 타임스탬프(t1)와 제2 오디오 타임스탬프(t3) 중에서 상대적으로 값이 큰 제2 오디오 타임스탬프(t3)가 제2 기준 시점으로 결정될 수 있다. 또는, 비동기화 구간 결정 모듈(506)은 합성 대상 파일들에서 비디오 타임스탬프와 오디오 타임스탬프를 확인하고, 상대적으로 값이 큰(예를 들어, 시간적으로 가장 늦게 생성된) 타임스탬프를 비동기화 비디오 프레임과 비동기화 오디오 프레임을 거르기 위한 기준 시점으로 결정할 수 있다. 도 8에 예시된 바에 따르면, 제1 비디오 타임스탬프(t0), 제2 비디오 타임스탬프(t1), 제1 오디오 타임스탬프(t2) 및 제2 오디오 타임스탬프(t3) 중에서 제2 오디오 타임스탬프(t3)가 제3 기준 시점으로 결정될 수 있다.The unsynchronized section determination module 506 checks the video timestamps in the synthesized files, and selects the video timestamp having a relatively large value (eg, generated last in time) as a filter for filtering out the unsynchronized video frames. 1 can be determined as a reference point. As illustrated in FIG. 8 , a second video timestamp t2 having a relatively greater value among the first video timestamp t0 and the second video timestamp t2 may be determined as the first reference point in time. The unsynchronized section determination module 506 checks the audio timestamps in the synthesis target files, and selects the audio timestamp having a relatively large value (eg, generated last in time) as a factor for filtering out the unsynchronized audio frames. 2 can be determined as the reference point. As illustrated in FIG. 8 , among the first audio timestamp t1 and the second audio timestamp t3, the second audio timestamp t3 having a relatively large value may be determined as the second reference time point. Alternatively, the unsynchronized section determination module 506 checks the video timestamp and the audio timestamp in the synthesis target files, and selects a timestamp having a relatively large value (eg, generated last in time) for the unsynchronized video. It may be determined as a reference point in time for filtering out frames and unsynchronized audio frames. As illustrated in FIG. 8 , among the first video timestamp t0, the second video timestamp t1, the first audio timestamp t2, and the second audio timestamp t3, the second audio timestamp ( t3) may be determined as the third reference point in time.

비동기화 구간 결정 모듈(506)은 합성 대상 파일들에서 제1 기준 시점 또는 제3 기준 시점보다 먼저 생성된 비디오 프레임들을 비디오 합성에서 제외될 비동기화 비디오 프레임 구간으로 분류할 수 있다. 비동기화 구간 결정 모듈(506)은 제1 기준 시점 또는 제3 기준 시점보다 앞서 생성된 비디오 프레임들을, 해당 비디오 파일의 프레임 레이트에 기반하여, 식별할 수 있다. 예를 들어, 비동기화 구간 결정 모듈(506)은 하나의 비디오 프레임이 디스플레이를 통해 이미지로 재생되는 시간(이하, 비디오 프레임 시간)을 프레임 레이트에 기반하여 계산하고, 첫번째(#0) 비디오 프레임부터 n번째(#n) 비디오 프레임까지 비디오 프레임 시간을 누적할 수 있다. 누적 시간이 차분치((t2 ? t0) 또는 (t3 ? t0))를 초과할 경우, 첫번째 비디오 프레임부터 n-1 번째 비디오 프레임까지 비동기화 비디오 프레임들로 분류할 수 있다. 도 8에 예시된 바에 따르면, 비동기화 구간 결정 모듈(506)은 제1 AV 파일(630)의 비디오 프레임들(631) 중에 제1 기준 시점으로 결정된 제2 비디오 타임스탬프(t2) 보다 먼저 생성된 비디오 프레임들(810)을 비동기화 비디오 프레임 구간으로 분류할 수 있다. 또는, 비동기화 구간 결정 모듈(506)은 제1 AV 파일(630)의 비디오 프레임들(631) 중에 제3 기준 시점으로 결정된 제2 오디오 타임스탬프(t3)보다 먼저 생성된 비디오 프레임들(811)을 비동기화 비디오 프레임 구간으로 분류할 수도 있다. 또한 비동기화 구간 결정 모듈(506)은 제2 AV 파일(640)의 비디오 프레임들(641) 중에 제3 기준 시점으로 결정된 제2 오디오 타임스탬프(t3)보다 먼저 생성된 비디오 프레임들(812)을 비동기화 비디오 프레임 구간으로 분류할 수도 있다. The unsynchronized section determination module 506 may classify video frames generated prior to the first reference point of view or the third reference point of time in the synthesis target files as an asynchronous video frame section to be excluded from video composition. The unsynchronized section determination module 506 may identify video frames generated prior to the first reference point of view or the third reference point of view, based on the frame rate of the corresponding video file. For example, the asynchronous section determination module 506 calculates the time during which one video frame is reproduced as an image through a display (hereinafter referred to as video frame time) based on the frame rate, and from the first (#0) video frame The video frame time can be accumulated up to the nth (#n) video frame. When the accumulated time exceeds the difference value (t2 ? t0) or (t3 ? t0), the first video frame to the n-1 th video frame may be classified as asynchronous video frames. As illustrated in FIG. 8 , the non-synchronized section determination module 506 generates a video frame 631 of the first AV file 630 earlier than the second video timestamp t2 determined as the first reference point in time. The video frames 810 may be classified as unsynchronized video frame intervals. Alternatively, the unsynchronized section determining module 506 determines the video frames 811 generated earlier than the second audio timestamp t3 determined as the third reference point among the video frames 631 of the first AV file 630. may be classified as an asynchronous video frame interval. Also, the non-synchronized section determination module 506 selects video frames 812 generated earlier than the second audio timestamp t3 determined as the third reference point among the video frames 641 of the second AV file 640. It can also be classified as an asynchronous video frame section.

비동기화 구간 결정 모듈(506)은 합성 대상 파일들에서 제2 기준 시점(또는, 제3 기준 시점)보다 먼저 생성된 오디오 프레임을 오디오 합성에서 제외될 비동기화 오디오 프레임 구간으로 분류할 수 있다. 비동기화 구간 결정 모듈(506)은 제2 기준 시점보다 앞서 생성된 오디오 프레임들을, 해당 오디오 파일의 샘플링 레이트에 기반하여, 식별할 수 있다. 예를 들어, 비동기화 구간 결정 모듈(506)은 하나의 오디오 프레임이 스피커를 통해 소리로 재생되는 시간(이하, 오디오 프레임 시간)을 샘플링 레이트에 기반하여 계산하고, 첫번째(#0) 오디오 프레임부터 m번째(#m) 오디오 프레임까지 오디오 프레임 시간을 누적할 수 있다. 누적 시간이 차분치(t3 ? t1)를 초과할 경우, 첫번째 오디오 프레임부터 m-1 번째 오디오 프레임까지 비동기화 오디오 프레임들로 분류할 수 있다. 도 8에 예시된 바에 따르면, 비동기화 구간 결정 모듈(506)은 제1 AV 파일(630)의 오디오 프레임들(632) 중에 제2 기준 시점으로 결정된 제2 오디오 타임스탬프(t3)보다 먼저 생성된 오디오 프레임들을 비동기화 오디오 프레임 구간(820)으로 분류할 수 있다. 앞서 예시된 ‘구간’ 이란 표현은 기간(period) 또는 시간(time) 등으로 바꿔 표현될 수도 있다. 예를 들어, 비동기화 구간 대신 비동기화 기간으로 표현될 수도 있다.The unsynchronized section determination module 506 may classify an audio frame generated earlier than the second reference point in time (or the third reference point in time) in synthesis target files as an asynchronous audio frame section to be excluded from audio synthesis. The unsynchronized section determination module 506 may identify audio frames generated prior to the second reference point in time based on the sampling rate of the corresponding audio file. For example, the asynchronous section determination module 506 calculates the time during which one audio frame is reproduced as sound through a speaker (hereinafter referred to as audio frame time) based on the sampling rate, and from the first (#0) audio frame Audio frame times may be accumulated up to the mth (#m) audio frame. When the accumulated time exceeds the difference value (t3 − t1), the first audio frame to the m−1 th audio frame may be classified as asynchronous audio frames. As illustrated in FIG. 8 , the non-synchronized section determining module 506 generates an audio frame 632 of the first AV file 630 earlier than the second audio timestamp t3 determined as the second reference point in time. Audio frames may be classified into unsynchronized audio frame intervals 820 . The expression 'interval' exemplified above may be expressed as a period or time. For example, it may be expressed as an asynchronous period instead of an unsynchronized period.

일 실시예에 따르면, 편집 모듈(505)은 비디오 프레임들(631) 중에서 비동기화 비디오 프레임 구간(810)을 제외한 나머지 프레임 구간 내 비디오 프레임들을, 비디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제1 비디오 신호로 압축 해제할 수 있다. 편집 모듈(505)는 제2 AV 파일(640)의 비디오 프레임들(641)을, 비디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제2 비디오 신호로 압축 해제할 수 있다. 편집 모듈(505)는 제1 비디오 신호와 제2 비디오 신호를 프레임 단위로 합성함으로써 하나의 제3 비디오 신호를 획득할 수 있다. 편집 모듈(505)는 제3 비디오 신호를 비디오 압축 해제 시 이용된 코덱을 이용하여 프레임 단위로 인코딩함으로써 비디오 프레임들(#0, #1, #2, …)(751)을 생성할 수 있다. 편집 모듈(505)은 오디오 프레임들(632) 중에서 비동기화 오디오 프레임 구간(820)을 제외한 나머지 프레임 구간 내 오디오 프레임들을, 오디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제1 오디오 신호로 압축 해제할 수 있다. 편집 모듈(505)는 제2 AV 파일(640)의 오디오 프레임들(642)을, 오디오 압축 때 이용된 코덱 혹은 다른 종류의 코덱을 이용하여, 재생 가능한 제2 오디오 신호로 압축 해제할 수 있다. 편집 모듈(505)는 제1 오디오 신호와 제2 오디오 신호를 프레임 단위로 합성함으로써 하나의 제3 오디오 신호를 획득할 수 있다. 편집 모듈(505)는 제3 오디오 신호를 오디오 압축 해제 시 이용된 코덱 혹은 다른 종류의 코덱을 이용하여 프레임 단위로 인코딩함으로써 오디오 프레임들(#0, #1, #2, …)(752)을 생성할 수 있다. 편집 모듈(505)은 비디오 프레임들(751)과 오디오 프레임들(752)을 포함하는 제1 합성 AV 파일(750)을 생성하여 컨테이너(590)에 저장할 수 있다. 추가적으로, 편집 모듈(505)은 제1 합성 AV 파일(750)에서 첫번째 합성 프레임의 출처인 첫번째 원본 프레임들이 생성된 시점을 나타내는 정보로서 예컨대, 제2 비디오 타임스탬프(643)와 제2 오디오 타임스탬프(644)를 합성 AV 파일(750)의 예비 영역에 더 포함하여 컨테이너(590)에 저장할 수 있다. 도시되지는 않지만, 편집 모듈(505)은 제1 합성 AV 파일(750)의 출처인 원본 파일을 나타내는 정보로서 예컨대, AV 파일들(630, 640)의 파일 명을 제1 합성 AV 파일(750)의 예비 영역 또는 헤더 영역에 더 포함하여 컨테이너(590)에 저장할 수 있다.According to one embodiment, the editing module 505 reproduces the video frames in the remaining frame intervals except for the asynchronous video frame interval 810 among the video frames 631 using a codec used for video compression. 1 can be decompressed into a video signal. The editing module 505 may decompress the video frames 641 of the second AV file 640 into a reproducible second video signal using a codec used for video compression. The editing module 505 may obtain one third video signal by combining the first video signal and the second video signal frame by frame. The editing module 505 may generate video frames (#0, #1, #2, ...) 751 by encoding the third video signal frame by frame using a codec used for video decompression. The editing module 505 decompresses the audio frames in the frame section other than the unsynchronized audio frame section 820 from among the audio frames 632 into a first audio signal that can be reproduced using a codec used for audio compression. can do. The editing module 505 may decompress the audio frames 642 of the second AV file 640 into a reproducible second audio signal using a codec used for audio compression or another type of codec. The editing module 505 may obtain one third audio signal by synthesizing the first audio signal and the second audio signal frame by frame. The editing module 505 converts the audio frames (#0, #1, #2, ...) 752 by encoding the third audio signal frame by frame using the codec used for audio decompression or another type of codec. can create The editing module 505 may generate a first composite AV file 750 including video frames 751 and audio frames 752 and store it in a container 590 . Additionally, the editing module 505 generates information representing the time when the first original frames, which are the sources of the first synthesized frame, in the first synthesized AV file 750 are generated, for example, the second video timestamp 643 and the second audio timestamp. 644 may be further included in the spare area of the composite AV file 750 and stored in the container 590. Although not shown, the editing module 505 converts, for example, the file names of the AV files 630 and 640 into the first composite AV file 750 as information representing an original file that is the source of the first composite AV file 750 . It can be stored in the container 590 by further including it in the spare area or header area of the .

일 실시예에 따르면, 편집 모듈(505)은, 비동기화 프레임 구간을 더 포함하되, 오디오 및 비디오의 동기화가 유지되는 제2 합성 AV 파일(760)을 생성할 수도 있다. 예를 들어, 편집 모듈(505)은 비동기화 비디오 프레임 구간(810)을, 비디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제4 비디오 신호로 압축 해제할 수 있다. 편집 모듈(505)은 제4 비디오 신호와 제3 비디오 신호를 시간 순으로 이어 붙여서 제5 비디오 신호를 생성하고, 제5 비디오 신호를 비디오 압축 해제 시 이용된 코덱 혹은 다른 종류의 코덱을 이용하여 프레임 단위로 인코딩함으로써 비디오 프레임들(761)을 생성할 수 있다. 편집 모듈(505)은 비동기화 오디오 프레임 구간(820)을, 오디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제4 오디오 신호로 압축 해제할 수 있다. 편집 모듈(505)은 제4 오디오 신호와 제3 오디오 신호를 시간 순으로 이어 붙여서 제5 오디오 신호를 생성하고, 제5 오디오 신호를 오디오 압축 해제 시 이용된 코덱을 이용하여 프레임 단위로 인코딩함으로써 오디오 프레임들(762)을 생성할 수 있다. 편집 모듈(505)은 비디오 프레임들(761)과 오디오 프레임들(762)을 포함하는 제2 합성 AV 파일(7650)을 생성하여 컨테이너(590)에 저장할 수 있다.According to an embodiment, the editing module 505 may generate a second composite AV file 760 that further includes an asynchronous frame section and maintains synchronization of audio and video. For example, the editing module 505 may decompress the asynchronous video frame section 810 into a reproducible fourth video signal using a codec used for video compression. The editing module 505 generates a fifth video signal by concatenating the fourth video signal and the third video signal in chronological order, and frames the fifth video signal using a codec used in video decompression or another type of codec. The video frames 761 may be generated by encoding in units. The editing module 505 may decompress the unsynchronized audio frame section 820 into a reproducible fourth audio signal using a codec used for audio compression. The editing module 505 generates a fifth audio signal by concatenating the fourth audio signal and the third audio signal in chronological order, encodes the fifth audio signal in frame units using a codec used for audio decompression, and then encodes the audio signal. Frames 762 may be created. The editing module 505 may generate and store the second composite AV file 7650 including the video frames 761 and the audio frames 762 in the container 590 .

재생 모듈(507)은 컨테이너(590)에 포함(또는, 저장)된 AV 파일을 재생하여 출력 인터페이스를 통해 출력할 수 있다. 일례로, 재생 모듈(507)은 제1 합성 AV 파일(750)에서 비디오 프레임들(751)을, 비디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제3 비디오 신호로 압축 해제할 수 있다. 재생 모듈(507)은 제1 합성 AV 파일(750)에서 오디오 프레임들(752)를, 오디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제3 오디오 신호로 압축 해제할 수 있다. 재생 모듈(507)은 제3 비디오 신호를 디스플레이(577)로 출력하고 제3 오디오 신호를 오디오 모듈(502)을 통해 스피커로 출력할 수 있다. 이에 따라 제3 비디오 신호와 제3 오디오 신호는 각각, 디스플레이와 스피커를 통해 이미지와 소리로 재생될 수 있다. 다른 예로, 재생 모듈(507)은 제2 합성 AV 파일(760)에서 비디오 프레임들(761)을, 비디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제5 비디오 신호로 압축 해제할 수 있다. 재생 모듈(507)은 제2 합성 AV 파일(760)에서 오디오 프레임들(762)를, 오디오 압축 때 이용된 코덱을 이용하여, 재생 가능한 제5 오디오 신호로 압축 해제할 수 있다. 재생 모듈(507)은 제5 비디오 신호를 디스플레이(577)로 출력하고 제5 오디오 신호를 오디오 모듈(502)을 통해 스피커로 출력할 수 있다. 이에 따라 제5 비디오 신호와 제5 오디오 신호는 각각, 디스플레이와 스피커를 통해 이미지와 소리로 재생될 수 있다.The playback module 507 may play an AV file included (or stored) in the container 590 and output it through an output interface. For example, the playback module 507 may decompress the video frames 751 in the first composite AV file 750 into a playable third video signal using a codec used for video compression. The playback module 507 may decompress the audio frames 752 in the first composite AV file 750 into a playable third audio signal using a codec used for audio compression. The playback module 507 may output a third video signal to the display 577 and output a third audio signal to a speaker through the audio module 502 . Accordingly, the third video signal and the third audio signal may be reproduced as images and sounds through the display and the speaker, respectively. As another example, the playback module 507 may decompress the video frames 761 of the second composite AV file 760 into a playable fifth video signal using a codec used for video compression. The reproducing module 507 may decompress the audio frames 762 of the second composite AV file 760 into a reproducible fifth audio signal using a codec used for audio compression. The playback module 507 may output a fifth video signal to the display 577 and output a fifth audio signal to a speaker through the audio module 502 . Accordingly, the fifth video signal and the fifth audio signal may be reproduced as images and sounds through the display and the speaker, respectively.

도 9는, 일 실시예에 따른, 비동기화 프레임 구간을 결정하기 위한 동작들을 설명하기 위한 흐름도이다. 도 9의 동작들은 프로세서(599)가 도 5의 비동기화 구간 결정 모듈(506)을 이용하여 또는 비동기화 구간 설정 모듈(506)에 구성된 기능과 동일한 기능을 수행함으로써 구현될 수 있다.9 is a flowchart illustrating operations for determining an unsynchronized frame period, according to an exemplary embodiment. The operations of FIG. 9 may be implemented by the processor 599 using the asynchronous section determination module 506 of FIG. 5 or by performing the same function as that configured in the asynchronous section setting module 506 .

동작 910에서 프로세서(599)는 합성되어야 할 비디오 파일들에서 비디오 타임스탬프들을 확인하고, 상대적으로 값이 큰(예를 들어, 시간적으로 가장 늦게 생성된) 비디오 타임스탬프와 상대적으로 값이 작은(예를 들어, 시간적으로 가장 먼저 생성된) 비디오 타임스탬프 간의 비디오 차분치(또는, 제1 차분치)를 계산할 수 있다. 프로세서(599)는 합성되어야 할 오디오 파일들에서 오디오 타임스탬프들을 확인하고, 상대적으로 값이 큰(예를 들어, 시간적으로 가장 늦게 생성된) 오디오 타임스탬프와 상대적으로 값이 작은(예를 들어, 시간적으로 가장 먼저 생성된) 오디오 타임스탬프 간의 오디오 차분치(또는, 제2 차분치)를 계산할 수 있다. 또는 프로세서(599)는 비디오 타임스탬프들과 오디오 타임스탬프들 중에서 값이 상대적으로 큰 타임스탬프와 상대적으로 값이 작은 비디오 타임스탬프 간의 차이를 나타내는 비디오 차분치(또는, 제1 차분치)를 계산할 수 있다. 프로세서(599)는 비디오 타임스탬프들과 오디오 타임스탬프들 중에서 값이 상대적으로 큰 타임스탬프와 값이 상대적으로 작은 오디오 타임스탬프 간의 차이를 나타내는 오디오 차분치(또는, 제2 차분치)를 계산할 수 있다.In operation 910, the processor 599 checks video timestamps in the video files to be synthesized, and a video timestamp with a relatively large value (eg, generated last in time) and a video timestamp with a relatively small value (eg, generated late in time). For example, a video difference value (or a first difference value) between video timestamps (generated first temporally) may be calculated. The processor 599 checks audio timestamps in the audio files to be synthesized, and determines audio timestamps with a relatively large value (eg, generated last in time) and audio timestamps with a relatively small value (eg, generated last in time). An audio difference value (or a second difference value) between audio timestamps (generated first in terms of time) may be calculated. Alternatively, the processor 599 may calculate a video difference value (or a first difference value) representing a difference between a timestamp having a relatively large value and a video timestamp having a relatively small value among video timestamps and audio timestamps. there is. The processor 599 may calculate an audio difference value (or a second difference value) representing a difference between a timestamp having a relatively large value and an audio timestamp having a relatively small value among video timestamps and audio timestamps. .

동작 920에서 프로세서(599)는 합성되어야 할 비디오 파일들 중 하나를 선택하고 선택된 비디오 파일에서 첫번째 비디오 프레임부터 n번째 비디오 프레임까지 비디오 프레임 시간을 누적할 수 있다. 프로세서(599)는 합성되어야 할 오디오 파일들 중에서 하나를 선택하고 선택된 오디오 파일에서 첫번째 오디오 프레임부터 m번째 오디오 프레임까지 오디오 프레임 시간을 누적할 수 있다.In operation 920, the processor 599 may select one of the video files to be synthesized and accumulate video frame times from the first video frame to the n-th video frame in the selected video file. The processor 599 may select one of the audio files to be synthesized and accumulate the audio frame time from the first audio frame to the m-th audio frame in the selected audio file.

동작 930에서 프로세서(599)는, 비디오 프레임 누적 시간이 비디오 차분치를 초과한 것에 기반하여, 첫번째 비디오 프레임부터 n-1번째 비디오 프레임 까지를 선택된 비디오 파일에서 비동기화 비디오 프레임 구간으로 분류할 수 있다. 프로세서(599)는, 오디오 프레임 누적 시간이 오디오 차분치를 초과한 것에 기반하여, 첫번째 오디오 프레임부터 m-1번째 오디오 프레임 까지를 선택된 오디오 파일에서 비동기화 오디오 프레임 구간으로 분류할 수 있다.In operation 930, the processor 599 may classify the first video frame to the n−1 th video frame as an asynchronous video frame period in the selected video file, based on the video frame accumulation time exceeding the video difference value. The processor 599 may classify the first audio frame to the m−1 th audio frame as an asynchronous audio frame section in the selected audio file, based on the fact that the audio frame accumulation time exceeds the audio difference value.

동작 920과 930은 합성되어야 할 나머지 파일들에 대해서도 동일하게 적용될 수 있다.Operations 920 and 930 may be equally applied to the remaining files to be synthesized.

도 10은, 일 실시예에 따른, AV 싱크 일치된 합성 AV 파일을 생성하기 위한 동작들을 설명하기 위한 흐름도이다. 도 10의 동작들은 프로세서(599)가 도 5의 모듈들(503, 504, 505, 506)을 이용하여 또는 모듈들(503, 504, 505, 506)에 구성된 기능과 동일한 기능을 수행함으로써 구현될 수 있다.10 is a flowchart illustrating operations for generating an AV sync-matched composite AV file, according to an embodiment. The operations of FIG. 10 may be implemented by the processor 599 using the modules 503, 504, 505, and 506 of FIG. 5 or by performing the same functions as those configured in the modules 503, 504, 505, and 506. can

동작 1010에서 프로세서(599)는 전자 장치(500)의 제1카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제1 비디오 파일을 생성하고, 전자 장치(500)의 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제1 오디오 파일을 생성하고, 제1 비디오 파일과 제1 오디오 파일을 제1 AV 파일로 결합하여 메모리(588)에 저장할 수 있다. 제1 비디오 파일과 제1 오디오 파일을 생성하는 동안, 프로세서(599)는 전자 장치(500)의 제2 카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제2 비디오 파일을 생성하고, 전자 장치(500)의 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제2 오디오 파일을 생성하고, 제2 비디오 파일과 제2 오디오 파일을 제2 AV 파일로 결합하여 메모리(588)에 저장할 수 있다. 전자 장치(500)에 3개 이상의 카메라가 구비될 경우, 프로세서(599)의 멀티태스킹(multitasking)(예: 프로세서(599)가 카메라들에 각각 지정되는 레코더들을 동시에 실행함)에 따라 3개 이상의 AV 파일이 상기와 같은 방식으로 동 시간대에 생성될 수 있다.In operation 1010, the processor 599 generates a first video file by encoding the video signal received from the first camera of the electronic device 500 frame by frame, and converts the audio signal received from the microphone of the electronic device 500 into frames. A first audio file may be generated by encoding in units, and the first video file and the first audio file may be combined into a first AV file and stored in the memory 588 . While generating the first video file and the first audio file, the processor 599 encodes the video signal received from the second camera of the electronic device 500 frame by frame to generate a second video file, and the electronic device ( A second audio file may be generated by encoding the audio signal received from the microphone of 500 in units of frames, and the second video file and the second audio file may be combined into a second AV file and stored in the memory 588 . When three or more cameras are provided in the electronic device 500, the processor 599 multitasking (eg, the processor 599 simultaneously executes recorders assigned to the cameras respectively) to set three or more cameras. AV files can be created at the same time in the same manner as above.

동작 1020에서 프로세서(599)는 제1 AV 파일의 제1 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 비디오 타임스탬프와 제1 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 오디오 타임스탬프를 메모리(588)에 저장할 수 있다. 프로세서(599)는 제2 AV 파일의 제2 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 비디오 타임스탬프와 제2 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 오디오 타임스탬프를 메모리(588)에 저장할 수 있다. 동 시간대에 3개 이상의 AV 파일이 생성될 경우 상기와 같은 방식으로 3개 이상의 비디오/오디오 타임스탬프가 메모리(588)에 저장될 수 있다.In operation 1020, the processor 599 generates a first video timestamp indicating when the first frame was created in the first video file of the first AV file and a first audio timestamp indicating when the first frame was created in the first audio file. may be stored in the memory 588. The processor 599 stores a second video timestamp indicating when the first frame was generated in the second video file of the second AV file and a second audio timestamp indicating when the first frame was generated in the second audio file in memory ( 588) can be stored. When three or more AV files are generated at the same time, three or more video/audio timestamps may be stored in the memory 588 in the same manner as described above.

동작 1030에서 프로세서(599)는 메모리(588)에 동시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들 중에서 상대적으로 큰 값을 갖는(예를 들어, 상대적으로 느린 시점을 나타내는) 제1 타임스탬프에 기초하여 비동기화 비디오 프레임 구간(예: 도 8의 프레임 구간 810)을 결정할 수 있다. 프로세서(599)는 AV 파일들에 포함된 오디오 타임스탬프들 중에서 상대적으로 큰 값(예를 들어, 상대적으로 느린 시점을 나타내는)을 갖는 제2 타임스탬프에 기초하여 비동기화 오디오 프레임 구간(예: 도 8의 프레임 구간 820)을 결정할 수 있다. 예컨대, 프로세서(599)는 비디오 타임스탬프들 중에서 가장 큰 값을 기준 시점으로 설정하고, 기준 시점보다 앞서 생성된 비디오 프레임들을 비동기화 비디오 프레임 구간에 속한 프레임으로 결정할 수 있다. 프로세서(599)는 오디오 타임스탬프들 중에서 가장 큰 값을 기준 시점으로 설정하고, 기준 시점보다 앞서 생성된 오디오 프레임들을 비동기화 오디오 프레임 구간에 속한 프레임으로 결정할 수 있다. 다른 실시예에서, 프로세서(599)는 메모리(588)에 동시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들과 오디오 타임스탬프들 중에서 가장 큰 값을 갖는 타임스탬프를 비동기화 비디오 프레임 구간을 결정하기 위해 이용되는 제1 타임스탬프로 선택할 수도 있다. 또한, 프로세서(599)는 비디오 타임스탬프들과 오디오 타임스탬프들 중에서 가장 큰 값을 갖는 타임스탬프를 비동기화 오디오 프레임 구간을 결정하기 위해 이용되는 제2 타임스탬프로 선택할 수 있다.In operation 1030, the processor 599 selects a first timestamp having a relatively large value (eg, indicating a relatively slow time point) among video timestamps included in AV files stored in the memory 588 at the same time. An unsynchronized video frame period (eg, the frame period 810 of FIG. 8) may be determined based on. The processor 599 performs an asynchronous audio frame interval (eg, diagram) based on a second timestamp having a relatively large value (eg, indicating a relatively slow time point) among audio timestamps included in AV files. A frame period 820 of 8) may be determined. For example, the processor 599 may set the largest value among video timestamps as a reference view, and determine video frames generated prior to the reference view as frames belonging to an asynchronous video frame period. The processor 599 may set the largest value among the audio timestamps as a reference time, and determine audio frames generated prior to the reference time as frames belonging to the asynchronous audio frame period. In another embodiment, the processor 599 determines the asynchronous video frame period by using the timestamp having the largest value among video timestamps and audio timestamps included in AV files stored at the same time in the memory 588. may be selected as the first timestamp used for Also, the processor 599 may select a timestamp having the largest value among the video timestamps and the audio timestamps as the second timestamp used to determine the unsynchronized audio frame period.

동작 1040에서 프로세서(599)는 AV 파일들마다 비동기화 비디오 프레임 구간에 속하지 않는 비디오 프레임들을 디코딩함으로써 복수의 비디오 신호들을 생성할 수 있다. 프로세서(599)는 AV 파일들마다 비동기화 오디오 프레임 구간에 속하지 않은 오디오 프레임들을 디코딩함으로써 복수의 오디오 신호들을 생성할 수 있다. In operation 1040, the processor 599 may generate a plurality of video signals by decoding video frames that do not belong to an asynchronous video frame period for each AV file. The processor 599 may generate a plurality of audio signals by decoding audio frames that do not belong to an asynchronous audio frame period for each AV file.

동작 1050에서 프로세서(599)는 복수의 비디오 신호들을 하나의 비디오 신호로 합성하고 프레임 단위로 인코딩함으로써 하나의 합성 비디오 파일을 생성하고, 복수의 오디오 신호들을 하나의 오디오 신호로 합성하고 프레임 단위로 인코딩함으로써 하나의 합성 오디오 파일을 생성할 수 있다. 프로세서(599)는 합성 비디오 파일과 합성 오디오 파일을 하나의 합성 AV 파일로 결합하여 메모리(588)에 저장할 수 있다.In operation 1050, the processor 599 synthesizes a plurality of video signals into one video signal and encodes them in units of frames to generate a single composite video file, and synthesizes a plurality of audio signals into one audio signal and encodes them in units of frames. By doing so, a single synthesized audio file can be created. The processor 599 may combine the composite video file and the composite audio file into a single composite AV file and store it in the memory 588 .

동작 1060에서 프로세서(599)는 합성 AV 파일을 재생하여 디스플레이와 스피커를 통해 출력할 수 있다. 추가적으로, 프로세서(599)는 비동기화 프레임 구간 내 오디오/비디오 프레임들을 먼저 재생하고 나서 합성 AV 파일을 재생할 수도 있다. 도 8을 참조하면, 프로세서(599)는 비동기화 프레임 구간(810, 820) 내 오디오/비디오 프레임들을 재생하고 나서 제1 합성 AV 파일(750)을 재생할 수 있다.In operation 1060, the processor 599 may reproduce and output the composite AV file through a display and a speaker. Additionally, the processor 599 may play the audio/video frames in the asynchronous frame period first and then play the composite AV file. Referring to FIG. 8 , the processor 599 may reproduce the audio/video frames within the asynchronous frame intervals 810 and 820 and then reproduce the first composite AV file 750 .

다양한 실시예에서 전자 장치(예: 도 5의 전자 장치(500))는, 카메라들; 마이크; 상기 카메라들과 상기 마이크에 작동적으로 연결된 프로세서; 및 상기 프로세서에 작동적으로 연결된 메모리를 포함할 수 있다. 상기 메모리(예: 도 5의 메모리(588))는, 실행될 때, 상기 프로세서(예: 도 5의 프로세서(599))가: 상기 카메라들 중 제1카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제1 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제1 오디오 파일을 생성하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 제1 AV 파일로 결합하여 상기 메모리에 저장하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 생성하는 동일한 시간대에, 상기 카메라들 중 제2카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제2 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제2 오디오 파일을 생성하고, 상기 제2 비디오 파일과 상기 제2 오디오 파일을 제2 AV 파일로 결합하여 상기 메모리에 저장하고, 상기 제1 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 비디오 타임스탬프, 상기 제1 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 오디오 타임스탬프, 상기 제2 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 비디오 타임스탬프, 및 상기 제2 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 오디오 타임스탬프를 상기 메모리에 저장하고, 상기 메모리에 동 시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제1 타임스탬프에 기초하여 비동기화 비디오 프레임 구간을 결정하고, 상기 AV 파일들에 포함된 오디오 타임스탬프들 중에서 또는 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제2 타임스탬프에 기초하여 비동기화 오디오 프레임 구간을 결정하고, 상기 AV 파일들마다 비동기화 비디오 프레임 구간에 속하지 않는 비디오 프레임들을 디코딩함으로써 복수의 비디오 신호들을 생성하고, 상기 복수의 비디오 신호들을 하나의 비디오 신호로 합성하고, 하나로 합성된 상기 비디오 신호를 인코딩함으로써 합성 비디오 파일을 생성하고, 상기 AV 파일들마다 비동기화 오디오 프레임 구간에 속하지 않은 오디오 프레임들을 디코딩함으로써 복수의 오디오 신호들을 생성하고, 상기 복수의 오디오 신호들을 하나의 오디오 신호로 합성하고, 하나로 합성된 상기 오디오 신호를 인코딩함으로써 합성 오디오 파일을 생성하고, 상기 합성 비디오 파일과 상기 합성 오디오 파일을 합성 AV 파일로 결합하여 상기 메모리에 저장하도록 하는 인스트럭션들을 저장할 수 있다.In various embodiments, an electronic device (eg, the electronic device 500 of FIG. 5 ) includes cameras; mike; a processor operatively connected to the cameras and the microphone; and a memory operatively coupled to the processor. When the memory (eg, memory 588 of FIG. 5 ) is executed, the processor (eg, processor 599 of FIG. 5 ): encodes a video signal received from a first one of the cameras in units of frames. To generate a first video file, to generate a first audio file by encoding the audio signal received from the microphone frame by frame, to combine the first video file and the first audio file into a first AV file, storing in a memory, and generating a second video file by encoding a video signal received from a second camera of the cameras frame by frame at the same time period during which the first video file and the first audio file are generated; A second audio file is generated by encoding the audio signal received from the microphone frame by frame, the second video file and the second audio file are combined into a second AV file, stored in the memory, and the first video file A first video timestamp indicating when the first frame was created in , a first audio timestamp indicating when the first frame was created in the first audio file, and a second video timestamp indicating when the first frame was created in the second video file. 2 A video timestamp and a second audio timestamp indicating a time when a first frame was generated in the second audio file are stored in the memory, and among video timestamps included in AV files stored in the memory at the same time period, or determining an unsynchronized video frame period based on a first timestamp indicating a relatively slow time point among video timestamps and audio timestamps included in the AV files, and determining an asynchronous video frame period, and Determines an unsynchronized audio frame period based on a second timestamp representing a relatively slow time point among video timestamps and audio timestamps included in AV files or among video timestamps and audio timestamps included in AV files, and determines an unsynchronized video frame interval for each AV file. A plurality of video signals are generated by decoding video frames not belonging to a section, the plurality of video signals are synthesized into one video signal, and a composite video file is generated by encoding the synthesized video signal, and the AV file A plurality of audio signals are generated by decoding audio frames that do not belong to the unsynchronized audio frame interval for each segment, the plurality of audio signals are synthesized into one audio signal, and the synthesized audio signal is encoded to generate a synthesized audio file. and storing instructions for generating and combining the composite video file and the composite audio file into a composite AV file and storing the composite AV file in the memory.

상기 인스트럭션들은 상기 프로세서가, 상기 제1 타임스탬프를 비디오 기준 시점으로 결정하고 상기 비디오 기준 시점보다 앞서 생성된 비디오 프레임을 상기 비동기화 비디오 프레임 구간 내 비디오 프레임으로 결정하고, 상기 제2 타임스탬프를 오디오 기준 시점으로 결정하고 상기 오디오 기준 시점보다 앞서 생성된 오디오 프레임을 상기 비동기화 오디오 프레임 구간 내 오디오 프레임으로 결정하도록 할 수 있다.The instructions include the processor determining the first timestamp as a video reference time, determining a video frame generated before the video reference time as a video frame within the asynchronous video frame period, and determining the second timestamp as an audio reference time. An audio frame generated prior to the audio reference point in time determined as the reference point in time may be determined as an audio frame within the asynchronous audio frame period.

상기 인스트럭션들은 상기 프로세서가, 상기 오디오 기준 시점보다 앞서 생성된 오디오 프레임과 상기 비디오 기준 시점보다 앞서 생성된 비디오 프레임을 식별함에 있어서, 첫번째 비디오 프레임부터 n번째 비디오 프레임까지 하나의 비디오 프레임이 디스플레이를 통해 이미지로 재생되는 비디오 프레임 시간을 누적하고, 상기 누적된 비디오 프레임 시간이 상기 비디오 타임스탬프들 중에서 가장 큰 값과 가장 작은 값 간의 차분치를 초과할 경우, 상기 첫번째 비디오 프레임부터 n-1번째 비디오 프레임까지 상기 비동기화 비디오 프레임 구간으로 결정하고, 첫번째 오디오 프레임부터 n번째 오디오 프레임까지 하나의 오디오 프레임이 스피커를 통해 소리로 재생되는 오디오 프레임 시간을 누적하고, 상기 누적된 오디오 프레임 시간이 상기 오디오 타임스탬프들 중에서 가장 큰 값과 가장 작은 값 간의 차분치를 초과할 경우, 상기 첫번째 오디오 프레임부터 n-1번째 오디오 프레임까지 상기 비동기화 오디오 프레임 구간으로 결정하도록 할 수 있다.In the instructions, when the processor identifies an audio frame generated prior to the audio reference viewpoint and a video frame generated prior to the video reference viewpoint, one video frame from the first video frame to the n-th video frame is displayed through the display. Accumulate video frame times reproduced as images, and if the accumulated video frame times exceed the difference between the largest value and the smallest value among the video timestamps, from the first video frame to the n-1th video frame. determined as the asynchronous video frame period, and accumulating audio frame times at which one audio frame is reproduced as sound through a speaker from the first audio frame to the n-th audio frame, and the accumulated audio frame times correspond to the audio timestamps When the difference value between the largest value and the smallest value is exceeded, the asynchronous audio frame section may be determined from the first audio frame to the n−1 th audio frame.

상기 인스트럭션들은 상기 프로세서가, 상기 합성 AV 파일을 재생하여 디스플레이와 스피커를 통해 이미지와 소리로 출력하되, 상기 비동기화 비디오 프레임 구간과 상기 비동기화 오디오 프레임 구간을 상기 합성 AV 파일보다 먼저 재생하도록 할 수 있다.The instructions may cause the processor to reproduce the synthesized AV file and output images and sounds through a display and a speaker, and reproduce the asynchronous video frame section and the asynchronous audio frame section before the synthesized AV file. there is.

상기 제1 카메라는 상기 전자 장치의 전면에 배치된 카메라이고 상기 제2 카메라는 상기 전자 장치의 후면에 배치된 카메라일 수 있다.The first camera may be a camera disposed on a front side of the electronic device, and the second camera may be a camera disposed on a rear side of the electronic device.

상기 인스트럭션들은 상기 프로세서가, 상기 제1 오디오 타임스탬프와 상기 제1 비디오 타임스탬프를 상기 제1 AV 파일의 예비 영역에 저장하고, 상기 제2 오디오 타임스탬프와 상기 제2 비디오 타임스탬프를 상기 제2 AV 파일의 예비 영역에 저장하도록 할 수 있다.The instructions cause the processor to store the first audio timestamp and the first video timestamp in a reserved area of the first AV file, and to store the second audio timestamp and the second video timestamp in the second AV file. It can be stored in the reserved area of the AV file.

상기 메모리가 AV 파일을 포함하는 컨테이너를 포함하되, 상기 컨테이너가 MP4일 수 있다.The memory may include a container including an AV file, and the container may be an MP4 file.

상기 인스트럭션들은 상기 프로세서가, 상기 MP4 컨테이너와 관련된 비디오 코덱을 이용하여 비디오 신호를 압축함으로써 비디오 파일을 생성하고, 상기 MP4 컨테이너와 관련된 오디오 코덱을 이용하여 오디오 신호를 압축함으로써 오디오 파일을 생성하도록 할 수 있다.The instructions may cause the processor to generate a video file by compressing a video signal using a video codec associated with the MP4 container, and to generate an audio file by compressing an audio signal using an audio codec associated with the MP4 container. there is.

상기 인스트럭션들은 상기 프로세서가, 상기 프로세서에서 발생하는 시스템 클럭에 기반하여 오디오 타임스탬프와 비디오 타임스탬프를 생성하도록 할 수 있다.The instructions may cause the processor to generate an audio timestamp and a video timestamp based on a system clock generated by the processor.

상기 카메라들에 일대일 대응되고 대응되는 카메라에서 촬영된 비디오를 기록하도록 구성된 복수의 레코더들이 상기 메모리에 저장되되, 상기 인스트럭션들은 상기 프로세서가, 상기 전자 장치의 입력 장치로부터 수신된 녹화 명령에 반응하여, 상기 레코더들을 동시에 실행함으로써 복수의 AV 파일들을 동 시간대에 생성하도록 할 수 있다.A plurality of recorders configured to correspond to the cameras one-to-one and record video captured by the corresponding cameras are stored in the memory, and the instructions are configured so that the processor responds to a recording command received from an input device of the electronic device, By simultaneously executing the recorders, a plurality of AV files can be created at the same time.

다양한 실시예에서 전자 장치(예: 도 5의 전자 장치(500))를 동작하는 방법은, 상기 전자 장치에 구비된 카메라들 중 제1카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제1 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제1 오디오 파일을 생성하고, 상기 제1 비디오 파일과 상기 제1 오디오 파일을 제1 AV 파일로 결합하여 상기 전자 장치의 메모리에 저장하는 동작(예: 도 10의 동작 1010); 상기 제1 비디오 파일과 상기 제1 오디오 파일을 생성하는 동일한 시간대에, 상기 카메라들 중 제2카메라로부터 수신된 비디오 신호를 프레임 단위로 인코딩함으로써 제2 비디오 파일을 생성하고, 상기 마이크로부터 수신된 오디오 신호를 프레임 단위로 인코딩함으로써 제2 오디오 파일을 생성하고, 상기 제2 비디오 파일과 상기 제2 오디오 파일을 제2 AV 파일로 결합하여 상기 메모리에 저장하는 동작(예: 도 10의 동작 1010); 상기 제1 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 비디오 타임스탬프, 상기 제1 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제1 오디오 타임스탬프, 상기 제2 비디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 비디오 타임스탬프, 및 상기 제2 오디오 파일에서 첫번째 프레임이 생성된 시점을 나타내는 제2 오디오 타임스탬프를 상기 메모리에 저장하는 동작(예: 도 5의 동작 1020); 상기 메모리에 동 시간대에 저장된 AV 파일들에 포함된 비디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제1 타임스탬프에 기초하여 비동기화 비디오 프레임 구간을 결정하고, 상기 AV 파일들에 포함된 오디오 타임스탬프들 중에서 또는 상기 AV 파일들에 포함된 비디오 타임스탬프들 및 오디오 타임스탬프들 중에서 상대적으로 느린 시점을 나타내는 제2 타임스탬프에 기초하여 비동기화 오디오 프레임 구간을 결정하는 동작(예: 도 5의 동작 1030); 상기 AV 파일들마다 비동기화 비디오 프레임 구간에 속하지 않는 비디오 프레임들을 디코딩함으로써 복수의 비디오 신호들을 생성하고, 상기 복수의 비디오 신호들을 하나의 비디오 신호로 합성하고, 하나로 합성된 상기 비디오 신호를 인코딩함으로써 합성 비디오 파일을 생성하는 동작(예: 도 5의 동작 1040 및 1050); 상기 AV 파일들마다 비동기화 오디오 프레임 구간에 속하지 않은 오디오 프레임들을 디코딩함으로써 복수의 오디오 신호들을 생성하고, 상기 복수의 오디오 신호들을 하나의 오디오 신호로 합성하고, 하나로 합성된 상기 오디오 신호를 인코딩함으로써 합성 오디오 파일을 생성하는 동작(예: 도 5의 동작 1040 및 1050); 및 상기 합성 비디오 파일과 상기 합성 오디오 파일을 합성 AV 파일로 결합하여 상기 메모리에 저장하는 동작(예: 도 5의 동작 1050)을 포함할 수 있다.In various embodiments, a method of operating an electronic device (eg, the electronic device 500 of FIG. 5 ) encodes a video signal received from a first camera among cameras included in the electronic device in units of frames to generate a first video signal. A file is generated, an audio signal received from the microphone is encoded frame by frame to generate a first audio file, the first video file and the first audio file are combined into a first AV file, and the memory of the electronic device is generated. operation of storing in (eg, operation 1010 of FIG. 10); A second video file is generated by encoding a video signal received from a second camera among the cameras frame by frame during the same time period in which the first video file and the first audio file are generated, and the audio received from the microphone is generated. generating a second audio file by encoding a signal frame by frame, combining the second video file and the second audio file into a second AV file, and storing the second audio file in the memory (eg, operation 1010 of FIG. 10 ); A first video timestamp indicating when the first frame was created in the first video file, a first audio timestamp indicating when the first frame was created in the first audio file, and the first frame in the second video file storing in the memory a second video timestamp indicating when a first frame was created and a second audio timestamp indicating when a first frame was created in the second audio file (eg, operation 1020 of FIG. 5 ); Based on a first timestamp indicating a relatively slow time point among video timestamps included in AV files stored in the memory at the same time, or among video timestamps and audio timestamps included in the AV files, the ratio Determines a synchronization video frame interval, based on a second timestamp indicating a relatively slow time point among audio timestamps included in the AV files or among video timestamps and audio timestamps included in the AV files to determine an unsynchronized audio frame section (eg, operation 1030 of FIG. 5); For each of the AV files, a plurality of video signals are generated by decoding video frames that do not belong to an asynchronous video frame period, the plurality of video signals are synthesized into one video signal, and the video signal synthesized into one is synthesized by encoding. creating a video file (eg, operations 1040 and 1050 of FIG. 5 ); For each of the AV files, a plurality of audio signals are generated by decoding audio frames that do not belong to the asynchronous audio frame period, the plurality of audio signals are synthesized into one audio signal, and the synthesized audio signal is synthesized by encoding. creating an audio file (eg, operations 1040 and 1050 of FIG. 5 ); and combining the composite video file and the composite audio file into a composite AV file and storing the composite AV file in the memory (eg, operation 1050 of FIG. 5 ).

상기 비동기화 비디오 프레임 구간과 상기 비동기화 오디오 프레임 구간을 결정하는 동작은, 상기 제1 타임스탬프를 비디오 기준 시점으로 결정하고 상기 비디오 기준 시점보다 앞서 생성된 비디오 프레임을 상기 비동기화 비디오 프레임 구간 내 비디오 프레임으로 결정하는 동작과, 상기 제2 타임스탬프를 오디오 기준 시점으로 결정하고 상기 오디오 기준 시점보다 앞서 생성된 오디오 프레임을 상기 비동기화 오디오 프레임 구간 내 오디오 프레임으로 결정하는 동작을 포함할 수 있다.The determining of the unsynchronized video frame period and the unsynchronized audio frame period may include determining the first timestamp as a video reference time and using a video frame generated before the video reference time as a video within the asynchronous video frame period. and determining the second timestamp as an audio reference time point and determining an audio frame generated prior to the audio reference time point as an audio frame within the asynchronous audio frame section.

상기 비동기화 비디오 프레임 구간을 결정하는 동작은 첫번째 비디오 프레임부터 n번째 비디오 프레임까지 하나의 비디오 프레임이 디스플레이를 통해 이미지로 재생되는 비디오 프레임 시간을 누적하는 동작과, 상기 누적된 비디오 프레임 시간이 상기 비디오 타임스탬프들 중에서 가장 큰 값과 가장 작은 값 간의 차분치를 초과할 경우, 상기 첫번째 비디오 프레임부터 n-1번째 비디오 프레임까지 상기 비동기화 비디오 프레임 구간으로 결정하는 동작을 포함할 수 있다. 상기 비동기화 오디오 프레임 구간을 결정하는 동작은 첫번째 오디오 프레임부터 n번째 오디오 프레임까지 하나의 오디오 프레임이 스피커를 통해 소리로 재생되는 오디오 프레임 시간을 누적하는 동작과, 상기 누적된 오디오 프레임 시간이 상기 오디오 타임스탬프들 중에서 가장 큰 값과 가장 작은 값 간의 차분치를 초과할 경우, 상기 첫번째 오디오 프레임부터 n-1번째 오디오 프레임까지 상기 비동기화 오디오 프레임 구간으로 결정하는 동작을 포함할 수 있다.The determining of the unsynchronized video frame interval may include accumulating video frame times at which one video frame is reproduced as an image through a display from a first video frame to an n-th video frame, and the accumulated video frame time corresponds to the video frame time. and determining the asynchronous video frame interval from the first video frame to the n−1 th video frame when the difference between the largest value and the smallest value among the timestamps is exceeded. The determining of the unsynchronized audio frame section may include accumulating audio frame times in which one audio frame is reproduced as sound through a speaker from a first audio frame to an n-th audio frame, and the accumulated audio frame time is the audio frame time. and determining the asynchronous audio frame period from the first audio frame to the n−1 th audio frame when the difference between the largest value and the smallest value among the timestamps is exceeded.

상기 방법은 상기 합성 AV 파일을 재생하여 디스플레이와 스피커를 통해 이미지와 소리로 출력하되, 상기 비동기화 비디오 프레임 구간과 상기 비동기화 오디오 프레임 구간을 상기 합성 AV 파일보다 먼저 재생하는 동작(예: 도 5의 동작 1060)을 더 포함할 수 있다.The method reproduces the synthesized AV file and outputs images and sounds through a display and a speaker, and reproduces the asynchronous video frame section and the asynchronous audio frame section before the synthesized AV file (eg, FIG. 5 ). Operation 1060 of) may be further included.

상기 제1 AV 파일과 상기 제2 AV 파일을 저장하는 동작은, 상기 제1 오디오 타임스탬프와 상기 제1 비디오 타임스탬프를 상기 제1 AV 파일의 예비 영역에 저장하는 동작과, 상기 제2 오디오 타임스탬프와 상기 제2 비디오 타임스탬프를 상기 제2 AV 파일의 예비 영역에 저장하는 동작을 포함할 수 있다.The storing of the first AV file and the second AV file may include: storing the first audio timestamp and the first video timestamp in a spare area of the first AV file; and storing the stamp and the timestamp of the second video in a reserved area of the second AV file.

본 명세서와 도면에 개시된 본 발명의 실시예들은 본 발명의 실시예에 따른 기술 내용을 쉽게 설명하고 본 발명의 실시예의 이해를 돕기 위해 특정 예를 제시한 것일 뿐이며, 본 발명의 실시예의 범위를 한정하고자 하는 것은 아니다. 따라서 본 발명의 다양한 실시예의 범위는 여기에 개시된 실시예들 이외에도 본 발명의 다양한 실시예의 기술적 사상을 바탕으로 도출되는 모든 변경 또는 변형된 형태가 본 발명의 다양한 실시예의 범위에 포함되는 것으로 해석되어야 한다.The embodiments of the present invention disclosed in the present specification and drawings are only presented as specific examples to easily explain the technical content according to the embodiments of the present invention and help understanding of the embodiments of the present invention, and limit the scope of the embodiments of the present invention. It's not what I want to do. Therefore, the scope of various embodiments of the present invention should be construed as including all changes or modified forms derived based on the technical spirit of various embodiments of the present invention in addition to the embodiments disclosed herein are included in the scope of various embodiments of the present invention. .

Claims

In electronic devices,
cameras;
mike;
a processor operatively coupled to the cameras and the microphone; and
a memory operatively coupled to the processor;
The memory, when executed, causes the processor to:
A first video file is generated by encoding a video signal received from a first camera of the cameras in units of frames, and a first audio file is generated by encoding an audio signal received from the microphone in units of frames. combining a video file and the first audio file into a first AV file and storing the first AV file in the memory;
A second video file is generated by encoding a video signal received from a second camera among the cameras frame by frame during the same time period in which the first video file and the first audio file are generated, and the audio received from the microphone is generated. generating a second audio file by encoding a signal frame by frame, combining the second video file and the second audio file into a second AV file, and storing the second audio file in the memory;
A first video timestamp indicating when the first frame was created in the first video file, a first audio timestamp indicating when the first frame was created in the first audio file, and the first frame in the second video file storing in the memory a second video timestamp indicating a time when a first frame was created and a second audio timestamp indicating a time when a first frame was generated in the second audio file;
Based on a first timestamp indicating a relatively slow time point among video timestamps included in AV files stored in the memory at the same time, or among video timestamps and audio timestamps included in the AV files, the ratio determining a synchronization video frame interval;
Determining an unsynchronized audio frame interval based on a second timestamp indicating a relatively slow time point among audio timestamps included in the AV files or among video timestamps and audio timestamps included in the AV files; ,
For each of the AV files, a plurality of video signals are generated by decoding video frames that do not belong to an asynchronous video frame period, the plurality of video signals are synthesized into one video signal, and the video signal synthesized into one is synthesized by encoding. create a video file;
For each of the AV files, a plurality of audio signals are generated by decoding audio frames that do not belong to the asynchronous audio frame period, the plurality of audio signals are synthesized into one audio signal, and the synthesized audio signal is synthesized by encoding. create an audio file;
An electronic device storing instructions for combining the composite video file and the composite audio file into a composite AV file and storing the composite AV file in the memory.

The method of claim 1, wherein the instructions are the processor,
determining the first timestamp as a video reference viewpoint and determining a video frame generated before the video reference viewpoint as a video frame within the asynchronous video frame interval;
The electronic device determines the second timestamp as an audio reference time and determines an audio frame generated before the audio reference time as an audio frame within the asynchronous audio frame section.

3. The method of claim 2, wherein the instructions are configured by the processor to identify an audio frame generated prior to the audio reference viewpoint and a video frame generated prior to the video reference viewpoint,
From the first video frame to the n-th video frame, video frame times during which one video frame is reproduced as an image through a display are accumulated, and the accumulated video frame time is the difference between the largest value and the smallest value among the video timestamps. If it exceeds the value, determining the asynchronous video frame interval from the first video frame to the n-1 th video frame;
One audio frame from the first audio frame to the n-th audio frame accumulates audio frame times reproduced as sound through a speaker, and the accumulated audio frame time is the difference between the largest value and the smallest value among the audio timestamps. If the value exceeds the value, the electronic device determines the asynchronous audio frame period from the first audio frame to the n-1 th audio frame.

The method of claim 1, wherein the instructions are the processor,
An electronic device that reproduces the synthesized AV file and outputs images and sounds through a display and a speaker, wherein the asynchronous video frame section and the asynchronous audio frame section are played before the synthesized AV file.

According to claim 1,
The first camera is a camera disposed on the front of the electronic device,
The second camera is a camera disposed on a rear surface of the electronic device.

The method of claim 1, wherein the instructions are the processor,
store the first audio timestamp and the first video timestamp in a reserved area of the first AV file;
and storing the second audio timestamp and the second video timestamp in a reserved area of the second AV file.

The method of claim 1, wherein the memory comprises a container including an AV file,
An electronic device in which the container is MP4.

The method of claim 7, wherein the instructions are the processor,
generating a video file by compressing a video signal using a video codec associated with the MP4 container;
An electronic device for generating an audio file by compressing an audio signal using an audio codec associated with the MP4 container.

The method of claim 1, wherein the instructions are the processor,
An electronic device that generates an audio timestamp and a video timestamp based on a system clock generated by the processor.

The method of claim 1 , wherein a plurality of recorders configured to correspond to the cameras one-to-one and record video captured by the corresponding cameras are stored in the memory, wherein the instructions are configured to:
An electronic device that generates a plurality of AV files at the same time by simultaneously executing the recorders in response to a recording command received from an input device of the electronic device.

A method for operating an electronic device,
A first video file is generated by encoding a video signal received from a first camera among cameras included in the electronic device in units of frames, and a first audio file is generated by encoding an audio signal received from the microphone in units of frames. combining the first video file and the first audio file into a first AV file and storing the combined first AV file in a memory of the electronic device;
A second video file is generated by encoding a video signal received from a second camera among the cameras frame by frame during the same time period in which the first video file and the first audio file are generated, and the audio received from the microphone is generated. generating a second audio file by encoding a signal frame by frame, combining the second video file and the second audio file into a second AV file, and storing the second audio file in the memory;
A first video timestamp indicating when the first frame was created in the first video file, a first audio timestamp indicating when the first frame was created in the first audio file, and the first frame in the second video file storing, in the memory, a second video timestamp indicating when a first frame was created and a second audio timestamp indicating when a first frame was generated in the second audio file;
Based on a first timestamp indicating a relatively slow time point among video timestamps included in AV files stored in the memory at the same time, or among video timestamps and audio timestamps included in the AV files, the ratio Determines a synchronization video frame interval, based on a second timestamp indicating a relatively slow time point among audio timestamps included in the AV files or among video timestamps and audio timestamps included in the AV files determining an unsynchronized audio frame section by performing the step;
For each of the AV files, a plurality of video signals are generated by decoding video frames that do not belong to an asynchronous video frame period, the plurality of video signals are synthesized into one video signal, and the video signal synthesized into one is synthesized by encoding. creating a video file;
For each of the AV files, a plurality of audio signals are generated by decoding audio frames that do not belong to the asynchronous audio frame period, the plurality of audio signals are synthesized into one audio signal, and the synthesized audio signal is synthesized by encoding. creating an audio file; and
and combining the composite video file and the composite audio file into a composite AV file and storing the composite AV file in the memory.

The method of claim 11, wherein determining the asynchronous video frame period and the asynchronous audio frame period comprises:
determining the first timestamp as a video reference viewpoint and determining a video frame generated before the video reference viewpoint as a video frame within the asynchronous video frame period;
and determining the second timestamp as an audio reference time point and determining an audio frame generated prior to the audio reference time point as an audio frame within the asynchronous audio frame period.

According to claim 12,
The operation of determining the asynchronous video frame period
accumulating video frame times during which one video frame is reproduced as an image through a display from the first video frame to the nth video frame;
determining the asynchronous video frame period from the first video frame to the n-1 th video frame when the accumulated video frame time exceeds the difference between the largest value and the smallest value among the video timestamps; include,
The operation of determining the unsynchronized audio frame period
accumulating audio frame times during which one audio frame is reproduced as sound through a speaker from the first audio frame to the nth audio frame;
determining the asynchronous audio frame period from the first audio frame to the n−1 th audio frame when the accumulated audio frame time exceeds a difference between the largest value and the smallest value among the audio timestamps; How to include.

According to claim 11,
The method of claim 1 , further comprising reproducing the synthesized AV file and outputting images and sounds through a display and a speaker, and reproducing the asynchronous video frame section and the asynchronous audio frame section prior to the synthesized AV file.

12. The method of claim 11, wherein the storing of the first AV file and the second AV file comprises:
storing the first audio timestamp and the first video timestamp in a reserved area of the first AV file;
and storing the second audio timestamp and the second video timestamp in a reserved area of the second AV file.