EP4004918A1 - Media client for recording and playing back interpretation - Google Patents
Info
- Publication number
- EP4004918A1 (application EP20739675.5A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- segment
- source
- interpreted
- media
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/04—Electrically-operated educational appliances with audible presentation of the material to be studied
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/11—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
Definitions
- the present invention generally relates, amongst others, to a media client and a method for recording and playing back interpretation. More particularly, it relates to a media client and a method for recording a sequence of interpreted segments, wherein each interpreted segment comprises an interpretation into a second language of a source segment of a source media comprising a first language.
- Interpreting is a translational activity in which one produces a translation based on an exposure to an expression in a source language, wherein the translation and/or the source language can be for example spoken or can be for example sign language.
- the two most common modes of interpreting are simultaneous interpreting, which is done at the time of the exposure to the source language, and consecutive interpreting, which is done at breaks in this exposure. Both for simultaneous and consecutive interpretation, the interpreter plays a key role in the exchange between partners whose language and culture are different.
- Simultaneous interpreting is the oral translation of a speech as it unfolds. In other words, during simultaneous interpretation, the interpreter does not interrupt the speech of the talker. The interpreter listens to a speech and translates it in real time while listening. The listeners can hear the interpreted speech, which may or may not be overlaid with the speech of the talker. This type of interpretation is often used as its benefits in terms of time savings are indisputable. For example, simultaneous interpreting is tailor-made for large seminars and conferences. The great advantage of simultaneous interpretation is indeed that efficiency is high, as the interpretation happens within a few seconds or minutes, and that it does not affect or interrupt the talker's thinking.
- Consecutive interpreting allows the oral transposition of a speech into another language just after the speech has finished.
- the principle is as follows: as the speaker talks, the interpreter listens, for example takes notes, and then reproduces the speech in the target language.
- the main difference between simultaneous interpreting and consecutive interpreting lies in how the comprehension phase is separated from the interpretation playback phase. Indeed, the simultaneous interpreter has little time to reproduce and properly structure the speech in the target language. With consecutive interpretation, the interpreter has more time. This means that he or she has the possibility to move away from the original speech, avoid literal translation more easily and even summarise as well as cut out any undesirable interferences.
- Consecutive interpreting is therefore often preferred to simultaneous interpreting in the context of human resources meetings for multilingual staff, press and personal interviews, business meetings, court depositions, medical interviews and consultations, tours and travels, etc.
- Interpretation is a high-level language translation activity with the added difficulty that it must be performed under strict time restrictions. Interpreting requires the interpreter to listen accurately to distinguish source language speech, and to rapidly plan, express, monitor and correctly predict, understand, memorize and convert source language information on a particular thematic knowledge into a target language different from the source language. Such skills can be trained by practicing live interpretation, both for simultaneous and for consecutive interpreting. On-the-job training for interpreters also exists to allow them to gather knowledge in specific technical domains requiring the use of precise detailed technical terms, such as for example healthcare, medicine, engineering, architecture and construction, finance, literature, etc. For instance, for medical interpreting, the interpreter should have prior experience in a particular field. They should know and understand the specific terminology used in the specific branch of medicine and translate the words into language that the listener can easily understand.
- a media client for recording a sequence of interpreted segments, wherein each interpreted segment comprises a consecutive interpretation into a second language of a source segment of a source media comprising a first language, wherein the media client is configured to play back a first source segment of the source media comprising the first language; and wherein the media client is further configured to:
- the data comprises the first point in time and either the second point in time or a duration of the first interpreted segment determined between the second point in time and the first point in time.
- the media client according to the present disclosure makes it possible to prepare interpreting exercises and send them to interpreters, to record performances of interpreters for consecutive interpreting exercises, and to evaluate interpretations and add feedback for the interpreters in the recording. Teachers or peers can evaluate these recorded interpretations rapidly and easily.
- the media client according to the present disclosure evaluates not only the spoken interpretation, but also the way the interpreter presents the message. Indeed, by capturing for example not only the interpreter's microphone but also the interpreter's webcam, the performance can be evaluated in full.
- the media client according to the present disclosure allows a more emotional and fluent delivery of the interpretation.
- the media client according to the present disclosure makes it easy for interpreters to check the quality of their interpretation as well as the conformity with the source media.
- An interpreter and/or a teacher can quickly jump to points in time of interest in the source media. This is done in a user-friendly and intuitive manner.
- the media client according to the present disclosure allows recording a sequence of interpreted segments of a source media. The first point in time and the second point in time are determined with respect to the source media. The steps of this method are repeated for all the interpreted segments, thereby recording a sequence of interpreted segments.
- a teacher may simultaneously supervise, compare and/or evaluate the work of a plurality of interpreters and/or student interpreters who all use the media client according to the present invention.
- a media client is for example a media player configured to record and play media files.
- a media client in the context of the present disclosure is a video player.
- a media client is for example any type of media player software or any type of application software for playing multimedia computer files like audio and video files.
- a media client is for example a media player comprised in a web browser.
- a media client is for example a media player comprised in a web application, or a desktop application. Examples of devices that are suitable as a media client are desktop and laptop computers, smartphones, tablets, set-top boxes and TVs.
- a media client may also refer to a media player application, or application, running on any of such devices.
- the media client according to the present disclosure therefore allows teachers and interpreters and student interpreters to enjoy the benefits of practicing interpretation outside of the interpreting lab, taking away all limitations of time and place during preparation, practice or evaluation. It becomes possible to train professional interpreting skills through the intuitive training method according to the present disclosure, and to capture performances of interpreters during classes, live conferences and examinations. Additionally, the media client according to the present disclosure allows interpreters to train using the same equipment they will be using in their professional environment. This way, the media client according to the present disclosure maximizes the learning effect and ease of use for the interpreters, the teachers and/or the students.
- source media being played is for example audio or audio in combination with ordered still pictures or frames that are decoded or decompressed and played one after the other within for example a media client.
- source media is for example audio, such as a source track, or audio combined with video.
- the terms "fragment", "slice", "segment" and "part" are used throughout the text below as synonyms.
- the recording of an interpreted "fragment", interpreted "slice", interpreted "segment" or interpreted "part" corresponds to the recording, by the media client according to the present disclosure, of the audio and/or video interpretation of source segments of the source media by an interpreter using the media client.
- an interpreted segment is for example audio, such as a track, or audio combined with video, corresponding to the interpretation of a source segment of the source media.
- the term "interpret" relates to the translation of what is being said or signed into another language.
- the term "data" refers for example to "metadata", for example one or more metadata files.
- the term "duration" refers to the length in time of media.
- the expressions comprising the term "recording" preferably refer to the interpretation of a source media by a user of the media client, i.e. preferably refer to interpreted segments.
- the resulting sequence of interpreted segments consists of one audio and/or video file.
- Information on where individual interpreted segments are exactly positioned in the sequence of interpreted segments is for example saved in the data.
- This information for example consists of the start time in relation to the source media and the duration of each interpreted segment of the sequence of interpreted segments.
- the first point in time and the second point in time are defined with respect to a duration of the source media.
- the media client determines and stores a timestamp of the first point in time with respect to the source media and the media client further determines and stores a timestamp of the second point in time with respect to the source media.
- the media client determines and stores a timestamp of the first point in time with respect to the source media and further determines and stores a duration of an interpreted segment determined between the second point in time and the first point in time. All the recorded interpreted segments, together with the data and the source media can for example subsequently be saved into a project file, such as for example into an IQF project file.
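The two metadata schemes described above — storing both timestamps, or storing one timestamp plus a duration — carry the same information. A minimal sketch, assuming seconds as the unit; all names are illustrative, not taken from the patent:

```python
from dataclasses import dataclass

@dataclass
class SegmentMetadata:
    # First point in time, in seconds, relative to the source media.
    start: float
    # Length of the recorded interpretation: the second point in time
    # minus the first point in time.
    duration: float

def from_timestamps(first: float, second: float) -> SegmentMetadata:
    # Scheme two: derive the duration from the two stored timestamps.
    return SegmentMetadata(start=first, duration=second - first)

def second_point(meta: SegmentMetadata) -> float:
    # Scheme one recovered: the second point in time.
    return meta.start + meta.duration
```

For example, `from_timestamps(12.0, 19.5)` yields a segment with start 12.0 and duration 7.5, from which the second point in time, 19.5, can be recovered.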
- Source media played by the media client is for example audio or audio in combination with ordered still pictures or frames that are decoded or decompressed and played one after the other within the media client.
- source media is for example audio, such as a source track, or audio combined with video.
- the media client may be any device capable of receiving a digital representation of such media over a communication network and capable of decoding the representation into a sequence of frames that can be displayed on a screen to a user.
- Interpreter training requires practicing live interpretation of a source fragment, this fragment being audio only or audio combined with video.
- a source fragment of source media for example comprises audio or audio combined with video, wherein the audio comprises a source language which is to be interpreted, for example during an interpretation exercise, by for example an interpreter and/or a teacher and/or a student.
- This live interpretation of a source segment can be:
- An interpreter typically starts by listening to a part or source segment of the source media.
- the student can click on for example the 'Start Recording' button displayed on the media client.
- the playback of the source track is then paused and the video of the source media is frozen, and the microphone, i.e. audio, and optionally the webcam, i.e. video, start being recorded by the media client.
- the interpreter can visually see a waveform being inserted into the recording track on the media client, while a flat line indicating silence is inserted into the waveform representation of the source media.
- the interpreter can continue the exercise by clicking on for example the 'Continue playback' button displayed on the media client.
- the recording of the first interpreted segment then pauses and the playback of source media resumes where it left off at the start of the previous recording, thereby defining an end to the first interpreted segment.
- the sequence of interpreted segments is recorded continuously and the timestamps of the first point in time and of the second point in time are used to determine the time boundaries of the first interpreted segment.
- the sequence of interpreted segments is recorded continuously and the timestamp of the first point in time and a duration of the first interpreted segment between the timestamp of the second point in time and the first point in time are determined.
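Because the sequence of interpreted segments is recorded as one continuous track, the position of each segment inside that track can be derived from the stored durations alone. A sketch under that assumption (hypothetical helper, seconds as unit):

```python
def offsets_in_recording(durations):
    """Return where each interpreted segment begins inside the single
    continuous recording: segment i starts after the summed durations
    of segments 0..i-1."""
    offsets, t = [], 0.0
    for d in durations:
        offsets.append(t)
        t += d
    return offsets
```

For instance, three segments of 7.5 s, 3.0 s and 4.5 s begin at offsets 0.0, 7.5 and 10.5 in the continuous recording.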
- This embodiment has the technical advantage that reactions and/or emotions of the interpreter are recorded while the interpreter is listening to and/or looking at the video of the source media. Data such as for example metadata containing the start time of the recording of the first interpreted segment in relation to the source media and its duration is generated.
- when the interpreter wants to translate the next source segment, the interpreter can click on for example the 'Continue recording' button displayed on the media client. This pauses the playback of the source media and resumes recording of the sequence of interpreted segments. This sequence of actions continues until the exercise is finished, i.e. for example until the whole source media has been interpreted.
- a teacher or the interpreter can play back the source media alongside the sequence of interpreted segments.
- a waveform visualization of the source media is generated based on the binary audio information of the source media file.
- this waveform is then adjusted based on the data of that particular recording.
- a flat line, indicating silence, is added to the waveform of the source media where a recorded interpreted segment is located according to its start time and duration.
- a waveform visualization of the recorded sequence of interpreted segments is based on the binary audio information of its corresponding media file. This waveform is then adjusted based on the data and the interpreting mode.
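The waveform adjustment described above can be sketched as inserting runs of zero samples (the flat line) into the source waveform at each segment's start position. A sketch only; the sample lists, units and names below are assumptions:

```python
def source_waveform_with_silence(samples, segments, rate):
    """Insert a flat line (zeros) into the display waveform of the source
    media wherever a recorded interpreted segment sits, using the segment's
    start time and duration in seconds. `rate` is samples per second of
    the display waveform."""
    out, cursor = [], 0
    for start, duration in sorted(segments):
        cut = int(start * rate)
        out.extend(samples[cursor:cut])           # source audio up to the pause
        out.extend([0.0] * int(duration * rate))  # silence for the recording
        cursor = cut
    out.extend(samples[cursor:])
    return out
```

With a 4-sample source at 2 samples/s and one segment starting at 1.0 s lasting 1.0 s, two zero samples are spliced in at the pause point, lengthening the displayed timeline accordingly.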
- When a position of a time mark coincides with an interpreted segment, defined by its start time and duration, the interpreted segment is played at a position relative to the overall length while the source media is paused. When the position of the time mark does not match one of the interpreted segments, the sequence of interpreted segments is paused, and the source media is played at a position relative to the overall length of the exercise.
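That playback rule reduces to a small dispatch on the position of the time mark. In this sketch `segments` holds (start, duration) pairs on the overall exercise timeline; every name is an assumption:

```python
def playback_state(time_mark, segments):
    """If the time mark falls inside an interpreted segment, that segment
    plays (at an offset within the segment) and the source media is paused;
    otherwise the sequence of interpreted segments is paused and the source
    media plays."""
    for start, duration in segments:
        if start <= time_mark < start + duration:
            return ("interpretation", time_mark - start)
    return ("source", time_mark)
```

With one segment at (3.0, 4.0), a time mark at 5.0 plays the interpretation 2.0 s in, while a time mark at 1.0 plays the source media.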
- the media client displays the source media and the video of the recording of the sequence of interpreted segments next to each other.
- when the playback of the video of the source media pauses to start the recording of an interpreted segment, the video of the recording of the interpreted segment starts; and when the recording of the sequence of interpreted segments is paused to interpret a further source segment of the source media, the video of the recording of the interpreted segment pauses and the playback of the video of the source media starts again.
- the project file, with for example extension IQF, can optionally be encrypted and compressed into for example a container file, and comprises:
- a source media, for example a source media file;
- data, for example a metadata file.
- a metadata file for example comprises one or more of the following:
- An interpreted segment has a start time relative to the source media and a duration with respect to that start time.
- Alternative ways of describing an interpreted segment can be to save the start and stop time relative to the source media, to save the interval between the interpreted segments or to save times relative to the total duration of the exercise. In simultaneous mode there is only one recording of a sequence of interpreted segments for each recorded version.
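These alternative descriptions carry the same timing information and convert into one another. A sketch with illustrative names and seconds as unit:

```python
def to_start_stop(start, duration):
    # start/duration  ->  start/stop, both relative to the source media
    return (start, start + duration)

def to_start_duration(start, stop):
    # start/stop  ->  start/duration
    return (start, stop - start)

def intervals_between(segments):
    # A third equivalent encoding: the gap between consecutive interpreted
    # segments, from each segment's end to the next segment's start.
    return [nxt[0] - (cur[0] + cur[1])
            for cur, nxt in zip(segments, segments[1:])]
```

A segment stored as start 12.0 / duration 7.5 is the same segment as start 12.0 / stop 19.5, and the round trip is lossless.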
- the media client is further configured to:
- the interpreter can start the recording at any time relative to the source track. When the recording of the interpretation of the source fragment / track is finished, the interpreter can play back the recorded interpretation together with the source track.
- the media client according to the present disclosure allows recording a sequence of interpreted segments of a source media. The first point in time and the second point in time are determined with respect to the source media. The steps of this method are repeated for all the interpreted segments, thereby recording a sequence of interpreted segments. The consecutive interpretation can be played back by the media client such that the playback plays source segments consecutively followed by the recording of their respective interpreted segments.
- the video of the source media is for example paused or frozen at the first point in time, thereby allowing the playback of the recording of the first interpreted segment for the duration of the first interpreted segment, and at the second point in time, the recording of the first interpreted segment is paused or frozen such that a second source segment of the source media starting from the first point in time can be played back.
- This ensures either a source segment or an interpreted segment is always played back and shows the user of the media client that the video is still relevant even though not actively being played.
- the audio recording of the first interpreted segment is paused, but the video is still being recorded.
- the media client allows the export of for example an IQF file or a zip file comprising for example an MP3 or an MP4 file comprising only the recording of the sequence of interpreted segments, such as for example the audio and/or the video track of the source media and of the sequence of interpreted segments consecutively or simultaneously playing, or the audio and/or the video track of only the sequence of interpreted segments, and/or further comprising the data.
- the recorded video and/or audio of the sequence of interpreted segments comprises several files.
- additional data is stored in the project file in the media client, such as for example references to the different recorded files of the same exercise, and possibly the lengths, in milliseconds / seconds / frames / ... , of these files.
- the media client for example comprises a playback unit configured to play back the first source segment of the source media comprising the first language.
- the playback unit may be further configured to pause the playback of the first source segment at the first point in time.
- the media client for example further comprises a recording unit configured to, at the first point in time, record a first interpreted segment comprising a live consecutive interpretation into the second language of the first source segment.
- the recording unit is for example further configured to define an end of the recording of the first interpreted segment at a second point in time and the playback unit is further configured to, at the second point in time, play back a second source segment of the source media starting from the first point in time referenced with respect to the source media.
- the media client for example further comprises a metadata generator configured to generate metadata for the interpreted segments.
- the metadata generator is configured to determine a duration of the first interpreted segment between the second point in time and the first point in time and store data on the first interpreted segment, wherein the data comprises the first point in time and the duration of the first interpreted segment.
- the media client is further configured to pause the playback of the first source segment and to start recording the first interpreted segment simultaneously.
- the media client is further configured to play back the second source segment of the source media starting from the first point in time when the duration of the first interpreted segment is reached.
- the media client alternates the playback between source segments consecutively followed by the recording of their respective interpreted segments.
- the interpretation of the source media starts again from the first point in time such that no fraction of the source media is left uninterpreted. In other words, the source media may be completely interpreted by the user of the media client.
- the media client is further configured to allow a user of the media client to define a mark at one or more predetermined points in time in the source media.
- the media client allows interaction with one or two or more interpreters at the same time, or with only one individual interpreter.
- a mark can comprise an audio track and/or a video track and/or text which can be then played and/or displayed by the media client for one or more interpreters.
- information can be shared using marks at one or more predetermined points in time in the source media. For example, in a preparation mode, a teacher can prepare an interpretation exercise based on an audio track or a video comprising an audio source track. Marks to indicate positions for consecutive interpreting can optionally be added to the source track and/or be made visible to the interpreter.
- marks for points of time of interest such as for example difficult words, specific vocabulary, or parts of the exercise that require special attention from the interpreters, etc.
- marks for points of time of interest can be added to the source track.
- a teacher or student interpreter can use the same application in review mode to play back the source track alongside the submitted interpreted versions.
- the teacher can make use of feedback marks to add remarks about the interpretation of the student interpreter.
- An interpreter can insert marks to a recorded interpreted segment by clicking on a specific position on the recorded interpreted segment waveform representation in the media client. In a popup window, a mark label can then be set. Together with the position, in milliseconds, or seconds, or frames, relative to the total exercise length, this information is also stored in the data, such as for example a metadata file. Alternatively, the position(s) of the marks on the recorded sequence of interpreted segments are in relation to the recorded sequence of interpreted segments only, i.e. without any relationship to the total exercise length.
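A mark record and the two position conventions mentioned above (relative to the total exercise length, or relative to the recorded sequence only) might look like the following sketch; the field names are assumptions, not from the patent:

```python
from dataclasses import dataclass

@dataclass
class Mark:
    kind: str         # e.g. "attention", "consecutive interpretation",
                      # "feedback" or "student"
    label: str        # text entered in the popup window
    position_ms: int  # relative to the total exercise length

def recording_relative(mark: Mark, recording_start_ms: int) -> int:
    # Alternative convention: position relative to the recorded
    # sequence of interpreted segments only.
    return mark.position_ms - recording_start_ms
```

A student mark at 15 000 ms on an exercise whose recorded sequence starts at 12 000 ms sits 3 000 ms into that sequence.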
- "attention" marks are placed by a teacher in for example a preparation or a review mode onto the source waveform to indicate for example something interesting in the source media. Such a mark will for example remain visible only to a teacher;
- "consecutive interpretation" marks are placed by a teacher in for example a preparation mode onto the source waveform to indicate the end of a sentence or the end of a part of a sentence, at the locations where the speaker is assumed to pause and the interpreter is assumed to start translating. Such a mark will be visible to for example both teacher and interpreter;
- "feedback" marks are placed by a teacher in for example a review mode onto the recorded waveforms to give a remark about the interpretation. Such a mark will be visible to both teacher and interpreter in review mode;
- "student" marks are placed by a student interpreter in for example a review mode onto the recorded waveforms to give some information about the interpretation. Such a mark will be visible to both teacher and student interpreter in review mode.
- the media client is further configured to store the marks in the data.
- a teacher creates a project, for example an IQF project, based on an audio and/or video source track.
- the teacher can also set the project name, the interpreting mode (simultaneous or consecutive), the interpreting media type (audio or audio/video) and, optionally, a list of marks.
- These settings are all stored as metadata in the IQF project.
- the resulting IQF project for example comprises these settings.
- the settings are saved in an encrypted and compressed container comprising the source media and the data, such as for example a metadata file.
- the teacher can define or insert marks on the source media by clicking on a specific position on the timeline or waveform representing the source media.
- a mark type and a label can for example be set.
- this information is also stored in the data, such as for example in a metadata file.
- An interpreter typically starts by listening to a part of the source media. When the source media reaches for example a "consecutive interpretation" mark, the source media is paused.
- the recording of the microphone, i.e. audio
- the webcam, i.e. video
- the 'Start Recording' button is enabled. This button becomes visible on the media client or it changes for example from an idle color to an active color. The interpreter can then click the 'Start Recording' button. Only then is the recording of the microphone, i.e. audio, and optionally the webcam, i.e. video, started.
- When the interpreter has finished the interpretation of the first source segment, the interpreter can click the 'Stop recording' or 'Continue playback' button.
- the interpreter will see a 'Continue playback' button.
- the interpreter will see a 'Stop recording' button on the media client.
- a flat line indicating silence is being appended to the representation of the source media on the media client.
- the index and the duration of the recording of the first interpreted segment are for example added to a recordings list in the user interface of the media client.
- the media client is further configured to record again the first interpreted segment from the first point in time, thereby generating an alternative for the first interpreted segment.
- the interpreter can record multiple versions, optionally merge them, and then select a version to submit. This way, the interpreter can correct or improve his or her interpretation before submitting it for review and/or for publishing, thereby generally improving the quality of the interpretation of the source media.
- the interpreter can make as many recorded versions as necessary. Two or more recorded versions can be merged by selecting them in the versions list and clicking the 'Merge' menu item in the media client. The recorded interpreted segments are then merged into a new sequence of recorded interpreted segments using for example ffmpeg. The metadata of the original interpreted segments is used to determine their position in the new sequence. New metadata is also generated to describe the start times and durations of the slices of the resulting sequence. All the recorded interpreted segments, together with the data and the source media, can for example subsequently be saved into a project file, such as for example an IQF project file.
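The metadata side of such a merge (the audio itself would be concatenated with a tool such as ffmpeg) can be sketched as combining the slice lists of the selected versions. The rule that a later version replaces an earlier slice with the same start time is an assumption, not stated in the patent:

```python
def merge_metadata(versions):
    """Each version is a list of (start, duration) slices relative to the
    source media. Combine them into one slice list, sorted by start time;
    when two versions contain a slice with the same start, the later
    selected version wins (assumed merge rule)."""
    merged = {}
    for slices in versions:
        for start, duration in slices:
            merged[start] = duration
    return sorted(merged.items())
```

Merging a first version with slices at 0.0 s and 10.0 s with a second version that re-records the 10.0 s slice keeps the first slice and takes the re-recorded one.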
- the media client is further configured to choose between the first interpreted segment and the alternative for the first interpreted segment, thereby generating an updated first interpreted segment;
- the media client is further configured to:
- the media client displays in a very intuitive manner that the source media is being interpreted. Also, the media client displays in a very intuitive manner which source segment an interpreted segment corresponds to.
- the media client is further configured to display an end line after a last source segment of the source media when a duration of the sequence of interpreted segments comprising a consecutive interpretation into the second language of the source media is larger than a duration of the source media.
- the media client is further configured to simultaneously play back one or more source segments of the source media and record one or more simultaneous interpreted segments of the source segments into the second language.
- the media client allows users to prepare interpreting exercises and send them to interpreters, to record the performances of interpreters in simultaneous interpreting exercises, and to evaluate interpretations and add feedback for the interpreters in the recording.
- Data comprising a start time and duration of the recording of the sequence of interpreted segments is saved.
- the index and the duration of the recording can then for example be added to a recordings list.
- a resulting recorded sequence of interpreted segments consists of one audio or video file which is continuous and without interruptions.
- Information on where the sequence of interpreted segments is exactly positioned with respect to the source media can be saved, for example, in the data. This information comprises, for example, a start time in relation to the source media and a duration.
- the interpreter can insert student marks into a recorded track by clicking on a specific position in the recorded track's waveform representation. In a popup window displayed on the media client, a mark label can then be set. Together with the position, in milliseconds, relative to the total exercise length, this information is also stored in the data.
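Storing such a mark amounts to converting the click position on the waveform into milliseconds relative to the total exercise length. A minimal sketch, assuming marks are kept in a simple dictionary; the function name and field names are hypothetical, not from the source:

```python
def add_mark(data: dict, click_fraction: float, total_ms: int, label: str) -> None:
    """Store a feedback mark at a clicked position on the recorded waveform.

    click_fraction is the horizontal click position as a fraction of the
    waveform width (0.0 = start, 1.0 = end). The mark's position is stored
    in milliseconds relative to the total exercise length, with its label.
    """
    position_ms = round(click_fraction * total_ms)
    data.setdefault("marks", []).append({"label": label, "position_ms": position_ms})

# Click at 25% of the waveform of a 2-minute exercise
data = {}
add_mark(data, 0.25, 120_000, "hesitation")
```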
- the source media plays without interruption.
- the recorded sequence of interpreted segments starts playing concurrently with the source media when the start time of the sequence of interpreted segments is met.
- the recorded sequence of interpreted segments stops playing when the position of the time mark is past the start time plus the duration of the recorded sequence of interpreted segments.
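The playback rules above reduce to a single predicate on the current position of the time mark. A minimal sketch; the function name and the half-open interval convention are illustrative assumptions:

```python
def interpreted_track_playing(t: float, start: float, duration: float) -> bool:
    """Return True while the recorded sequence of interpreted segments
    should be audible during playback of the source media.

    The source media itself plays without interruption; the recorded
    track starts when the time mark t meets its start time and stops
    once t passes the start time plus the recording's duration.
    """
    return start <= t < start + duration

# A recording positioned at 12.0 s into the source, lasting 30.0 s:
assert not interpreted_track_playing(5.0, 12.0, 30.0)   # before the start time
assert interpreted_track_playing(20.0, 12.0, 30.0)      # concurrent playback
assert not interpreted_track_playing(45.0, 12.0, 30.0)  # past start + duration
```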
- the media client is further configured to, when a duration of the one or more simultaneous interpreted segments is larger than a duration of the source segments, display an end line after the source waveform.
- the data comprises the first point in time and the second point in time or a duration of the first interpreted segment determined between the second point in time and the first point in time.
- the computer-implemented method according to the present disclosure allows users to prepare interpreting exercises and send them to interpreters, to record the performances of interpreters in consecutive interpreting exercises, and to evaluate interpretations and add feedback for the interpreters in the recording. Teachers or peers can evaluate these recorded interpretations rapidly and easily.
- the method according to the present disclosure evaluates not only the spoken interpretation, but also the way the interpreter presents the message. Indeed, by capturing not only the interpreter's microphone but also, for example, the interpreter's webcam, the performance can be properly evaluated in all respects.
- the method according to the present disclosure allows a more emotional and fluent delivery of the interpretation.
- the consecutive interpreter has more time to prepare for the nuances of the language and to choose the words carefully, thereby improving the quality of the interpretation.
- the method according to the present disclosure makes it easy for interpreters to check the quality of their interpretation as well as the conformity with the source media.
- An interpreter and/or a teacher can quickly jump to points in time of interest in the source media. This is done in a user-friendly and intuitive manner.
- the method allows recording a sequence of interpreted segments of a source media. The first point in time and the second point in time are determined with respect to the source media. The steps of this method are repeated for all the interpreted segments, thereby recording a sequence of interpreted segments.
- the method further comprises the steps of:
- the source media and the sequence of interpreted segments are recorded in two separate files.
- the method according to the present disclosure is a dual-track recorder: the source media and the interpreted segments are recorded on separate tracks.
- the step of pausing the playback of the first source segment and the step of starting to record the first interpreted segment are simultaneous.
- the step of reaching the end of the first interpreted segment and the step of playing back the second source segment of the source media starting from the first point in time are simultaneous.
- the method further comprises the step of defining a mark at one or more predetermined points in time in the source media.
- the method further comprises the step of storing the marks in the data.
- the method further comprises the step of recording again the first interpreted segment from the first point in time, thereby generating a copy of the first interpreted segment.
- a media client comprising at least one processor and at least one memory including computer program code, the at least one memory and computer program code configured to, with the at least one processor, cause the media client to perform the method according to a first aspect of the present disclosure.
- a computer program product comprising computer-executable instructions for causing a media client to perform at least the method according to a first example aspect of the present disclosure.
- a computer readable storage medium comprising computer-executable instructions for performing the method according to a first example aspect of the present disclosure when the program is run on a computer.
- FIG. 1 depicts an example embodiment of a media client according to the present disclosure.
- FIG. 2 depicts an example embodiment of a source media and of a sequence of interpreted segments according to the present disclosure.
- FIG. 3 depicts an example embodiment of a consecutive interpretation with the media client according to the present disclosure.
- FIG. 4 depicts an example embodiment of a simultaneous interpretation with the media client according to the present disclosure.
- FIG. 5 depicts an example embodiment of a computer-implemented method according to the present disclosure.
- FIG. 6 shows an example embodiment of a suitable computing system for performing one or several steps in embodiments of the invention.
- FIG. 1 illustrates an example embodiment of a media client 1 according to the present disclosure for recording a sequence 20 of interpreted segments 201; 202, wherein each interpreted segment 201; 202 comprises a consecutive or a simultaneous interpretation into a second language of a source segment 101; 102 of a source media 10 comprising a first language.
- the media client 1 is configured to play back a first source segment 101 of the source media 10 comprising the first language.
- the media client is further configured to:
- the media client 1 is further configured to:
- the media client 1 pauses the playback of the first source segment 101 and starts recording the first interpreted segment 201 simultaneously.
- the media client 1 plays back the second source segment 102 of the source media 10 starting from the first point in time 91 when the duration 7 of the first interpreted segment 201 is reached.
- the media client 1 allows a user of the media client 1 to define a mark 50 at one or more predetermined points in time 80 in the source media 10.
- the media client 1 can optionally store the marks 50 in the data 30.
- the media client 1 can record again the first interpreted segment 201 from the first point in time 91, thereby generating a copy of the first interpreted segment.
- the media client 1 can:
- the media client 1 is further configured to:
- the media client 1 is further configured to display an end line after a last source segment of said source media 10 when a duration of the sequence of interpreted segments comprising a consecutive interpretation into the second language of the source media 10 is larger than a duration of the source media 10.
- the media client 1 is further configured to simultaneously play back the source media 10 and record a simultaneous interpreted segment of the source media 10 into the second language, and when a duration of the simultaneous interpreted segment is larger than a duration of the source media 10, the media client 1 is further configured to display an end line 63 after the source waveform 60.
- the media client 1 is further configured to simultaneously play back the source media 10 and record a simultaneous live interpreted segment 201; 202 of the source media 10 into the second language.
- FIG. 2 illustrates an example embodiment of a sequence 20 of interpreted segments 201; 202, wherein each interpreted segment 201; 202 comprises a consecutive or a simultaneous interpretation into a second language of a source segment 101; 102 of a source media 10 comprising a first language.
- Data 30 on the sequence of interpreted segments 201; 202 is stored, for example, in the media client 1 of Fig. 1.
- the data 30 comprises one or more start points in time 9 of interpreted segments with respect to the source media 10, such as for example the first point in time 91 and the second point in time 92. The data 30 further comprises a duration 6 of the interpreted segments, such as the duration 7 of the first interpreted segment 201, calculated between the second point in time 92 and the first point in time 91, and the duration 8 of the second interpreted segment 202, calculated between a fourth point in time 94 and a third point in time 93.
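The data stored per interpreted segment thus pairs a start point in time with a duration derived from two points in time. A minimal sketch of that record; the class name, field names, and the numeric values are illustrative assumptions, not taken from the source:

```python
from dataclasses import dataclass

@dataclass
class SegmentData:
    start: float     # start point in time with respect to the source media (s)
    duration: float  # calculated between the segment's end and start points (s)

def record_segment_data(first_point: float, second_point: float) -> SegmentData:
    """Build the stored data for one interpreted segment: its start point
    in time and the duration calculated between the two points in time."""
    return SegmentData(start=first_point, duration=second_point - first_point)

# E.g. source paused at 14.0 s, interpretation recorded until 19.5 s
d1 = record_segment_data(14.0, 19.5)
```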
- FIG. 3 illustrates an example embodiment of a consecutive interpretation by the media client 1 of Fig. 1.
- a source media 10 comprises one or more source segments 101 ; 102.
- the first source segment starts in time at T1.
- the media client then plays back the first source segment 101 comprising a first language from the moment 90 to the first point in time 91.
- the media client pauses the play back of the first source segment 101 and records a live interpretation of the source segment 101 by a user of the media client.
- the media client records a first interpreted segment 201 comprising a consecutive interpretation into a second language different from the first language of the first source segment 101.
- the media client pauses the recording of the first interpreted segment 201, for example when the user of the media client is done with interpreting the first source segment 101, and the media client plays back a second source segment 102 of the source media 10 starting again from the first point in time 91.
- the media client determines a duration 7 of the first interpreted segment 201 by calculating a time difference between the second point in time 92 and the first point in time 91, and saves this duration 7 as a duration 6 in data 30.
- the media client then stores data 30, for example metadata related to the first interpreted segment 201, wherein the metadata 30 comprises a start time 9 of the first interpreted segment 201 with respect to the source media 10 and a duration 7 of the first interpreted segment 201.
- here, the start time 9 equals the first point in time 91 and the duration 7 equals 92 - 91.
- the media client pauses the play back of the second source segment 102 and records a second interpreted segment 202 comprising a consecutive interpretation into another language of the second source segment 102. This other language is different from the first language, but it can be identical to or different from the second language.
- the media client pauses the recording of the second interpreted segment 202, for example when the user of the media client is done with interpreting the second source segment 102, and the media client plays back a third source segment of the source media 10 starting again from the other point in time 93.
- the media client determines a duration 8 of the second interpreted segment 202 by calculating a time difference between the point in time 94 and the other point in time 93, and saves this duration 8 as a duration 6 in data 30.
- the media client then stores data 30, for example metadata related to the second interpreted segment 202, wherein the metadata 30 comprises a start time 9 of the second interpreted segment 202 with respect to the source media 10 and a duration 8 of the second interpreted segment 202.
- here, the start time 9 equals the point in time 93 and the duration 8 equals 94 - 93.
- the media client of Fig. 1 repeats this method for all the interpreted segments, thereby recording a sequence 20 of interpreted segments, and is then able to play back an alternating media comprising source segments 101; 102 and interpreted segments 201; 202 alternating during playback.
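Given the stored pause points and segment durations, the alternating playback above can be derived as a schedule of source and interpreted entries. A minimal sketch under assumed data shapes (the function name and tuple layout are illustrative):

```python
def alternating_timeline(pause_points, segment_durations, source_end):
    """Build an alternating playback schedule from the stored data.

    pause_points: points in time (in the source media) where playback was
    paused and an interpreted segment was recorded.
    segment_durations: duration of each recorded interpreted segment.
    Returns a list of ("source" | "interpreted", start, duration) entries,
    where start is relative to the source media.
    """
    timeline = []
    position = 0.0
    for point, dur in zip(pause_points, segment_durations):
        timeline.append(("source", position, point - position))  # play source up to the pause
        timeline.append(("interpreted", point, dur))             # play the recorded segment
        position = point                                         # source resumes from the pause point
    if position < source_end:
        timeline.append(("source", position, source_end - position))
    return timeline

# A 60 s source, paused at 20 s and 45 s for interpretations of 25 s and 30 s:
schedule = alternating_timeline([20.0, 45.0], [25.0, 30.0], 60.0)
```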
- the source segments comprise a first language and the interpreted segments comprise a second language different from the first language.
- FIG. 4 illustrates an example embodiment of a simultaneous interpretation by the media client 1 of Fig. 1.
- a source media 10 comprises one source segment 101.
- the media client then plays back the source segment 101 of the source media 10 comprising a first language from the moment 90 to the first point in time 91.
- the media client records a live interpretation of the source segment 101 by a user of the media client.
- the media client records a first interpreted segment 201 comprising a simultaneous interpretation into a second language different from the first language of the first source segment 101.
- the source media 10 is over and the media client continues the recording of a live interpretation 201 of the source segment 101 by a user of the media client until the second point in time 92.
- the media client determines a duration 7 of the first interpreted segment 201 by calculating a time difference between the first point in time 91 and the moment 90, and saves this duration 7 as a duration 6 in data 30.
- the media client then stores data 30, for example metadata related to the first interpreted segment 201 , wherein the metadata 30 comprises a start time 9 of the first interpreted segment 201 with respect to the source media 10 and a duration 7 of the first interpreted segment 201.
- the media client is further configured to simultaneously play back the source media 10 and record a simultaneous interpreted segment of the source media 10 into the second language, and when a duration of the simultaneous interpreted segment is larger than a duration of the source media 10, the media client 1 is further configured to display an end line 64 after the source waveform 60.
- FIG. 5 schematically illustrates the steps of a method according to an example embodiment for recording a sequence of interpreted segments 201; 202, wherein each interpreted segment 201; 202 comprises a consecutive interpretation into a second language of a source segment 101; 102 of a source media 10 comprising a first language.
- in a first step 501, a first source segment 101 of the source media 10 comprising the first language is played back.
- in a second step 502, consecutive to the first step 501 and at a first point in time 91, the playback of the first source segment 101 is paused and a first interpreted segment 201 comprising a consecutive interpretation into the second language of the first source segment 101 is recorded.
- in a third step 503, consecutive to the second step 502 and at a second point in time 92, the recording of the first interpreted segment 201 is paused and a second source segment 102 of the source media 10 is played back starting from the first point in time 91.
- in a fourth step 504, consecutive to the third step 503, a duration 7 of the first interpreted segment 201 is determined between the second point in time 92 and the first point in time 91.
- data 30 on the first interpreted segment 201 is stored, wherein the data 30 comprises the first point in time 91 and the duration 7 of the first interpreted segment 201.
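The steps 501 to 504 for one interpreted segment can be sketched as follows. The event names stand in for a hypothetical player/recorder API and are not from the source; only the timing logic and the stored data mirror the steps above:

```python
def consecutive_recording_steps(first_point: float, second_point: float):
    """Sketch of steps 501-504 for one interpreted segment.

    first_point:  point in time 91, where source playback is paused.
    second_point: point in time 92, where recording is paused.
    Returns the event trace and the data stored in the final step.
    """
    events = []
    events.append(("play_source", 0.0, first_point))         # step 501: play back source segment
    events.append(("pause_source_and_record", first_point))  # step 502: pause source, record interpretation
    events.append(("pause_record_and_resume", second_point)) # step 503: pause recording, resume source
    duration = second_point - first_point                    # step 504: determine the duration
    data = {"start": first_point, "duration": duration}      # stored data on the segment
    return events, data

events, data = consecutive_recording_steps(14.0, 19.5)
```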
- FIG. 6 shows a suitable computing system 800 for implementing embodiments of the system.
- Computing system 800 may in general be formed as a suitable general-purpose computer and comprise a bus 810, a processor 802, a local memory 804, one or more optional input interfaces 814, one or more optional output interfaces 816, a communication interface 812, a storage element interface 806, and one or more storage elements 808.
- Bus 810 may comprise one or more conductors that permit communication among the components of the computing system 800.
- Processor 802 may include any type of conventional processor or microprocessor that interprets and executes programming instructions.
- Local memory 804 may include a random-access memory (RAM) or another type of dynamic storage device that stores information and instructions for execution by processor 802 and/or a read only memory (ROM) or another type of static storage device that stores static information and instructions for use by processor 802.
- Input interface 814 may comprise one or more conventional mechanisms that permit an operator or user to input information to the computing device 800, such as a keyboard 820, a mouse 830, a pen, voice recognition and/or biometric mechanisms, a camera, etc.
- Output interface 816 may comprise one or more conventional mechanisms that output information to the operator or user, such as a display 840, etc.
- Communication interface 812 may comprise any transceiver-like mechanism such as for example one or more Ethernet interfaces that enables computing system 800 to communicate with other devices and/or systems, for example with other computing devices 881, 882, 883.
- the communication interface 812 of computing system 800 may be connected to such another computing system by means of a local area network (LAN) or a wide area network (WAN) such as for example the internet.
- Storage element interface 806 may comprise a storage interface such as for example a Serial Advanced Technology Attachment (SATA) interface or a Small Computer System Interface (SCSI) for connecting bus 810 to one or more storage elements 808, such as one or more local disks, for example SATA disk drives, and control the reading and writing of data to and/or from these storage elements 808.
- computing system 800 could thus correspond to the media client 1 in an embodiment of the present disclosure, for example as illustrated by Fig. 1.
- circuitry may refer to one or more or all of the following:
- circuitry also covers an implementation of merely a hardware circuit or processor (or multiple processors) or portion of a hardware circuit or processor and its (or their) accompanying software and/or firmware.
- circuitry also covers, for example and if applicable to the particular claim element, a baseband integrated circuit or processor integrated circuit for a mobile device or a similar integrated circuit in a server, a cellular network device, or other computing or network device.
- top, bottom, over, under, and the like are introduced for descriptive purposes and not necessarily to denote relative positions. It is to be understood that the terms so used are interchangeable under appropriate circumstances and embodiments of the invention are capable of operating according to the present invention in other sequences, or in orientations different from the one(s) described or illustrated above.
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Electrically Operated Instructional Devices (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962879664P | 2019-07-29 | 2019-07-29 | |
PCT/EP2020/069661 WO2021018555A1 (en) | 2019-07-29 | 2020-07-10 | Media client for recording and playing back interpretation |
Publications (1)
Publication Number | Publication Date |
---|---|
EP4004918A1 true EP4004918A1 (en) | 2022-06-01 |
Family
ID=71579624
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP20739675.5A Pending EP4004918A1 (en) | 2019-07-29 | 2020-07-10 | Media client for recording and playing back interpretation |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4004918A1 (en) |
AU (1) | AU2020321316A1 (en) |
WO (1) | WO2021018555A1 (en) |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100399999B1 (en) * | 2001-02-05 | 2003-09-29 | 삼성전자주식회사 | Recording medium containing multi-stream recorded thereon, recording apparatus, recording method therefor, reproducing apparatus, and reproducing method therefor |
US20070106516A1 (en) * | 2005-11-10 | 2007-05-10 | International Business Machines Corporation | Creating alternative audio via closed caption data |
LU91549A2 (en) * | 2009-04-07 | 2010-10-07 | European Community | Online speech repository and client tool therefor |
US8744239B2 (en) * | 2010-08-06 | 2014-06-03 | Apple Inc. | Teleprompter tool for voice-over tool |
- 2020
- 2020-07-10 EP EP20739675.5A patent/EP4004918A1/en active Pending
- 2020-07-10 AU AU2020321316A patent/AU2020321316A1/en active Pending
- 2020-07-10 WO PCT/EP2020/069661 patent/WO2021018555A1/en unknown
Also Published As
Publication number | Publication date |
---|---|
WO2021018555A1 (en) | 2021-02-04 |
AU2020321316A1 (en) | 2022-02-17 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: UNKNOWN |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
| 17P | Request for examination filed | Effective date: 20220228 |
| AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
| DAV | Request for validation of the european patent (deleted) | |
| DAX | Request for extension of the european patent (deleted) | |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: EXAMINATION IS IN PROGRESS |