CN107527623A - Screen transmission method, device, electronic equipment and computer-readable recording medium - Google Patents

Screen transmission method, device, electronic equipment and computer-readable recording medium Download PDF

Info

Publication number
CN107527623A
CN107527623A CN201710666179.8A CN201710666179A CN107527623A CN 107527623 A CN107527623 A CN 107527623A CN 201710666179 A CN201710666179 A CN 201710666179A CN 107527623 A CN107527623 A CN 107527623A
Authority
CN
China
Prior art keywords
acoustic information
spokesman
text message
screen
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710666179.8A
Other languages
Chinese (zh)
Other versions
CN107527623B (en
Inventor
欧阳宇基
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Guangzhou Shizhen Information Technology Co Ltd
Original Assignee
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Guangzhou Shizhen Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Shiyuan Electronics Thecnology Co Ltd, Guangzhou Shizhen Information Technology Co Ltd filed Critical Guangzhou Shiyuan Electronics Thecnology Co Ltd
Priority to CN201710666179.8A priority Critical patent/CN107527623B/en
Priority to PCT/CN2017/116067 priority patent/WO2019029073A1/en
Publication of CN107527623A publication Critical patent/CN107527623A/en
Application granted granted Critical
Publication of CN107527623B publication Critical patent/CN107527623B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14Digital output to display device ; Cooperation and interconnection of the display device with other functional units
    • G06F3/1454Digital output to display device ; Cooperation and interconnection of the display device with other functional units involving copying of the display data of a local workstation or window to a remote workstation or window so that an actual copy of the data is displayed simultaneously on two or more displays, e.g. teledisplay
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02087Noise filtering the noise being separate speech, e.g. cocktail party

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Circuits Of Receivers In General (AREA)

Abstract

The present invention provides a kind of screen transmission method, device, electronic equipment and computer-readable recording medium, the acoustic information for the surrounding environment that the screen transmission method is gathered and sended over by receiving source equipment, with reference to the acoustic information for the surrounding environment that itself is gathered, acoustic information corresponding with same spokesman is converted into by one text information by feature recognition, the text message is rendered in screen picture is thrown.The source proximity of devices generally held due to spokesman with oneself, the sound that thus the source equipment collects the spokesman in acoustic information is apparent, improve the accuracy that each spokesman is distinguished from acoustic information, it is easy to, according to acoustic information converting text information corresponding to same spokesman, improve the accuracy of speech recognition.

Description

Screen transmission method, device, electronic equipment and computer-readable recording medium
Technical field
The present invention relates to passing to shield technical field, more particularly to a kind of screen transmission method, device, electronic equipment and computer-readable Storage medium.
Background technology
Pass screen technology be primarily referred to as by mobile phone, apparatus such as computer screen on the content that shows and the sound (desktop of broadcasting Data) it is synchronized to the technology that the display devices such as projecting apparatus, television set, meeting flat board are shown.Mobile phone, apparatus such as computer have The advantages such as easy to operate, disposal ability is strong, and the display device such as meeting flat board the has advantage such as screen is big, audio is good, pass through biography The advantage that screen technology can possesses both combines, and is widely used under the scenes such as meeting.
By taking conference scenario as an example, personnel participating in the meeting may use different language, accent, word speed, cause other personnels participating in the meeting Possibly conferencing information can not be understood completely.Although current speech recognition can convert speech into captions, begged in meeting More mouthfuls of people is miscellaneous during, and the captions of generation are also chaotic, and therefore, application effect of the speech recognition in conference scenario is not It is good so that the information distortion or omission issued, discussed in meeting, to reduce the efficiency of meeting communication.
The content of the invention
In view of this, the present invention provides a kind of screen transmission method, device, electronic equipment and computer-readable recording medium, with Overcome the problem of applying speech recognition ineffective in current conference scenario.
Specifically, the present invention is achieved through the following technical solutions:
A kind of screen transmission method, comprises the following steps:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, the surrounding environment gathered with reference to itself Acoustic information, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
The text message is rendered in screen picture is thrown.
In one embodiment, the acoustic information for receiving the surrounding environment that source equipment is gathered and sended over, with reference to The acoustic information of the surrounding environment of itself collection, text envelope corresponding with spokesman is converted into by feature recognition by acoustic information The step of breath, includes:
The acoustic information of the surrounding environment of itself collection is analyzed and processed, changed acoustic information according to feature recognition Into the first text message corresponding with spokesman;
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, according to the acoustic information to the first text This information is corrected.
In one embodiment, the acoustic information for receiving the surrounding environment that source equipment is gathered and sended over, with reference to The acoustic information of the surrounding environment of itself collection, text envelope corresponding with spokesman is converted into by feature recognition by acoustic information The step of breath, includes:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, by the acoustic information and itself collection The acoustic information of surrounding environment analyzed and processed, extract acoustic information corresponding with spokesman;
Acoustic information corresponding with spokesman is converted into text message.
In one embodiment, the acoustic information by the acoustic information and the surrounding environment of itself collection is carried out at analysis The step of managing, extracting acoustic information corresponding with spokesman includes:
Using the acoustic information and reference information that from the acoustic information that source equipment receives as reference information, itself is gathered Correlation operation is carried out, ambient noise and/or the acoustic information of other spokesman is removed, extracts sound corresponding with single spokesman Message ceases.
In one embodiment, the screen picture of throwing is the picture that the desktop data sent to source equipment is shown gained Face, after described the step of extracting acoustic information corresponding with single spokesman, in addition to:
Acoustic information corresponding to spokesman is subjected to speech processes, the speech processes include gain process, attenuation processing, Volume corresponding to acoustic information is set to be in preset range;
It is according to timestamp that the acoustic information after processing is associated with desktop data.
In one embodiment, the text message includes at least one of:
Text message corresponding with acoustic information languages;
Text message corresponding with target language;
Text message corresponding with subject kind in acoustic information;
Text message corresponding with acoustic information languages.
In one embodiment, throw shield picture in render the text message the step of include:
Matching with different spokesman corresponding to renderer property, according to the renderer property throw screen picture in render the text This information;
Wherein, the renderer property includes at least one of:Font color, font size, font weight, display side Position, customized tags;The customized tags include following any:Underscore, word highlight color.
In one embodiment, throw shield picture in render the text message the step of include:
Matching sends the acoustic information of the surrounding environment transmitted by the source equipment of desktop data, and the acoustic information is corresponding Single spokesman as speaker, the text message of the speaker is focused on display in the form of being different from other spokesman.
It is described that acoustic information corresponding with same spokesman is converted into by same text by feature recognition in one embodiment After the step of this information, in addition to:
It is according to timestamp that text message is associated with desktop data.
The invention also discloses one kind to pass screen device, including:
Processing module, the acoustic information of surrounding environment for gathering and sending over for receiving source equipment, with reference to itself The acoustic information of the surrounding environment of collection, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
Rendering module, for rendering the text message in screen picture is thrown.
The invention also discloses a kind of electronic equipment, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as performing the screen transmission method as described in preceding any one.
The invention also discloses a kind of computer-readable recording medium, computer program is stored thereon with, the program is located Manage the screen transmission method realized when device performs as described in preceding any one.
The acoustic information for the surrounding environment that the present invention gather by receiving source equipment and sended over, with reference to itself collection Surrounding environment acoustic information, by feature recognition by acoustic information corresponding with same spokesman be converted into one text believe Breath, the text message is rendered in screen picture is thrown.Due to the source proximity of devices that spokesman generally holds with oneself, thus should The sound that source equipment collects the spokesman in acoustic information is apparent, improves and distinguishes each spokesman's from acoustic information Accuracy, it is easy to, according to acoustic information converting text information corresponding to same spokesman, improve the accuracy of speech recognition.
Brief description of the drawings
Fig. 1 is a kind of flow chart of screen transmission method shown in an exemplary embodiment of the invention;
Fig. 2 a are the exemplary plots of the conference scenario shown in an exemplary embodiment of the invention;
Fig. 2 b are the refinement exemplary plots of the processing method to acoustic information shown in an exemplary embodiment of the invention;
Fig. 2 c are the refinement exemplary plots of the processing method to acoustic information shown in an exemplary embodiment of the invention;
Fig. 3 is a kind of flow chart of screen transmission method shown in an exemplary embodiment of the invention;
Fig. 4 is a kind of design sketch for rendering text message shown in an exemplary embodiment of the invention;
Fig. 5 is the logic diagram of a kind of electronic equipment shown in an exemplary embodiment of the invention;
Fig. 6 is a kind of logic diagram of biography screen device shown in an exemplary embodiment of the invention.
Embodiment
Here exemplary embodiment will be illustrated in detail, its example is illustrated in the accompanying drawings.Following description is related to During accompanying drawing, unless otherwise indicated, the same numbers in different accompanying drawings represent same or analogous key element.Following exemplary embodiment Described in embodiment do not represent and the consistent all embodiments of the present invention.On the contrary, they be only with it is such as appended The example of the consistent apparatus and method of some aspects being described in detail in claims, of the invention.
It is only merely for the purpose of description specific embodiment in terminology used in the present invention, and is not intended to be limiting the present invention. It is also intended in " one kind " of the singulative of the invention with used in appended claims, " described " and "the" including majority Form, unless context clearly shows that other implications.It is also understood that term "and/or" used herein refers to and wrapped Containing the associated list items purpose of one or more, any or all may be combined.
It will be appreciated that though various information, but this may be described using term first, second, third, etc. in the present invention A little information should not necessarily be limited by these terms.These terms are only used for same type of information being distinguished from each other out.For example, do not departing from In the case of the scope of the invention, the first information can also be referred to as the second information, and similarly, the second information can also be referred to as One information.Depending on linguistic context, word as used in this " if " can be construed to " ... when " or " when ... When " or " in response to determining ".
The equipment such as meeting flat board are because with giant-screen, audio is good, supports the advantages such as handwriting input, in recent years, in meeting field Conjunction is widely used.As a rule, show on screen of the speaker by passing mobile phone, apparatus such as computer that screen technology uses oneself The content shown and the sound (desktop data) played are synchronized to the display devices such as meeting flat board and are shown.However, personnel participating in the meeting Different language, accent, word speed may be used, causes other personnels participating in the meeting possibly can not understand conferencing information completely.Current Although speech recognition can convert speech into captions, more mouthfuls of people is miscellaneous during session discussing, and the captions of generation are also Chaotic, therefore, application effect of the speech recognition in conference scenario is bad so that the information distortion issued, discussed in meeting Or omit, reduce the efficiency of meeting communication.
On the other hand, the present invention proposes a kind of screen transmission method, as shown in figure 1, this method includes:
S110, the acoustic information for receiving the surrounding environment that source equipment is gathered and sended over, the week gathered with reference to itself The acoustic information in collarette border, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
S120, render the text message in screen picture is thrown.
As a rule, the equipment such as meeting flat board (hereinafter referred to as passing screen equipment), which needs to be placed on, is easy to all participants to see The position seen, therefore pass position and the personnel participating in the meeting's holding certain distance of screen equipment.Show as shown in Figure 2 a for a mini-session Be intended to, 4 personnels participating in the meeting 230 along round table 240 successively fall sit, pass screen equipment 210 be placed on personnel participating in the meeting 230 opposite (for example, On wall), the speaker of meeting is configured with source equipment 220 (such as computer, microphone etc.), and other personnels participating in the meeting 230 can also Configure source equipment 220.Because all personnels participating in the meeting 230 may make a speech (currently will be referred to as spokesman in talker), respectively Spokesman's distance passes screen equipment 210 farther out, because sonic transmissions have decay, the noise of surrounding environment and depositing for other interference The quality that source equipment 220 gathers sound, and spokesman will be as a rule less than by passing screen equipment 210 and gathering the quality of sound When having supporting source equipment 220, the sound of the spokesman of the source equipment 220 collection can be apparent.
The unit 211 in screen equipment 210 is passed in Fig. 2 a and represents that microphone etc. can gather the sound of spokesman, certainly, also may be used It is of the invention that this is not made to be the equipment (for example, omnidirectional microphone etc.) for the external collection sound being connected with passing screen equipment 210 Limitation.Source equipment 220 can also gather the sound of spokesman, and the acoustic information of collection is sent to and passes screen equipment 210, pass Unit 212 in screen equipment 210 represents communicator, it is of course also possible to be that the modes such as bluetooth or wireless network send the sound Information, meanwhile, meeting speaker also passes through source equipment 220 and desktop data is sent into biography screen equipment 210, passes screen equipment 210 Show the content (throwing screen picture) of the desktop data of source equipment 220.
Pass screen equipment 210 and the acoustic information that itself is gathered and the acoustic information that source equipment 220 gathers are subjected to total score Analysis is handled, and so as to accurately identify the acoustic information of each spokesman, and acoustic information can be changed into text message (each spokesman can be directed to one text message is set, or the text message of each spokesman is recorded in a data), then will Text information is rendered in screen picture is thrown, and rendering effect can be similar to captions.
The acoustic information of itself collection is carried out to the mode of comprehensive analysis processing with the acoustic information that source equipment 220 gathers Can have it is a variety of, such as:
The acoustic information of the surrounding environment of itself collection is analyzed and processed, changed acoustic information according to feature recognition Into the first text message corresponding with spokesman;
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, according to the acoustic information to the first text This information is corrected.
As shown in Figure 2 b, pass screen equipment 210 and source equipment 220 gathers the acoustic information of surrounding environment simultaneously, pass screen and set Standby 210 are analyzed and processed the acoustic information 0# of itself collection, identified according to feature recognition from acoustic information 0# with respectively Acoustic information corresponding to spokesman and change into the first text message (can be directed to each spokesman set one first text envelope Cease, or the first text message of each spokesman recorded in a data), the acoustic information that source equipment 220 gathers itself 1#, which is sent to, passes screen equipment 210, passes screen equipment 210 and the first text message is corrected according to acoustic information 1#, so as to obtain The high text message of the degree of accuracy.Wherein, correcting mode can be that acoustic information 1# is converted into text message 1#, by the first text Information is compared with text message 1# to be corrected;Can also be that the first text message is answered by acoustic information 1# Core;The concrete mode for correcting text in the application by sound is not limited to this, can also use other correcting modes.
It is, of course, also possible to acoustic information is handled in the following way:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, by the acoustic information and itself collection The acoustic information of surrounding environment analyzed and processed, extract acoustic information corresponding with spokesman;
Acoustic information corresponding with spokesman is converted into text message.
As shown in Figure 2 c, pass screen equipment 210 and source equipment 220 gathers the acoustic information of surrounding environment simultaneously, source is set The acoustic information 1# of itself collection is sent to biography screen equipment 210 by standby 220, passes the acoustic information that screen equipment 210 gathers itself 0# is analyzed and processed with acoustic information 1#, so as to extract the acoustic information (acoustic information of extraction corresponding with each spokesman Can be for single spokesman or comprising multiple spokesman), will acoustic information conversion corresponding with spokesman Into text message, the acoustic information that noise even other spokesman have been filtered out due to the acoustic information of extraction (is designated as clean speech Information), clean speech can obtain in the following manner:Using from the acoustic information that source equipment 210 receives as reference information, Acoustic information and the reference information of itself collection are subjected to correlation operation, the method for correlation operation has a variety of, and designer can Selected according to actual use situation;So as to remove the acoustic information of ambient noise and/or other spokesman, so as to can Extract acoustic information (clean speech information) corresponding with single spokesman.Certainly, source equipment 210 can also be by collection Acoustic information is decayed and filtered etc. processing, only retains the acoustic information of the spokesman using the source equipment 210, then should Acoustic information is sent to screen equipment 210 is passed, and by improving the precision of reference information, can further be improved and single spokesman couple The purity for the acoustic information answered.
It is higher that the accuracy of text message is converted according to clean speech information, and clean speech information can also be done further Ground optimization processing.For example, the voice of part spokesman is too small, too big or the big minor swing of voice is larger, to this kind of acoustic information Auditory effect will be improved by carrying out speech processes, particularly be reviewed and (listened) after record screen and/or record screen data are sent into place remote When participating in the participant of meeting, processing procedure is as shown in Figure 3:
Acoustic information corresponding to spokesman is subjected to speech processes, the speech processes include gain process and/or decay Processing, makes volume corresponding to acoustic information be in preset range;
It is according to timestamp that the acoustic information after processing is associated with desktop data.
By in the speech processes of clean speech information to preset range, spike low ebb is removed, it is possible to increase auditory effect, when So, acoustic information is converted into text message again after volume adjustment can also being carried out, without the interference of spike low ebb, can also be carried Height is converted into the degree of accuracy of text message.
With Internationalization level more and more higher, multilingual, such as Chinese, English, day may be used in a meeting Language etc., so as to which text message can include at least one of:
Text message corresponding with acoustic information languages;For example, Chinese is changed into Chinese, English converts English, Sino-British Mixed changes into Chinese and English etc.;
Text message corresponding with target language;For example, target language is Chinese, then Chinese is changed into Chinese, English What conversion Chinese, China and Britain used with changes into Chinese etc.;
Text message corresponding with subject kind in acoustic information;For example, during subject kind is in Sino-British mixed acoustic information Text, then change into Chinese by what the China and Britain used with;Subject kind is English in the mixed acoustic information of China and Britain, then the China and Britain are mixed Change into English etc.;
Text message corresponding with acoustic information languages;For example, during time languages are in Sino-British mixed acoustic information Text, then change into Chinese by what the China and Britain used with;Time languages are English in the mixed acoustic information of China and Britain, then the China and Britain are mixed Change into English etc..
It is, of course, also possible to dialect is changed into text message corresponding to target language, for example, Guangdong language turns Chinese etc..
Due to there may be more people in meeting while making a speech, particularly during arguement etc., it may be difficult to whom tells and said What word, by previous embodiment, the present invention can be directed to each spokesman generate corresponding to text message, thus, When rendering the text message (captions) in throwing screen picture, it can distinguish different spokesman's using various forms of captions Text message:
Matching with different spokesman corresponding to renderer property, according to the renderer property throw screen picture in render the text This information;
Wherein, the renderer property includes at least one of:Font color, font size, font weight, display side Position, customized tags;The customized tags include following any:Underscore, word highlight color.
As shown in figure 4, the captions band background color of a spokesman, the captions of another spokesman are without background color, certainly, renderer property Species it is a lot, can also be using the mode such as different color.Can be by the captions of each spokesman corresponding to screen one Fixed position is shown, can not also fix the position of captions appearance, or the captions of the common spokesman of similar barrage show at both ends Show, the captions of speaker centre display etc..
As a rule, the speech content of the speaker of meeting is emphasis, therefore, can will text envelope corresponding with speaker Breath is focused on display in the form of being different from other spokesman.It is considered that the source equipment for sending desktop data is that speaker makes Source equipment, can according to MAC (Media Access Control, media access control) address of source equipment etc., It is speaker so as to distinguish which acoustic information, using corresponding to the acoustic information, single spokesman is as speaker, with area The text message of the speaker is not focused on display in the form of other spokesman.For example, the as shown in figure 4, captions with background color The captions of speaker are may be considered, the captions of no background color belong to common spokesman's.It is of course also possible to renderer property is changed, For example, renderer property corresponding with MAC Address is set, according to each spokesman of MAC Address differentiation for sending acoustic information and correspondingly Text message, and then for text message loading corresponding to renderer property be rendered into throwing screen picture in form captions.Though shown in Fig. 4 To throw the situation of screen (desktop data that a source equipment is only shown in biography screen equipment) by single speaker, but it is existing one at present Received in individual biography screen equipment and show the implementation of multiple source equipment desktop datas, it is clear that passed and one is shown in screen equipment Or the desktop data of multiple source equipment, do not change the use condition that the present invention passes screen scheme, therefore, the solution of the present invention Situation suitable for showing multiple source equipment desktop datas being passed screen equipment.
Meeting usually requires to carry out minutes, only picture and/or the sound such as conventional video recording or record screen, reviews video recording When it is uninteresting, and for being unfamiliar with the people of meeting situation, it is who says that light listening, which is difficult to what which said told, therefore, Invention proposition is associated with desktop data by text message according to timestamp, in the desktop data that subsequent playback is recorded, right The time answered text exhibition information successively, certainly, text information can also occur simultaneously with aforementioned sound information, so as to, Review the speech content that each spokesman is easily discernible during video recording;For example, there are red, black, blue three-color captions, correspond to respectively First, second, the third three spokesman, by by red captions are corresponding with the sound of first, the sound of black captions and second during viewing video recording Correspondingly, blue captions are corresponding with third sound, can easily differentiate the speech content of each spokesman.Certainly, which also may be used So that applied in teleconference, desktop data, acoustic information and/or the text message of local are sent into nonlocal equipment, increase Add strange land people with a part in a conference to understand the mode of conference content, improve relay session effect.
Corresponding with the embodiment of foregoing screen transmission method, present invention also offers the embodiment for passing screen device.
The embodiment that the present invention passes screen device can be applied on meeting flat board.Device embodiment can be real by software It is existing, it can also be realized by way of hardware or software and hardware combining.Exemplified by implemented in software, as on a logical meaning Device, it is to be read corresponding computer program instructions in nonvolatile memory by the processor of meeting flat board where it Operation is formed in internal memory.For hardware view, as shown in figure 5, one kind of meeting flat board where passing screen device for the present invention Hardware structure diagram, in addition to the processor shown in Fig. 5, internal memory, network interface and nonvolatile memory, in embodiment Meeting flat board where device can also include other hardware, this is repeated no more generally according to the actual functional capability of the biography screen.
Fig. 6 is refer to, the biography screen device 600 includes:
Processing module 610, the acoustic information of surrounding environment for gathering and sending over for receiving source equipment, with reference to from The acoustic information of the surrounding environment of body collection, text envelope corresponding with spokesman is converted into by feature recognition by acoustic information Breath;
Rendering module 620, for rendering the text message in screen picture is thrown.
Further, the invention also provides a kind of electronic equipment, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as performing the screen transmission method as described in preceding any one.
Further, the invention also provides a kind of computer-readable recording medium, computer program is stored thereon with, should The screen transmission method as described in preceding any one is realized when program is executed by processor.
Meeting flat board of the present invention, which has, passes screen function, and audio conversion text is added on the basis of screen function is passed in original The functions such as word, the function can be realized using existing transliteration software, and the transliteration result of funcall transliteration software is shielded by passing; Can also be by the function and service of transliteration software in screen function is passed;It is of course also possible to it can be realized according to actual conditions design is other The plug-in unit of the function, this is not limited by the present invention.
The function of unit and the implementation process of effect specifically refer to and step are corresponded in the above method in said apparatus Implementation process, it will not be repeated here.
For device embodiment, because it corresponds essentially to embodiment of the method, so related part is real referring to method Apply the part explanation of example.Device embodiment described above is only schematical, wherein described be used as separating component The unit of explanation can be or may not be physically separate, can be as the part that unit is shown or can also It is not physical location, you can with positioned at a place, or can also be distributed on multiple NEs.Can be according to reality Need to select some or all of module therein to realize the purpose of the present invention program.Those of ordinary skill in the art are not paying In the case of going out creative work, you can to understand and implement.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all essences in the present invention God any modification, equivalent substitution and improvements done etc., should be included within the scope of protection of the invention with principle.

Claims (12)

1. a kind of screen transmission method, it is characterised in that comprise the following steps:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, with reference to the sound for the surrounding environment that itself is gathered Message is ceased, and acoustic information is converted into text message corresponding with spokesman by feature recognition;
The text message is rendered in screen picture is thrown.
2. screen transmission method as claimed in claim 1, it is characterised in that the week for receiving source equipment and gathering and sending over The acoustic information in collarette border, with reference to the acoustic information for the surrounding environment that itself is gathered, acoustic information is changed by feature recognition Include into the step of text message corresponding with spokesman:
The acoustic information of surrounding environment of itself collection is analyzed and processed, according to feature recognition by acoustic information be converted into First text message corresponding to spokesman;
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, according to the acoustic information to the first text envelope Breath is corrected.
3. screen transmission method as claimed in claim 1, it is characterised in that the week for receiving source equipment and gathering and sending over The acoustic information in collarette border, with reference to the acoustic information for the surrounding environment that itself is gathered, acoustic information is changed by feature recognition Include into the step of text message corresponding with spokesman:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, by the acoustic information and the week of itself collection The acoustic information in collarette border is analyzed and processed, and extracts acoustic information corresponding with spokesman;
Acoustic information corresponding with spokesman is converted into text message.
4. screen transmission method as claimed in claim 3, it is characterised in that described by the acoustic information and surrounding's ring of itself collection The acoustic information in border is analyzed and processed, extract acoustic information corresponding with spokesman the step of include:
So that from the acoustic information that source equipment receives as reference information, the acoustic information of itself collection to be carried out with reference information Correlation operation, ambient noise and/or the acoustic information of other spokesman are removed, extract sound letter corresponding with single spokesman Breath.
5. screen transmission method as claimed in claim 3, it is characterised in that the screen picture of throwing is the table sent to source equipment Face data are shown the picture of gained, after described the step of extracting acoustic information corresponding with single spokesman, in addition to:
Acoustic information corresponding to spokesman is subjected to speech processes, the speech processes include gain process and/or attenuation processing, Volume corresponding to acoustic information is set to be in preset range;
It is according to timestamp that the acoustic information after processing is associated with desktop data.
6. screen transmission method as claimed in claim 1, it is characterised in that the text message includes at least one of:
Text message corresponding with acoustic information languages;
Text message corresponding with target language;
Text message corresponding with subject kind in acoustic information;
Text message corresponding with acoustic information languages.
7. the screen transmission method as any one of claim 1 to 6, it is characterised in that render the text in screen picture is thrown The step of this information, includes:
Matching with different spokesman corresponding to renderer property, according to the renderer property throw screen picture in render the text envelope Breath;
Wherein, the renderer property includes at least one of:It is font color, font size, font weight, display orientation, individual Propertyization marks;The customized tags include following any:Underscore, word highlight color.
8. screen transmission method as claimed in claim 7, it is characterised in that the step of rendering the text message in throwing screen picture Including:
Matching sends the acoustic information of the surrounding environment transmitted by the source equipment of desktop data, will be single corresponding to the acoustic information One spokesman focuses on display the text message of the speaker as speaker in the form of being different from other spokesman.
9. screen transmission method as claimed in claim 8, it is characterised in that it is described will be corresponding with same spokesman by feature recognition Acoustic information the step of being converted into one text information after, in addition to:
It is according to timestamp that text message is associated with desktop data.
10. one kind passes screen device, it is characterised in that including:
Processing module, the acoustic information of surrounding environment for gathering and sending over for receiving source equipment, is gathered with reference to itself Surrounding environment acoustic information, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
Rendering module, for rendering the text message in screen picture is thrown.
11. a kind of electronic equipment, it is characterised in that including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as performing the screen transmission method in the claim 1-9 described in any one.
12. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is by processor The screen transmission method as described in any one in claim 1-9 is realized during execution.
CN201710666179.8A 2017-08-07 2017-08-07 Screen transmission method and device, electronic equipment and computer readable storage medium Active CN107527623B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710666179.8A CN107527623B (en) 2017-08-07 2017-08-07 Screen transmission method and device, electronic equipment and computer readable storage medium
PCT/CN2017/116067 WO2019029073A1 (en) 2017-08-07 2017-12-14 Screen transmission method and apparatus, and electronic device, and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710666179.8A CN107527623B (en) 2017-08-07 2017-08-07 Screen transmission method and device, electronic equipment and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN107527623A true CN107527623A (en) 2017-12-29
CN107527623B CN107527623B (en) 2021-02-09

Family

ID=60680627

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710666179.8A Active CN107527623B (en) 2017-08-07 2017-08-07 Screen transmission method and device, electronic equipment and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN107527623B (en)
WO (1) WO2019029073A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109151642A (en) * 2018-09-05 2019-01-04 北京今链科技有限公司 A kind of intelligent earphone, intelligent earphone processing method, electronic equipment and storage medium
CN111770319A (en) * 2019-10-18 2020-10-13 北京沃东天骏信息技术有限公司 Projection method, device, system and storage medium
CN112019786A (en) * 2020-08-24 2020-12-01 上海松鼠课堂人工智能科技有限公司 Intelligent teaching screen recording method and system
CN112684967A (en) * 2021-03-11 2021-04-20 荣耀终端有限公司 Method for displaying subtitles and electronic equipment
CN112887781A (en) * 2021-01-27 2021-06-01 维沃移动通信有限公司 Subtitle processing method and device
WO2021233218A1 (en) * 2020-05-19 2021-11-25 华为技术有限公司 Screen casting method, screen casting source end, screen casting destination end, screen casting system and storage medium
CN113746911A (en) * 2021-08-26 2021-12-03 科大讯飞股份有限公司 Audio processing method and related device, electronic equipment and storage medium
CN114125358A (en) * 2021-11-11 2022-03-01 北京有竹居网络技术有限公司 Cloud conference subtitle display method, system, device, electronic equipment and storage medium
CN115052126A (en) * 2022-08-12 2022-09-13 深圳市稻兴实业有限公司 Ultra-high definition video conference analysis management system based on artificial intelligence

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111914115B (en) * 2019-05-08 2024-05-28 阿里巴巴集团控股有限公司 Sound information processing method and device and electronic equipment

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103456305A (en) * 2013-09-16 2013-12-18 东莞宇龙通信科技有限公司 Terminal and speech processing method based on multiple sound collecting units
CN104796584A (en) * 2015-04-23 2015-07-22 南京信息工程大学 Prompt device with voice recognition function
US20160092154A1 (en) * 2014-09-30 2016-03-31 International Business Machines Corporation Content mirroring
CN105704538A (en) * 2016-03-17 2016-06-22 广东小天才科技有限公司 Method and system for generating audio and video subtitles
CN105913845A (en) * 2016-04-26 2016-08-31 惠州Tcl移动通信有限公司 Mobile terminal voice recognition and subtitle generation method and system and mobile terminal
CN106297794A (en) * 2015-05-22 2017-01-04 西安中兴新软件有限责任公司 The conversion method of a kind of language and characters and equipment
CN106657865A (en) * 2016-12-16 2017-05-10 联想(北京)有限公司 Method and device for generating conference summary and video conference system
CN106910504A (en) * 2015-12-22 2017-06-30 北京君正集成电路股份有限公司 A kind of speech reminding method and device based on speech recognition

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120245936A1 (en) * 2011-03-25 2012-09-27 Bryan Treglia Device to Capture and Temporally Synchronize Aspects of a Conversation and Method and System Thereof
JP2014240940A (en) * 2013-06-12 2014-12-25 株式会社東芝 Dictation support device, method and program
CN205647778U (en) * 2016-04-01 2016-10-12 安徽听见科技有限公司 Intelligent conference system
CN106057193A (en) * 2016-07-13 2016-10-26 深圳市沃特沃德股份有限公司 Conference record generation method based on telephone conference and device
CN106911832B (en) * 2017-04-28 2020-06-02 四川音创伟业科技有限公司 Voice recording method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103456305A (en) * 2013-09-16 2013-12-18 东莞宇龙通信科技有限公司 Terminal and speech processing method based on multiple sound collecting units
US20160092154A1 (en) * 2014-09-30 2016-03-31 International Business Machines Corporation Content mirroring
CN104796584A (en) * 2015-04-23 2015-07-22 南京信息工程大学 Prompt device with voice recognition function
CN106297794A (en) * 2015-05-22 2017-01-04 西安中兴新软件有限责任公司 The conversion method of a kind of language and characters and equipment
CN106910504A (en) * 2015-12-22 2017-06-30 北京君正集成电路股份有限公司 A kind of speech reminding method and device based on speech recognition
CN105704538A (en) * 2016-03-17 2016-06-22 广东小天才科技有限公司 Method and system for generating audio and video subtitles
CN105913845A (en) * 2016-04-26 2016-08-31 惠州Tcl移动通信有限公司 Mobile terminal voice recognition and subtitle generation method and system and mobile terminal
CN106657865A (en) * 2016-12-16 2017-05-10 联想(北京)有限公司 Method and device for generating conference summary and video conference system

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109151642A (en) * 2018-09-05 2019-01-04 北京今链科技有限公司 A kind of intelligent earphone, intelligent earphone processing method, electronic equipment and storage medium
CN111770319A (en) * 2019-10-18 2020-10-13 北京沃东天骏信息技术有限公司 Projection method, device, system and storage medium
WO2021233218A1 (en) * 2020-05-19 2021-11-25 华为技术有限公司 Screen casting method, screen casting source end, screen casting destination end, screen casting system and storage medium
CN112019786A (en) * 2020-08-24 2020-12-01 上海松鼠课堂人工智能科技有限公司 Intelligent teaching screen recording method and system
CN112019786B (en) * 2020-08-24 2021-05-25 上海松鼠课堂人工智能科技有限公司 Intelligent teaching screen recording method and system
CN112887781A (en) * 2021-01-27 2021-06-01 维沃移动通信有限公司 Subtitle processing method and device
CN112684967A (en) * 2021-03-11 2021-04-20 荣耀终端有限公司 Method for displaying subtitles and electronic equipment
CN113746911A (en) * 2021-08-26 2021-12-03 科大讯飞股份有限公司 Audio processing method and related device, electronic equipment and storage medium
CN114125358A (en) * 2021-11-11 2022-03-01 北京有竹居网络技术有限公司 Cloud conference subtitle display method, system, device, electronic equipment and storage medium
CN115052126A (en) * 2022-08-12 2022-09-13 深圳市稻兴实业有限公司 Ultra-high definition video conference analysis management system based on artificial intelligence

Also Published As

Publication number Publication date
WO2019029073A1 (en) 2019-02-14
CN107527623B (en) 2021-02-09

Similar Documents

Publication Publication Date Title
CN107527623A (en) Screen transmission method, device, electronic equipment and computer-readable recording medium
CN101689365B (en) Method of controlling a video conference
CN103327181B (en) Voice chatting method capable of improving efficiency of voice information learning for users
US11650790B2 (en) Centrally controlling communication at a venue
CN109951743A (en) Barrage information processing method, system and computer equipment
CN111048093A (en) Conference sound box, conference recording method, device, system and computer storage medium
CN105245355A (en) Intelligent voice shorthand conference system
US10313502B2 (en) Automatically delaying playback of a message
US20210312143A1 (en) Real-time call translation system and method
CN111107283B (en) Information display method, electronic equipment and storage medium
US20230247131A1 (en) Presentation of communications
CN114531425B (en) Processing method and processing device
US10580410B2 (en) Transcription of communications
CN210091177U (en) Conference system for realizing synchronous translation
US11830120B2 (en) Speech image providing method and computing device for performing the same
KR20180068655A (en) apparatus and method for generating text based on audio signal
Shang et al. Audio recordings dataset of genuine and replayed speech at both ends of a telecommunication channel
CN110931001B (en) Anti-noise audio transmission device facing voice recognition
US20230421702A1 (en) Distributed teleconferencing using personalized enhancement models
JP2022113375A (en) Information processing method and monitoring system
CN113919299A (en) Summary text generation method, projection device and computer readable storage medium
Patil et al. MuteTrans: A communication medium for deaf
CN117636928A (en) Pickup device and related audio enhancement method
CN109889764A (en) Conference system
CN114530159A (en) Multimedia resource integration scheduling method based on WebRTC technology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant