CN107527623A - Screen transmission method, device, electronic equipment and computer-readable recording medium - Google Patents
Screen transmission method, device, electronic equipment and computer-readable recording medium Download PDFInfo
- Publication number
- CN107527623A CN107527623A CN201710666179.8A CN201710666179A CN107527623A CN 107527623 A CN107527623 A CN 107527623A CN 201710666179 A CN201710666179 A CN 201710666179A CN 107527623 A CN107527623 A CN 107527623A
- Authority
- CN
- China
- Prior art keywords
- acoustic information
- spokesman
- text message
- screen
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 46
- 230000005540 biological transmission Effects 0.000 title claims abstract description 27
- 230000008569 process Effects 0.000 claims description 13
- 238000012545 processing Methods 0.000 claims description 13
- 238000009877 rendering Methods 0.000 claims description 10
- 239000000284 extract Substances 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 4
- 230000006870 function Effects 0.000 description 8
- 230000000694 effects Effects 0.000 description 7
- 230000008859 change Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 241001269238 Data Species 0.000 description 2
- 230000000739 chaotic effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000003672 processing method Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 238000000151 deposition Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000000686 essence Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/14—Digital output to display device ; Cooperation and interconnection of the display device with other functional units
- G06F3/1454—Digital output to display device ; Cooperation and interconnection of the display device with other functional units involving copying of the display data of a local workstation or window to a remote workstation or window so that an actual copy of the data is displayed simultaneously on two or more displays, e.g. teledisplay
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Circuits Of Receivers In General (AREA)
Abstract
The present invention provides a kind of screen transmission method, device, electronic equipment and computer-readable recording medium, the acoustic information for the surrounding environment that the screen transmission method is gathered and sended over by receiving source equipment, with reference to the acoustic information for the surrounding environment that itself is gathered, acoustic information corresponding with same spokesman is converted into by one text information by feature recognition, the text message is rendered in screen picture is thrown.The source proximity of devices generally held due to spokesman with oneself, the sound that thus the source equipment collects the spokesman in acoustic information is apparent, improve the accuracy that each spokesman is distinguished from acoustic information, it is easy to, according to acoustic information converting text information corresponding to same spokesman, improve the accuracy of speech recognition.
Description
Technical field
The present invention relates to passing to shield technical field, more particularly to a kind of screen transmission method, device, electronic equipment and computer-readable
Storage medium.
Background technology
Pass screen technology be primarily referred to as by mobile phone, apparatus such as computer screen on the content that shows and the sound (desktop of broadcasting
Data) it is synchronized to the technology that the display devices such as projecting apparatus, television set, meeting flat board are shown.Mobile phone, apparatus such as computer have
The advantages such as easy to operate, disposal ability is strong, and the display device such as meeting flat board the has advantage such as screen is big, audio is good, pass through biography
The advantage that screen technology can possesses both combines, and is widely used under the scenes such as meeting.
By taking conference scenario as an example, personnel participating in the meeting may use different language, accent, word speed, cause other personnels participating in the meeting
Possibly conferencing information can not be understood completely.Although current speech recognition can convert speech into captions, begged in meeting
More mouthfuls of people is miscellaneous during, and the captions of generation are also chaotic, and therefore, application effect of the speech recognition in conference scenario is not
It is good so that the information distortion or omission issued, discussed in meeting, to reduce the efficiency of meeting communication.
The content of the invention
In view of this, the present invention provides a kind of screen transmission method, device, electronic equipment and computer-readable recording medium, with
Overcome the problem of applying speech recognition ineffective in current conference scenario.
Specifically, the present invention is achieved through the following technical solutions:
A kind of screen transmission method, comprises the following steps:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, the surrounding environment gathered with reference to itself
Acoustic information, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
The text message is rendered in screen picture is thrown.
In one embodiment, the acoustic information for receiving the surrounding environment that source equipment is gathered and sended over, with reference to
The acoustic information of the surrounding environment of itself collection, text envelope corresponding with spokesman is converted into by feature recognition by acoustic information
The step of breath, includes:
The acoustic information of the surrounding environment of itself collection is analyzed and processed, changed acoustic information according to feature recognition
Into the first text message corresponding with spokesman;
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, according to the acoustic information to the first text
This information is corrected.
In one embodiment, the acoustic information for receiving the surrounding environment that source equipment is gathered and sended over, with reference to
The acoustic information of the surrounding environment of itself collection, text envelope corresponding with spokesman is converted into by feature recognition by acoustic information
The step of breath, includes:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, by the acoustic information and itself collection
The acoustic information of surrounding environment analyzed and processed, extract acoustic information corresponding with spokesman;
Acoustic information corresponding with spokesman is converted into text message.
In one embodiment, the acoustic information by the acoustic information and the surrounding environment of itself collection is carried out at analysis
The step of managing, extracting acoustic information corresponding with spokesman includes:
Using the acoustic information and reference information that from the acoustic information that source equipment receives as reference information, itself is gathered
Correlation operation is carried out, ambient noise and/or the acoustic information of other spokesman is removed, extracts sound corresponding with single spokesman
Message ceases.
In one embodiment, the screen picture of throwing is the picture that the desktop data sent to source equipment is shown gained
Face, after described the step of extracting acoustic information corresponding with single spokesman, in addition to:
Acoustic information corresponding to spokesman is subjected to speech processes, the speech processes include gain process, attenuation processing,
Volume corresponding to acoustic information is set to be in preset range;
It is according to timestamp that the acoustic information after processing is associated with desktop data.
In one embodiment, the text message includes at least one of:
Text message corresponding with acoustic information languages;
Text message corresponding with target language;
Text message corresponding with subject kind in acoustic information;
Text message corresponding with acoustic information languages.
In one embodiment, throw shield picture in render the text message the step of include:
Matching with different spokesman corresponding to renderer property, according to the renderer property throw screen picture in render the text
This information;
Wherein, the renderer property includes at least one of:Font color, font size, font weight, display side
Position, customized tags;The customized tags include following any:Underscore, word highlight color.
In one embodiment, throw shield picture in render the text message the step of include:
Matching sends the acoustic information of the surrounding environment transmitted by the source equipment of desktop data, and the acoustic information is corresponding
Single spokesman as speaker, the text message of the speaker is focused on display in the form of being different from other spokesman.
It is described that acoustic information corresponding with same spokesman is converted into by same text by feature recognition in one embodiment
After the step of this information, in addition to:
It is according to timestamp that text message is associated with desktop data.
The invention also discloses one kind to pass screen device, including:
Processing module, the acoustic information of surrounding environment for gathering and sending over for receiving source equipment, with reference to itself
The acoustic information of the surrounding environment of collection, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
Rendering module, for rendering the text message in screen picture is thrown.
The invention also discloses a kind of electronic equipment, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as performing the screen transmission method as described in preceding any one.
The invention also discloses a kind of computer-readable recording medium, computer program is stored thereon with, the program is located
Manage the screen transmission method realized when device performs as described in preceding any one.
The acoustic information for the surrounding environment that the present invention gather by receiving source equipment and sended over, with reference to itself collection
Surrounding environment acoustic information, by feature recognition by acoustic information corresponding with same spokesman be converted into one text believe
Breath, the text message is rendered in screen picture is thrown.Due to the source proximity of devices that spokesman generally holds with oneself, thus should
The sound that source equipment collects the spokesman in acoustic information is apparent, improves and distinguishes each spokesman's from acoustic information
Accuracy, it is easy to, according to acoustic information converting text information corresponding to same spokesman, improve the accuracy of speech recognition.
Brief description of the drawings
Fig. 1 is a kind of flow chart of screen transmission method shown in an exemplary embodiment of the invention;
Fig. 2 a are the exemplary plots of the conference scenario shown in an exemplary embodiment of the invention;
Fig. 2 b are the refinement exemplary plots of the processing method to acoustic information shown in an exemplary embodiment of the invention;
Fig. 2 c are the refinement exemplary plots of the processing method to acoustic information shown in an exemplary embodiment of the invention;
Fig. 3 is a kind of flow chart of screen transmission method shown in an exemplary embodiment of the invention;
Fig. 4 is a kind of design sketch for rendering text message shown in an exemplary embodiment of the invention;
Fig. 5 is the logic diagram of a kind of electronic equipment shown in an exemplary embodiment of the invention;
Fig. 6 is a kind of logic diagram of biography screen device shown in an exemplary embodiment of the invention.
Embodiment
Here exemplary embodiment will be illustrated in detail, its example is illustrated in the accompanying drawings.Following description is related to
During accompanying drawing, unless otherwise indicated, the same numbers in different accompanying drawings represent same or analogous key element.Following exemplary embodiment
Described in embodiment do not represent and the consistent all embodiments of the present invention.On the contrary, they be only with it is such as appended
The example of the consistent apparatus and method of some aspects being described in detail in claims, of the invention.
It is only merely for the purpose of description specific embodiment in terminology used in the present invention, and is not intended to be limiting the present invention.
It is also intended in " one kind " of the singulative of the invention with used in appended claims, " described " and "the" including majority
Form, unless context clearly shows that other implications.It is also understood that term "and/or" used herein refers to and wrapped
Containing the associated list items purpose of one or more, any or all may be combined.
It will be appreciated that though various information, but this may be described using term first, second, third, etc. in the present invention
A little information should not necessarily be limited by these terms.These terms are only used for same type of information being distinguished from each other out.For example, do not departing from
In the case of the scope of the invention, the first information can also be referred to as the second information, and similarly, the second information can also be referred to as
One information.Depending on linguistic context, word as used in this " if " can be construed to " ... when " or " when ...
When " or " in response to determining ".
The equipment such as meeting flat board are because with giant-screen, audio is good, supports the advantages such as handwriting input, in recent years, in meeting field
Conjunction is widely used.As a rule, show on screen of the speaker by passing mobile phone, apparatus such as computer that screen technology uses oneself
The content shown and the sound (desktop data) played are synchronized to the display devices such as meeting flat board and are shown.However, personnel participating in the meeting
Different language, accent, word speed may be used, causes other personnels participating in the meeting possibly can not understand conferencing information completely.Current
Although speech recognition can convert speech into captions, more mouthfuls of people is miscellaneous during session discussing, and the captions of generation are also
Chaotic, therefore, application effect of the speech recognition in conference scenario is bad so that the information distortion issued, discussed in meeting
Or omit, reduce the efficiency of meeting communication.
On the other hand, the present invention proposes a kind of screen transmission method, as shown in figure 1, this method includes:
S110, the acoustic information for receiving the surrounding environment that source equipment is gathered and sended over, the week gathered with reference to itself
The acoustic information in collarette border, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
S120, render the text message in screen picture is thrown.
As a rule, the equipment such as meeting flat board (hereinafter referred to as passing screen equipment), which needs to be placed on, is easy to all participants to see
The position seen, therefore pass position and the personnel participating in the meeting's holding certain distance of screen equipment.Show as shown in Figure 2 a for a mini-session
Be intended to, 4 personnels participating in the meeting 230 along round table 240 successively fall sit, pass screen equipment 210 be placed on personnel participating in the meeting 230 opposite (for example,
On wall), the speaker of meeting is configured with source equipment 220 (such as computer, microphone etc.), and other personnels participating in the meeting 230 can also
Configure source equipment 220.Because all personnels participating in the meeting 230 may make a speech (currently will be referred to as spokesman in talker), respectively
Spokesman's distance passes screen equipment 210 farther out, because sonic transmissions have decay, the noise of surrounding environment and depositing for other interference
The quality that source equipment 220 gathers sound, and spokesman will be as a rule less than by passing screen equipment 210 and gathering the quality of sound
When having supporting source equipment 220, the sound of the spokesman of the source equipment 220 collection can be apparent.
The unit 211 in screen equipment 210 is passed in Fig. 2 a and represents that microphone etc. can gather the sound of spokesman, certainly, also may be used
It is of the invention that this is not made to be the equipment (for example, omnidirectional microphone etc.) for the external collection sound being connected with passing screen equipment 210
Limitation.Source equipment 220 can also gather the sound of spokesman, and the acoustic information of collection is sent to and passes screen equipment 210, pass
Unit 212 in screen equipment 210 represents communicator, it is of course also possible to be that the modes such as bluetooth or wireless network send the sound
Information, meanwhile, meeting speaker also passes through source equipment 220 and desktop data is sent into biography screen equipment 210, passes screen equipment 210
Show the content (throwing screen picture) of the desktop data of source equipment 220.
Pass screen equipment 210 and the acoustic information that itself is gathered and the acoustic information that source equipment 220 gathers are subjected to total score
Analysis is handled, and so as to accurately identify the acoustic information of each spokesman, and acoustic information can be changed into text message
(each spokesman can be directed to one text message is set, or the text message of each spokesman is recorded in a data), then will
Text information is rendered in screen picture is thrown, and rendering effect can be similar to captions.
The acoustic information of itself collection is carried out to the mode of comprehensive analysis processing with the acoustic information that source equipment 220 gathers
Can have it is a variety of, such as:
The acoustic information of the surrounding environment of itself collection is analyzed and processed, changed acoustic information according to feature recognition
Into the first text message corresponding with spokesman;
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, according to the acoustic information to the first text
This information is corrected.
As shown in Figure 2 b, pass screen equipment 210 and source equipment 220 gathers the acoustic information of surrounding environment simultaneously, pass screen and set
Standby 210 are analyzed and processed the acoustic information 0# of itself collection, identified according to feature recognition from acoustic information 0# with respectively
Acoustic information corresponding to spokesman and change into the first text message (can be directed to each spokesman set one first text envelope
Cease, or the first text message of each spokesman recorded in a data), the acoustic information that source equipment 220 gathers itself
1#, which is sent to, passes screen equipment 210, passes screen equipment 210 and the first text message is corrected according to acoustic information 1#, so as to obtain
The high text message of the degree of accuracy.Wherein, correcting mode can be that acoustic information 1# is converted into text message 1#, by the first text
Information is compared with text message 1# to be corrected;Can also be that the first text message is answered by acoustic information 1#
Core;The concrete mode for correcting text in the application by sound is not limited to this, can also use other correcting modes.
It is, of course, also possible to acoustic information is handled in the following way:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, by the acoustic information and itself collection
The acoustic information of surrounding environment analyzed and processed, extract acoustic information corresponding with spokesman;
Acoustic information corresponding with spokesman is converted into text message.
As shown in Figure 2 c, pass screen equipment 210 and source equipment 220 gathers the acoustic information of surrounding environment simultaneously, source is set
The acoustic information 1# of itself collection is sent to biography screen equipment 210 by standby 220, passes the acoustic information that screen equipment 210 gathers itself
0# is analyzed and processed with acoustic information 1#, so as to extract the acoustic information (acoustic information of extraction corresponding with each spokesman
Can be for single spokesman or comprising multiple spokesman), will acoustic information conversion corresponding with spokesman
Into text message, the acoustic information that noise even other spokesman have been filtered out due to the acoustic information of extraction (is designated as clean speech
Information), clean speech can obtain in the following manner:Using from the acoustic information that source equipment 210 receives as reference information,
Acoustic information and the reference information of itself collection are subjected to correlation operation, the method for correlation operation has a variety of, and designer can
Selected according to actual use situation;So as to remove the acoustic information of ambient noise and/or other spokesman, so as to can
Extract acoustic information (clean speech information) corresponding with single spokesman.Certainly, source equipment 210 can also be by collection
Acoustic information is decayed and filtered etc. processing, only retains the acoustic information of the spokesman using the source equipment 210, then should
Acoustic information is sent to screen equipment 210 is passed, and by improving the precision of reference information, can further be improved and single spokesman couple
The purity for the acoustic information answered.
It is higher that the accuracy of text message is converted according to clean speech information, and clean speech information can also be done further
Ground optimization processing.For example, the voice of part spokesman is too small, too big or the big minor swing of voice is larger, to this kind of acoustic information
Auditory effect will be improved by carrying out speech processes, particularly be reviewed and (listened) after record screen and/or record screen data are sent into place remote
When participating in the participant of meeting, processing procedure is as shown in Figure 3:
Acoustic information corresponding to spokesman is subjected to speech processes, the speech processes include gain process and/or decay
Processing, makes volume corresponding to acoustic information be in preset range;
It is according to timestamp that the acoustic information after processing is associated with desktop data.
By in the speech processes of clean speech information to preset range, spike low ebb is removed, it is possible to increase auditory effect, when
So, acoustic information is converted into text message again after volume adjustment can also being carried out, without the interference of spike low ebb, can also be carried
Height is converted into the degree of accuracy of text message.
With Internationalization level more and more higher, multilingual, such as Chinese, English, day may be used in a meeting
Language etc., so as to which text message can include at least one of:
Text message corresponding with acoustic information languages;For example, Chinese is changed into Chinese, English converts English, Sino-British
Mixed changes into Chinese and English etc.;
Text message corresponding with target language;For example, target language is Chinese, then Chinese is changed into Chinese, English
What conversion Chinese, China and Britain used with changes into Chinese etc.;
Text message corresponding with subject kind in acoustic information;For example, during subject kind is in Sino-British mixed acoustic information
Text, then change into Chinese by what the China and Britain used with;Subject kind is English in the mixed acoustic information of China and Britain, then the China and Britain are mixed
Change into English etc.;
Text message corresponding with acoustic information languages;For example, during time languages are in Sino-British mixed acoustic information
Text, then change into Chinese by what the China and Britain used with;Time languages are English in the mixed acoustic information of China and Britain, then the China and Britain are mixed
Change into English etc..
It is, of course, also possible to dialect is changed into text message corresponding to target language, for example, Guangdong language turns Chinese etc..
Due to there may be more people in meeting while making a speech, particularly during arguement etc., it may be difficult to whom tells and said
What word, by previous embodiment, the present invention can be directed to each spokesman generate corresponding to text message, thus,
When rendering the text message (captions) in throwing screen picture, it can distinguish different spokesman's using various forms of captions
Text message:
Matching with different spokesman corresponding to renderer property, according to the renderer property throw screen picture in render the text
This information;
Wherein, the renderer property includes at least one of:Font color, font size, font weight, display side
Position, customized tags;The customized tags include following any:Underscore, word highlight color.
As shown in figure 4, the captions band background color of a spokesman, the captions of another spokesman are without background color, certainly, renderer property
Species it is a lot, can also be using the mode such as different color.Can be by the captions of each spokesman corresponding to screen one
Fixed position is shown, can not also fix the position of captions appearance, or the captions of the common spokesman of similar barrage show at both ends
Show, the captions of speaker centre display etc..
As a rule, the speech content of the speaker of meeting is emphasis, therefore, can will text envelope corresponding with speaker
Breath is focused on display in the form of being different from other spokesman.It is considered that the source equipment for sending desktop data is that speaker makes
Source equipment, can according to MAC (Media Access Control, media access control) address of source equipment etc.,
It is speaker so as to distinguish which acoustic information, using corresponding to the acoustic information, single spokesman is as speaker, with area
The text message of the speaker is not focused on display in the form of other spokesman.For example, the as shown in figure 4, captions with background color
The captions of speaker are may be considered, the captions of no background color belong to common spokesman's.It is of course also possible to renderer property is changed,
For example, renderer property corresponding with MAC Address is set, according to each spokesman of MAC Address differentiation for sending acoustic information and correspondingly
Text message, and then for text message loading corresponding to renderer property be rendered into throwing screen picture in form captions.Though shown in Fig. 4
To throw the situation of screen (desktop data that a source equipment is only shown in biography screen equipment) by single speaker, but it is existing one at present
Received in individual biography screen equipment and show the implementation of multiple source equipment desktop datas, it is clear that passed and one is shown in screen equipment
Or the desktop data of multiple source equipment, do not change the use condition that the present invention passes screen scheme, therefore, the solution of the present invention
Situation suitable for showing multiple source equipment desktop datas being passed screen equipment.
Meeting usually requires to carry out minutes, only picture and/or the sound such as conventional video recording or record screen, reviews video recording
When it is uninteresting, and for being unfamiliar with the people of meeting situation, it is who says that light listening, which is difficult to what which said told, therefore,
Invention proposition is associated with desktop data by text message according to timestamp, in the desktop data that subsequent playback is recorded, right
The time answered text exhibition information successively, certainly, text information can also occur simultaneously with aforementioned sound information, so as to,
Review the speech content that each spokesman is easily discernible during video recording;For example, there are red, black, blue three-color captions, correspond to respectively
First, second, the third three spokesman, by by red captions are corresponding with the sound of first, the sound of black captions and second during viewing video recording
Correspondingly, blue captions are corresponding with third sound, can easily differentiate the speech content of each spokesman.Certainly, which also may be used
So that applied in teleconference, desktop data, acoustic information and/or the text message of local are sent into nonlocal equipment, increase
Add strange land people with a part in a conference to understand the mode of conference content, improve relay session effect.
Corresponding with the embodiment of foregoing screen transmission method, present invention also offers the embodiment for passing screen device.
The embodiment that the present invention passes screen device can be applied on meeting flat board.Device embodiment can be real by software
It is existing, it can also be realized by way of hardware or software and hardware combining.Exemplified by implemented in software, as on a logical meaning
Device, it is to be read corresponding computer program instructions in nonvolatile memory by the processor of meeting flat board where it
Operation is formed in internal memory.For hardware view, as shown in figure 5, one kind of meeting flat board where passing screen device for the present invention
Hardware structure diagram, in addition to the processor shown in Fig. 5, internal memory, network interface and nonvolatile memory, in embodiment
Meeting flat board where device can also include other hardware, this is repeated no more generally according to the actual functional capability of the biography screen.
Fig. 6 is refer to, the biography screen device 600 includes:
Processing module 610, the acoustic information of surrounding environment for gathering and sending over for receiving source equipment, with reference to from
The acoustic information of the surrounding environment of body collection, text envelope corresponding with spokesman is converted into by feature recognition by acoustic information
Breath;
Rendering module 620, for rendering the text message in screen picture is thrown.
Further, the invention also provides a kind of electronic equipment, including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as performing the screen transmission method as described in preceding any one.
Further, the invention also provides a kind of computer-readable recording medium, computer program is stored thereon with, should
The screen transmission method as described in preceding any one is realized when program is executed by processor.
Meeting flat board of the present invention, which has, passes screen function, and audio conversion text is added on the basis of screen function is passed in original
The functions such as word, the function can be realized using existing transliteration software, and the transliteration result of funcall transliteration software is shielded by passing;
Can also be by the function and service of transliteration software in screen function is passed;It is of course also possible to it can be realized according to actual conditions design is other
The plug-in unit of the function, this is not limited by the present invention.
The function of unit and the implementation process of effect specifically refer to and step are corresponded in the above method in said apparatus
Implementation process, it will not be repeated here.
For device embodiment, because it corresponds essentially to embodiment of the method, so related part is real referring to method
Apply the part explanation of example.Device embodiment described above is only schematical, wherein described be used as separating component
The unit of explanation can be or may not be physically separate, can be as the part that unit is shown or can also
It is not physical location, you can with positioned at a place, or can also be distributed on multiple NEs.Can be according to reality
Need to select some or all of module therein to realize the purpose of the present invention program.Those of ordinary skill in the art are not paying
In the case of going out creative work, you can to understand and implement.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all essences in the present invention
God any modification, equivalent substitution and improvements done etc., should be included within the scope of protection of the invention with principle.
Claims (12)
1. a kind of screen transmission method, it is characterised in that comprise the following steps:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, with reference to the sound for the surrounding environment that itself is gathered
Message is ceased, and acoustic information is converted into text message corresponding with spokesman by feature recognition;
The text message is rendered in screen picture is thrown.
2. screen transmission method as claimed in claim 1, it is characterised in that the week for receiving source equipment and gathering and sending over
The acoustic information in collarette border, with reference to the acoustic information for the surrounding environment that itself is gathered, acoustic information is changed by feature recognition
Include into the step of text message corresponding with spokesman:
The acoustic information of surrounding environment of itself collection is analyzed and processed, according to feature recognition by acoustic information be converted into
First text message corresponding to spokesman;
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, according to the acoustic information to the first text envelope
Breath is corrected.
3. screen transmission method as claimed in claim 1, it is characterised in that the week for receiving source equipment and gathering and sending over
The acoustic information in collarette border, with reference to the acoustic information for the surrounding environment that itself is gathered, acoustic information is changed by feature recognition
Include into the step of text message corresponding with spokesman:
The acoustic information for the surrounding environment that source equipment is gathered and sended over is received, by the acoustic information and the week of itself collection
The acoustic information in collarette border is analyzed and processed, and extracts acoustic information corresponding with spokesman;
Acoustic information corresponding with spokesman is converted into text message.
4. screen transmission method as claimed in claim 3, it is characterised in that described by the acoustic information and surrounding's ring of itself collection
The acoustic information in border is analyzed and processed, extract acoustic information corresponding with spokesman the step of include:
So that from the acoustic information that source equipment receives as reference information, the acoustic information of itself collection to be carried out with reference information
Correlation operation, ambient noise and/or the acoustic information of other spokesman are removed, extract sound letter corresponding with single spokesman
Breath.
5. screen transmission method as claimed in claim 3, it is characterised in that the screen picture of throwing is the table sent to source equipment
Face data are shown the picture of gained, after described the step of extracting acoustic information corresponding with single spokesman, in addition to:
Acoustic information corresponding to spokesman is subjected to speech processes, the speech processes include gain process and/or attenuation processing,
Volume corresponding to acoustic information is set to be in preset range;
It is according to timestamp that the acoustic information after processing is associated with desktop data.
6. screen transmission method as claimed in claim 1, it is characterised in that the text message includes at least one of:
Text message corresponding with acoustic information languages;
Text message corresponding with target language;
Text message corresponding with subject kind in acoustic information;
Text message corresponding with acoustic information languages.
7. the screen transmission method as any one of claim 1 to 6, it is characterised in that render the text in screen picture is thrown
The step of this information, includes:
Matching with different spokesman corresponding to renderer property, according to the renderer property throw screen picture in render the text envelope
Breath;
Wherein, the renderer property includes at least one of:It is font color, font size, font weight, display orientation, individual
Propertyization marks;The customized tags include following any:Underscore, word highlight color.
8. screen transmission method as claimed in claim 7, it is characterised in that the step of rendering the text message in throwing screen picture
Including:
Matching sends the acoustic information of the surrounding environment transmitted by the source equipment of desktop data, will be single corresponding to the acoustic information
One spokesman focuses on display the text message of the speaker as speaker in the form of being different from other spokesman.
9. screen transmission method as claimed in claim 8, it is characterised in that it is described will be corresponding with same spokesman by feature recognition
Acoustic information the step of being converted into one text information after, in addition to:
It is according to timestamp that text message is associated with desktop data.
10. one kind passes screen device, it is characterised in that including:
Processing module, the acoustic information of surrounding environment for gathering and sending over for receiving source equipment, is gathered with reference to itself
Surrounding environment acoustic information, acoustic information is converted into by text message corresponding with spokesman by feature recognition;
Rendering module, for rendering the text message in screen picture is thrown.
11. a kind of electronic equipment, it is characterised in that including:
Processor;
For storing the memory of processor-executable instruction;
Wherein, the processor is configured as performing the screen transmission method in the claim 1-9 described in any one.
12. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is by processor
The screen transmission method as described in any one in claim 1-9 is realized during execution.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710666179.8A CN107527623B (en) | 2017-08-07 | 2017-08-07 | Screen transmission method and device, electronic equipment and computer readable storage medium |
PCT/CN2017/116067 WO2019029073A1 (en) | 2017-08-07 | 2017-12-14 | Screen transmission method and apparatus, and electronic device, and computer readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710666179.8A CN107527623B (en) | 2017-08-07 | 2017-08-07 | Screen transmission method and device, electronic equipment and computer readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107527623A true CN107527623A (en) | 2017-12-29 |
CN107527623B CN107527623B (en) | 2021-02-09 |
Family
ID=60680627
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710666179.8A Active CN107527623B (en) | 2017-08-07 | 2017-08-07 | Screen transmission method and device, electronic equipment and computer readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107527623B (en) |
WO (1) | WO2019029073A1 (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109151642A (en) * | 2018-09-05 | 2019-01-04 | 北京今链科技有限公司 | A kind of intelligent earphone, intelligent earphone processing method, electronic equipment and storage medium |
CN111770319A (en) * | 2019-10-18 | 2020-10-13 | 北京沃东天骏信息技术有限公司 | Projection method, device, system and storage medium |
CN112019786A (en) * | 2020-08-24 | 2020-12-01 | 上海松鼠课堂人工智能科技有限公司 | Intelligent teaching screen recording method and system |
CN112684967A (en) * | 2021-03-11 | 2021-04-20 | 荣耀终端有限公司 | Method for displaying subtitles and electronic equipment |
CN112887781A (en) * | 2021-01-27 | 2021-06-01 | 维沃移动通信有限公司 | Subtitle processing method and device |
WO2021233218A1 (en) * | 2020-05-19 | 2021-11-25 | 华为技术有限公司 | Screen casting method, screen casting source end, screen casting destination end, screen casting system and storage medium |
CN113746911A (en) * | 2021-08-26 | 2021-12-03 | 科大讯飞股份有限公司 | Audio processing method and related device, electronic equipment and storage medium |
CN114125358A (en) * | 2021-11-11 | 2022-03-01 | 北京有竹居网络技术有限公司 | Cloud conference subtitle display method, system, device, electronic equipment and storage medium |
CN115052126A (en) * | 2022-08-12 | 2022-09-13 | 深圳市稻兴实业有限公司 | Ultra-high definition video conference analysis management system based on artificial intelligence |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111914115B (en) * | 2019-05-08 | 2024-05-28 | 阿里巴巴集团控股有限公司 | Sound information processing method and device and electronic equipment |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103456305A (en) * | 2013-09-16 | 2013-12-18 | 东莞宇龙通信科技有限公司 | Terminal and speech processing method based on multiple sound collecting units |
CN104796584A (en) * | 2015-04-23 | 2015-07-22 | 南京信息工程大学 | Prompt device with voice recognition function |
US20160092154A1 (en) * | 2014-09-30 | 2016-03-31 | International Business Machines Corporation | Content mirroring |
CN105704538A (en) * | 2016-03-17 | 2016-06-22 | 广东小天才科技有限公司 | Method and system for generating audio and video subtitles |
CN105913845A (en) * | 2016-04-26 | 2016-08-31 | 惠州Tcl移动通信有限公司 | Mobile terminal voice recognition and subtitle generation method and system and mobile terminal |
CN106297794A (en) * | 2015-05-22 | 2017-01-04 | 西安中兴新软件有限责任公司 | The conversion method of a kind of language and characters and equipment |
CN106657865A (en) * | 2016-12-16 | 2017-05-10 | 联想(北京)有限公司 | Method and device for generating conference summary and video conference system |
CN106910504A (en) * | 2015-12-22 | 2017-06-30 | 北京君正集成电路股份有限公司 | A kind of speech reminding method and device based on speech recognition |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120245936A1 (en) * | 2011-03-25 | 2012-09-27 | Bryan Treglia | Device to Capture and Temporally Synchronize Aspects of a Conversation and Method and System Thereof |
JP2014240940A (en) * | 2013-06-12 | 2014-12-25 | 株式会社東芝 | Dictation support device, method and program |
CN205647778U (en) * | 2016-04-01 | 2016-10-12 | 安徽听见科技有限公司 | Intelligent conference system |
CN106057193A (en) * | 2016-07-13 | 2016-10-26 | 深圳市沃特沃德股份有限公司 | Conference record generation method based on telephone conference and device |
CN106911832B (en) * | 2017-04-28 | 2020-06-02 | 四川音创伟业科技有限公司 | Voice recording method and device |
-
2017
- 2017-08-07 CN CN201710666179.8A patent/CN107527623B/en active Active
- 2017-12-14 WO PCT/CN2017/116067 patent/WO2019029073A1/en active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103456305A (en) * | 2013-09-16 | 2013-12-18 | 东莞宇龙通信科技有限公司 | Terminal and speech processing method based on multiple sound collecting units |
US20160092154A1 (en) * | 2014-09-30 | 2016-03-31 | International Business Machines Corporation | Content mirroring |
CN104796584A (en) * | 2015-04-23 | 2015-07-22 | 南京信息工程大学 | Prompt device with voice recognition function |
CN106297794A (en) * | 2015-05-22 | 2017-01-04 | 西安中兴新软件有限责任公司 | The conversion method of a kind of language and characters and equipment |
CN106910504A (en) * | 2015-12-22 | 2017-06-30 | 北京君正集成电路股份有限公司 | A kind of speech reminding method and device based on speech recognition |
CN105704538A (en) * | 2016-03-17 | 2016-06-22 | 广东小天才科技有限公司 | Method and system for generating audio and video subtitles |
CN105913845A (en) * | 2016-04-26 | 2016-08-31 | 惠州Tcl移动通信有限公司 | Mobile terminal voice recognition and subtitle generation method and system and mobile terminal |
CN106657865A (en) * | 2016-12-16 | 2017-05-10 | 联想(北京)有限公司 | Method and device for generating conference summary and video conference system |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109151642A (en) * | 2018-09-05 | 2019-01-04 | 北京今链科技有限公司 | A kind of intelligent earphone, intelligent earphone processing method, electronic equipment and storage medium |
CN111770319A (en) * | 2019-10-18 | 2020-10-13 | 北京沃东天骏信息技术有限公司 | Projection method, device, system and storage medium |
WO2021233218A1 (en) * | 2020-05-19 | 2021-11-25 | 华为技术有限公司 | Screen casting method, screen casting source end, screen casting destination end, screen casting system and storage medium |
CN112019786A (en) * | 2020-08-24 | 2020-12-01 | 上海松鼠课堂人工智能科技有限公司 | Intelligent teaching screen recording method and system |
CN112019786B (en) * | 2020-08-24 | 2021-05-25 | 上海松鼠课堂人工智能科技有限公司 | Intelligent teaching screen recording method and system |
CN112887781A (en) * | 2021-01-27 | 2021-06-01 | 维沃移动通信有限公司 | Subtitle processing method and device |
CN112684967A (en) * | 2021-03-11 | 2021-04-20 | 荣耀终端有限公司 | Method for displaying subtitles and electronic equipment |
CN113746911A (en) * | 2021-08-26 | 2021-12-03 | 科大讯飞股份有限公司 | Audio processing method and related device, electronic equipment and storage medium |
CN114125358A (en) * | 2021-11-11 | 2022-03-01 | 北京有竹居网络技术有限公司 | Cloud conference subtitle display method, system, device, electronic equipment and storage medium |
CN115052126A (en) * | 2022-08-12 | 2022-09-13 | 深圳市稻兴实业有限公司 | Ultra-high definition video conference analysis management system based on artificial intelligence |
Also Published As
Publication number | Publication date |
---|---|
WO2019029073A1 (en) | 2019-02-14 |
CN107527623B (en) | 2021-02-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107527623A (en) | Screen transmission method, device, electronic equipment and computer-readable recording medium | |
CN101689365B (en) | Method of controlling a video conference | |
CN103327181B (en) | Voice chatting method capable of improving efficiency of voice information learning for users | |
US11650790B2 (en) | Centrally controlling communication at a venue | |
CN109951743A (en) | Barrage information processing method, system and computer equipment | |
CN111048093A (en) | Conference sound box, conference recording method, device, system and computer storage medium | |
CN105245355A (en) | Intelligent voice shorthand conference system | |
US10313502B2 (en) | Automatically delaying playback of a message | |
US20210312143A1 (en) | Real-time call translation system and method | |
CN111107283B (en) | Information display method, electronic equipment and storage medium | |
US20230247131A1 (en) | Presentation of communications | |
CN114531425B (en) | Processing method and processing device | |
US10580410B2 (en) | Transcription of communications | |
CN210091177U (en) | Conference system for realizing synchronous translation | |
US11830120B2 (en) | Speech image providing method and computing device for performing the same | |
KR20180068655A (en) | apparatus and method for generating text based on audio signal | |
Shang et al. | Audio recordings dataset of genuine and replayed speech at both ends of a telecommunication channel | |
CN110931001B (en) | Anti-noise audio transmission device facing voice recognition | |
US20230421702A1 (en) | Distributed teleconferencing using personalized enhancement models | |
JP2022113375A (en) | Information processing method and monitoring system | |
CN113919299A (en) | Summary text generation method, projection device and computer readable storage medium | |
Patil et al. | MuteTrans: A communication medium for deaf | |
CN117636928A (en) | Pickup device and related audio enhancement method | |
CN109889764A (en) | Conference system | |
CN114530159A (en) | Multimedia resource integration scheduling method based on WebRTC technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |