CN108628837A - Using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent - Google Patents
Using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent Download PDFInfo
- Publication number
- CN108628837A CN108628837A CN201710172489.4A CN201710172489A CN108628837A CN 108628837 A CN108628837 A CN 108628837A CN 201710172489 A CN201710172489 A CN 201710172489A CN 108628837 A CN108628837 A CN 108628837A
- Authority
- CN
- China
- Prior art keywords
- simultaneous interpretation
- component
- spanish
- layer
- neural networks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses a kind of simultaneous interpretation casees using convolutional neural networks algorithm translation Spanish and Sichuan accent, including:Component one, simultaneous interpretation box main body part and its accessory;Large database concept in component two, cloud storage;The convolutional neural networks model carried in component three, cloud computing center is constituted, and by above-mentioned component, the present invention can substitute the advanced simultaneous interpretation translator of profession, be translated for user, the benefit brought is:Translation will not lead to the mistake caused by fatigue for a long time;Greatly reduce the fund cost for engaging simultaneous interpretation translator;Since, without sitting people, therefore volume very little can be that space is saved in international conference place, the seats for placing participants increase the number of participants of meeting more in the simultaneous interpretation case of the present invention.
Description
Technical field
The present invention relates to intelligent simultaneous interpretation casees, more particularly to using convolutional neural networks algorithm translation Spanish and four
The simultaneous interpretation case of Kawaguchi's sound.
Background technology
With the quickening of internationalization process, the demand of simultaneous interpretation translation is increasing, and in international conference, simultaneous interpretation case is indispensable
Important translation tool, traditional simultaneous interpretation case is a subvitreous closed box, and translator is sitting in wherein, passes through microphone pair
On meeting-place spokesman what is said or talked about carry out simultaneous interpretation translation, traditional simultaneous interpretation case overall volume is larger, occupy meeting-place space;Traditional
Simultaneous interpretation case closed structure ventilating air permeability is bad, easily causes translator's anoxic, under the translation of high intensity, be easy with
The extension of the time of meeting and make translator tired, to influence the accuracy rate of translation;And the same of profession is engaged in international conference
It is larger to pass translation expense, if spokesman has the people with Sichuan accent in meeting-place, then also needs to be equipped with the translator for understanding Sichuan words, such as
Fruit translation foreign side's offer again is difficult then to engage to the personnel that can carry out simultaneous interpretation translation to Sichuan dialect and Spanish.
Invention content
The invention mainly solves the technical problem of providing a kind of using convolutional neural networks algorithm translation Spanish and
The simultaneous interpretation case of Sichuan accent can substitute the advanced simultaneous interpretation translation of high wages, provide to the user will not because the translation time is long and
The translation error caused by fatigue, and can identify the Sichuan accent of user, avoid user from not speaking standard Chinese pronunciation, translator is again
It is ignorant of the difficult situation of Sichuan accent.
In order to solve the above technical problems, one aspect of the present invention is:It provides a kind of using convolutional Neural net
The simultaneous interpretation case of network algorithm translation Spanish and Sichuan accent, which is characterized in that including:Component one, simultaneous interpretation box main body part and
Its accessory;Large database concept in component two, cloud storage;The convolutional neural networks model structure carried in component three, cloud computing center
At.
Convolutional neural networks are a kind of perceptrons of multilayer, and every layer is made of two dimensional surface, and each plane is by multiple only
Vertical neuron forms, first comprising some simple members and complexity in network, is denoted as C members and S members respectively;C members condense together structure
At convolutional layer, S members, which condense together, constitutes down-sampling layer;The Sichuan accent audio or spanish audio of input by and filtering
Device and can biasing set carry out convolution, the N number of characteristic pattern of generation at C layer(N values can be manually set), then Feature Mapping figure is by asking
With weighted value and biasing, then pass through an activation primitive(Usually select Sigmoid functions)Obtain S layers of Feature Mapping figure.Root
According to the quantity for being manually set C layers and S layers, the above work recycles progress successively;Finally, the down-sampling to most tail portion and output layer into
The full connection of row, obtains output translation result to the end.
The simultaneous interpretation box main body of the present invention is case structure, is different from traditional simultaneous interpretation case, the present invention is due to using volume
The machine translation mode of product neural network algorithm, therefore without sitting people, therefore the simultaneous interpretation of the present invention in the simultaneous interpretation case at meeting scene
Case volume is much smaller than traditional simultaneous interpretation case, is provided in simultaneous interpretation box main body of the invention and to be connected for several audio transmission lines
The switch and master switch of total electric wire connecting junction, the interchanger of incoming fiber optic, each voice-grade channel of control;The accessory of simultaneous interpretation case includes:Meeting
Each attend a banquet the audio input line being connected and output line on field, microphone of attending a banquet, earphone of attending a banquet, attend a banquet on translation switch, light
It is fine.
Claims (7)
1. using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent, which is characterized in that including:Component
One, simultaneous interpretation box main body part and its accessory;Large database concept in component two, cloud storage;It is carried in component three, cloud computing center
Convolutional neural networks model is constituted
Component one according to claim 1, it is characterised in that:Simultaneous interpretation box main body is case structure, is different from traditional same
Biography case, machine translation mode of the present invention due to using convolutional neural networks algorithm, therefore in the simultaneous interpretation case at meeting scene
Without sitting people, therefore the simultaneous interpretation case volume of the present invention is much smaller than traditional simultaneous interpretation case, is provided in simultaneous interpretation box main body of the invention
For total electric wire connecting junction of several audio transmission lines connection, the interchanger of incoming fiber optic, each voice-grade channel of control switch and
Master switch;The accessory of simultaneous interpretation case includes:The each audio input line being connected and output line, microphone of attending a banquet, seat of attending a banquet on meeting-place
Seat earphone, attend a banquet on translation switch, optical fiber.
2. component two according to claim 1, it is characterised in that:The memory space of cloud storage configuration must not be less than 100TB;
In the space of cloud storage, taxonomic revision is carried out to big data information, inventory is divided to put;Large database concept in cloud storage space is at least
Need to include following several libraries:1)Spanish voice big data(By being no less than 100 people, divides men and women to carry out pronunciation respectively and record together
One Spanish;The total sentence number of Spanish of different content is not less than 100,000)、2)The voice large database concept of Sichuan accent
(By being no less than 100 people, divides men and women to carry out pronunciation respectively and record same sentence Sichuan accent sentence;The Sichuan accent language of different content
The total sentence number of sentence is not less than 100,000)、3)Spanish grammer and pronunciation rule database, 4)Chinese grammar and Sichuan accent pronunciation
Rule database.
3. component three according to claim 1, it is characterised in that:Big to convolutional neural networks algorithm model input audio
Before data, it need to be that every audio data carries out sound wave Image Rendering, then by drawn sound wave image input part three.
4. component three according to claim 1, it is characterised in that:Convolutional neural networks are a kind of perceptrons of multilayer, often
Layer is made of two dimensional surface, and each plane is made of multiple independent neurons, simple first and complicated comprising some in network
Member is denoted as C members and S members respectively;C members, which condense together, constitutes convolutional layer, and S members, which condense together, constitutes down-sampling layer;Input
Sichuan accent audio or spanish audio by with filter and can biasing set carry out convolution, the N number of characteristic pattern of generation at C layer(N
Value can be manually set), then Feature Mapping figure is by summation, weighted value and biasing, then passes through an activation primitive(Usually select
Sigmoid functions)Obtain S layers of Feature Mapping figure.
5. according to the quantity for being manually set C layers and S layers, the above work recycles progress successively;Finally, to the down-sampling of most tail portion and
Output layer is connected entirely, obtains output translation result to the end.
6. component three according to claim 1, it is characterised in that the process of convolution:It is gone with a trainable filter fx
The audio frequency sound image of one input of convolution(It is input picture at C1 layers, the input of convolutional layer later is then that the convolution of preceding layer is special
Sign figure), by an activation primitive (generally using Sigmoid functions), then plus one biases bx, obtains convolutional layer Cx.
7. concrete operation such as following formula, Mj is the value of input feature vector figure in formula:
The process of sub-sampling includes:The m pixel per neighborhood(M is to be manually set)Summation becomes a pixel, then passes through mark
Wx+1 weightings are measured, biasing bx+1 is further added by, Feature Mapping figure is then generated by activation primitive Sigmoid;From a plane to
The mapping of next plane can be regarded as making convolution algorithm, and S layers are considered as fuzzy filter, play Further Feature Extraction
Effect;Spatial resolution between hidden layer and hidden layer is successively decreased, and the number of planes contained by every layer is incremented by, and can be used for detecting more in this way
More characteristic informations;For sub-sampling layer, there is N number of input feature vector figure, just has N number of output characteristic pattern, only each feature
The size of figure has obtained corresponding change, concrete operation such as following formula, down in formula()Indicate down-sampling function;
。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710172489.4A CN108628837A (en) | 2017-03-22 | 2017-03-22 | Using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710172489.4A CN108628837A (en) | 2017-03-22 | 2017-03-22 | Using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108628837A true CN108628837A (en) | 2018-10-09 |
Family
ID=63706899
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710172489.4A Pending CN108628837A (en) | 2017-03-22 | 2017-03-22 | Using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108628837A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021051513A1 (en) * | 2019-09-19 | 2021-03-25 | 平安科技(深圳)有限公司 | Chinese-english translation method based on neural network, and related devices thereof |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN203338350U (en) * | 2013-06-09 | 2013-12-11 | 武汉传神信息技术有限公司 | Machine simultaneous interpretation system |
-
2017
- 2017-03-22 CN CN201710172489.4A patent/CN108628837A/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN203338350U (en) * | 2013-06-09 | 2013-12-11 | 武汉传神信息技术有限公司 | Machine simultaneous interpretation system |
Non-Patent Citations (1)
Title |
---|
FANDONG MENG ET AL: "Encoding Source Language with Convolutional Neural Network for Machine Translation", 《PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021051513A1 (en) * | 2019-09-19 | 2021-03-25 | 平安科技(深圳)有限公司 | Chinese-english translation method based on neural network, and related devices thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Liu et al. | Divide and conquer: A deep CASA approach to talker-independent monaural speaker separation | |
WO2022048403A1 (en) | Virtual role-based multimodal interaction method, apparatus and system, storage medium, and terminal | |
Czyzewski et al. | An audio-visual corpus for multimodal automatic speech recognition | |
US5826234A (en) | Device and method for dubbing an audio-visual presentation which generates synthesized speech and corresponding facial movements | |
CN110223705A (en) | Phonetics transfer method, device, equipment and readable storage medium storing program for executing | |
CN106328126A (en) | Far-field speech recognition processing method and device | |
CN108847249A (en) | Sound converts optimization method and system | |
CN110517705B (en) | Binaural sound source positioning method and system based on deep neural network and convolutional neural network | |
Chen et al. | Visual acoustic matching | |
Yang et al. | Audio-visual speech codecs: Rethinking audio-visual speech enhancement by re-synthesis | |
CN107818785A (en) | A kind of method and terminal device that information is extracted from multimedia file | |
CN110136698A (en) | For determining the method, apparatus, equipment and storage medium of nozzle type | |
US20220358703A1 (en) | Method and device for generating speech video on basis of machine learning | |
JP7143847B2 (en) | Information processing system, information processing method, and program | |
WO2022106654A2 (en) | Methods and systems for video translation | |
CN113823273B (en) | Audio signal processing method, device, electronic equipment and storage medium | |
CN110475157A (en) | Multimedia messages methods of exhibiting, device, computer equipment and storage medium | |
CN109599094A (en) | The method of sound beauty and emotion modification | |
Majumder et al. | Active audio-visual separation of dynamic sound sources | |
CN113539232A (en) | Muslim class voice data set-based voice synthesis method | |
CN108628837A (en) | Using the simultaneous interpretation case of convolutional neural networks algorithm translation Spanish and Sichuan accent | |
CN114792524A (en) | Audio data processing method, apparatus, program product, computer device and medium | |
CN108495160A (en) | Intelligent control method, system, equipment and storage medium | |
CN116912375A (en) | Facial animation generation method and device, electronic equipment and storage medium | |
Tai et al. | Idanet: An information distillation and aggregation network for speech enhancement |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20181009 |
|
WD01 | Invention patent application deemed withdrawn after publication |