CN109801619A - A kind of across language voice identification method for transformation of intelligence - Google Patents

A kind of across language voice identification method for transformation of intelligence Download PDF

Info

Publication number
CN109801619A
CN109801619A CN201910112299.2A CN201910112299A CN109801619A CN 109801619 A CN109801619 A CN 109801619A CN 201910112299 A CN201910112299 A CN 201910112299A CN 109801619 A CN109801619 A CN 109801619A
Authority
CN
China
Prior art keywords
voice
speech recognition
module
speech
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910112299.2A
Other languages
Chinese (zh)
Inventor
葛星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Dachidu Network Media Co Ltd
Original Assignee
Anhui Dachidu Network Media Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Dachidu Network Media Co Ltd filed Critical Anhui Dachidu Network Media Co Ltd
Priority to CN201910112299.2A priority Critical patent/CN109801619A/en
Publication of CN109801619A publication Critical patent/CN109801619A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

The invention discloses a kind of across the language voice identification method for transformation of intelligence, and across the language voice identification method for transformation of the intelligence is the following steps are included: S1: voice is collected, S2: speech recognition, S3: voice conversion, S4: speech recognition judgement.The present invention is ingenious in design, it can be realized and identification conversion is carried out to voice, and method of the invention is convenient to voice progress denoising, to guarantee the sound quality of voice, the people of different zones are facilitated to carry out communication exchange, it is convenient that voice to be identified is collected by voice collection module, to guarantee to provide precondition for voice conversion;Pass through speech recognition module one and speech recognition module two, speech recognition module one is facilitated first to carry out identification judgement to voice, then the judgement after converting to voice is carried out by speech recognition module two again belong to that voice family of languages, it is convenient that Classification of Speech is stored, also facilitate other equipment to access speech database.

Description

A kind of across language voice identification method for transformation of intelligence
Technical field
The present invention relates to voice transformation technology field more particularly to a kind of across the language voice identification method for transformation of intelligence.
Background technique
Voice refers to that the mankind are issued by vocal organs, with definite meaning, purpose is for carrying out social friendship The sound on border.In the shape of language, sound, adopted three essential attributes, voice is the first attribute, and the language of the mankind is with language first The form of sound is formed, and has letterless language in the world, but not without the language of voice, voice plays conclusive branch in language Support effect.
Voice, i.e. the substance shell of language, are the external forms of language, are the most directly symbols of the thinking activities of recorder Number system.It is the sound with certain social effect of the vocal organs sending of people.The physical basis of voice mainly have pitch, Loudness of a sound, the duration of a sound, tone color, this is also four elements for constituting voice.
Existing speech category is not relatively more, and the exchange between people is then that communication exchange is carried out by voice, but not It is then to need voice to carry out to be converted into the mutually known voice of people with region people, to guarantee that people carry out communication exchange, When the people of different zones exchange, need to convert language, but present voice method for transformation, it cannot be to language The family of languages belonging to sound differentiated, therefore is inconvenient to store or inconvenient other equipment access speech database, existing Voice method for transformation, cannot identify and voice is pre-processed, therefore just will affect the sound quality of voice, so influence user Voice is understood, in consideration of it, the present invention provides a kind of across language voice identification method for transformation of intelligence.
Summary of the invention
The purpose of the present invention is to solve cannot differentiate in the prior art to the family of languages belonging to voice, therefore not side Just storage or inconvenient other equipment access speech database, and existing voice method for transformation cannot be identified to voice A kind of intelligence for being pre-processed, therefore just will affect the sound quality of voice, and then influenced the disadvantages of user understands voice, and propose Across the language voice identification method for transformation of energyization.
To achieve the goals above, present invention employs following technical solutions:
A kind of across language voice identification method for transformation of intelligence, across the language voice identification method for transformation of the intelligence includes following step It is rapid:
S1: voice is collected: the voice data of conversion to be identified is obtained by voice collection module;
S2: the voice data of collection first speech recognition: is carried out identification comparison by speech recognition module one;
S3: after the voice data identification comparison in S2, conversion classification voice conversion: is carried out to voice by voice conversion module;
S4: speech recognition judgement: the voice after converting in S3 carries out identification judgement by speech recognition module two, and speech recognition is looked for Data are then stored in speech database after to the corresponding voice family of languages, otherwise voice then enter in S3 converted again with And identification.
Preferably, the voice collection module includes speech preprocessing module, and speech preprocessing module is used for voice Denoising.
It preferably, include Speech comparison module, voice pair in the speech recognition module one and speech recognition module two Than module for carrying out identification judgement to voice data or the voice family of languages.
Preferably, the speech database is used to store the data of classification, wherein speech recognition module one and speech recognition Module two is compared with the voice data in speech database respectively, and speech recognition module one is for comparing identification voice data Data in library, speech recognition module two is for carrying out identification judgement to the voice family of languages.
Across the language voice identification method for transformation of a kind of intelligence proposed by the present invention, beneficial effect are: present invention design It is ingenious, it can be realized and identification conversion is carried out to voice, and method of the invention is convenient to voice progress denoising, to guarantee language The sound quality of sound facilitates the people of different zones to carry out communication exchange, by voice collection module, it is convenient to voice to be identified into Row is collected, to guarantee to provide precondition for voice conversion.
By speech recognition module one and speech recognition module two, speech recognition module one is facilitated first to identify to voice Then judgement is carried out the judgement after converting to voice by speech recognition module two again and belongs to that voice family of languages, convenient to language Sound classified storage also facilitates other equipment to access speech database.
Voice collection module first collects voice to be identified, and voice carries out at denoising voice by speech preprocessing module Reason, the voice after denoising compares judgement by the data in speech recognition module one and speech database, if voice number There is no corresponding voice data according to the data in library, then voice enters voice conversion module and converted, and otherwise voice is then It being stored in speech database, the voice after conversion is compared identification by speech recognition module two by voice conversion module, If the voice data after conversion meets the family of languages of voice data library standard, data are stored in speech database, otherwise language Sound then passes through voice conversion module and is converted again.
Detailed description of the invention
Fig. 1 is flow diagram of the invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.
Referring to Fig.1, across the language voice identification method for transformation of a kind of intelligence, across the language voice identification conversion side of the intelligence Method the following steps are included:
S1: voice is collected: the voice data of conversion to be identified is obtained by voice collection module, voice collection module includes voice Preprocessing module, speech preprocessing module is for the denoising to voice;
S2: the voice data of collection first speech recognition: is carried out identification comparison by speech recognition module one;
S3: after the voice data identification comparison in S2, conversion classification voice conversion: is carried out to voice by voice conversion module;
S4: speech recognition judgement: the voice after converting in S3 carries out identification judgement by speech recognition module two, and speech recognition is looked for Data are then stored in speech database after to the corresponding voice family of languages, otherwise voice then enter in S3 converted again with And identification.
In speech recognition module one and speech recognition module two include Speech comparison module, Speech comparison module for pair Voice data or the voice family of languages carry out identification judgement, and speech database is used to store the data of classification, wherein speech recognition mould Block one and speech recognition module two are compared with the voice data in speech database respectively, speech recognition module one for pair Than the data in identification speech database, speech recognition module two is known for carrying out identification judgement to the voice family of languages by voice Other module one and speech recognition module two, facilitate speech recognition module one first to carry out identification judgement to voice, then pass through language again Sound identification module two carries out the judgement after converting to voice and belongs to that voice family of languages, convenient to store to Classification of Speech, also facilitates Other equipment access speech database.
The present invention is ingenious in design, can be realized and carries out identification conversion to voice, and method of the invention it is convenient to voice into Row denoising facilitates the people of different zones to carry out communication exchange to guarantee the sound quality of voice, by voice collection module, It is convenient that voice to be identified is collected, to guarantee to provide precondition for voice conversion.
Working principle: voice collection module first collects voice to be identified, and voice is by speech preprocessing module to voice Denoising is carried out, the voice after denoising compares judgement by the data in speech recognition module one and speech database, If the data in speech database do not have corresponding voice data, voice enters voice conversion module and is converted, Otherwise voice is then stored in speech database, and voice conversion module carries out the voice after conversion by speech recognition module two Comparison identification, if the voice data after conversion meets the family of languages of voice data library standard, data are stored in speech database Interior, otherwise voice then passes through voice conversion module and is converted again.
The foregoing is only a preferred embodiment of the present invention, but scope of protection of the present invention is not limited thereto, Anyone skilled in the art in the technical scope disclosed by the present invention, according to the technique and scheme of the present invention and its Inventive concept is subject to equivalent substitution or change, should be covered by the protection scope of the present invention.

Claims (4)

1. a kind of across language voice identification method for transformation of intelligence, which is characterized in that across the language voice identification conversion of the intelligence Method the following steps are included:
S1: voice is collected: the voice data of conversion to be identified is obtained by voice collection module;
S2: the voice data of collection first speech recognition: is carried out identification comparison by speech recognition module one;
S3: after the voice data identification comparison in S2, conversion classification voice conversion: is carried out to voice by voice conversion module;
S4: speech recognition judgement: the voice after converting in S3 carries out identification judgement by speech recognition module two, and speech recognition is looked for Data are then stored in speech database after to the corresponding voice family of languages, otherwise voice then enter in S3 converted again with And identification.
2. across the language voice identification method for transformation of a kind of intelligence according to claim 1, it is characterised in that: the voice Collection module includes speech preprocessing module, and speech preprocessing module is for the denoising to voice.
3. across the language voice identification method for transformation of a kind of intelligence according to claim 1, it is characterised in that: the voice Include Speech comparison module in identification module one and speech recognition module two, Speech comparison module be used for voice data or The voice family of languages carries out identification judgement.
4. across the language voice identification method for transformation of a kind of intelligence according to claim 1, it is characterised in that: the voice Database is used to store the data of classification, and wherein speech recognition module one and speech recognition module two are respectively and in speech database Voice data compare, speech recognition module one be used for compare identify speech database in data, speech recognition module Two for carrying out identification judgement to the voice family of languages.
CN201910112299.2A 2019-02-13 2019-02-13 A kind of across language voice identification method for transformation of intelligence Pending CN109801619A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910112299.2A CN109801619A (en) 2019-02-13 2019-02-13 A kind of across language voice identification method for transformation of intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910112299.2A CN109801619A (en) 2019-02-13 2019-02-13 A kind of across language voice identification method for transformation of intelligence

Publications (1)

Publication Number Publication Date
CN109801619A true CN109801619A (en) 2019-05-24

Family

ID=66562187

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910112299.2A Pending CN109801619A (en) 2019-02-13 2019-02-13 A kind of across language voice identification method for transformation of intelligence

Country Status (1)

Country Link
CN (1) CN109801619A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102074231A (en) * 2010-12-30 2011-05-25 万音达有限公司 Voice recognition method and system
US8374865B1 (en) * 2012-04-26 2013-02-12 Google Inc. Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
US20160343368A1 (en) * 2013-01-17 2016-11-24 Speech Morphing Systems, Inc. Method and apparatus to model and transfer the prosody of tags across languages
CN106409285A (en) * 2016-11-16 2017-02-15 杭州联络互动信息科技股份有限公司 Method and apparatus for intelligent terminal device to identify language type according to voice data
CN107808659A (en) * 2017-12-02 2018-03-16 宫文峰 Intelligent sound signal type recognition system device
CN107945805A (en) * 2017-12-19 2018-04-20 程海波 A kind of intelligent across language voice identification method for transformation
CN108648747A (en) * 2018-03-21 2018-10-12 清华大学 Language recognition system
CN109065020A (en) * 2018-07-28 2018-12-21 重庆柚瓣家科技有限公司 The identification storehouse matching method and system of multilingual classification

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102074231A (en) * 2010-12-30 2011-05-25 万音达有限公司 Voice recognition method and system
US8374865B1 (en) * 2012-04-26 2013-02-12 Google Inc. Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
US20160343368A1 (en) * 2013-01-17 2016-11-24 Speech Morphing Systems, Inc. Method and apparatus to model and transfer the prosody of tags across languages
CN106409285A (en) * 2016-11-16 2017-02-15 杭州联络互动信息科技股份有限公司 Method and apparatus for intelligent terminal device to identify language type according to voice data
CN107808659A (en) * 2017-12-02 2018-03-16 宫文峰 Intelligent sound signal type recognition system device
CN107945805A (en) * 2017-12-19 2018-04-20 程海波 A kind of intelligent across language voice identification method for transformation
CN108648747A (en) * 2018-03-21 2018-10-12 清华大学 Language recognition system
CN109065020A (en) * 2018-07-28 2018-12-21 重庆柚瓣家科技有限公司 The identification storehouse matching method and system of multilingual classification

Similar Documents

Publication Publication Date Title
CN108877801A (en) More wheel dialog semantics based on multi-modal Emotion identification system understand subsystem
CN108899050A (en) Speech signal analysis subsystem based on multi-modal Emotion identification system
CN108805087A (en) Semantic temporal fusion association based on multi-modal Emotion identification system judges subsystem
CN106357942A (en) Intelligent response method and system based on context dialogue semantic recognition
CN110459204A (en) Audio recognition method, device, storage medium and electronic equipment
CN107818785A (en) A kind of method and terminal device that information is extracted from multimedia file
CN105244042B (en) A kind of speech emotional interactive device and method based on finite-state automata
CN106294774A (en) User individual data processing method based on dialogue service and device
CN105427855A (en) Voice broadcast system and voice broadcast method of intelligent software
CN109344240A (en) A kind of data processing method, server and electronic equipment
CN106653019A (en) Man-machine conversation control method and system based on user registration information
CN106709804A (en) Interactive wealth planning consulting robot system
CN111723239A (en) Multi-mode-based video annotation method
CN110347811A (en) A kind of professional knowledge question and answer robot system based on artificial intelligence
CN105845143A (en) Speaker confirmation method and speaker confirmation system based on support vector machine
CN105957517A (en) Voice data structured conversion method and system based on open source API
CN113569924B (en) Emotion identification classification method based on support vector machine multi-core cooperation
CN110378190A (en) Video content detection system and detection method based on topic identification
CN114169364A (en) Electroencephalogram emotion recognition method based on space-time diagram model
Zhang et al. Research on spectrum sensing system based on composite neural network
CN109243458A (en) A kind of speech recognition system for intelligent robot
CN109801619A (en) A kind of across language voice identification method for transformation of intelligence
CN115861670A (en) Training method of feature extraction model and data processing method and device
CN108717851A (en) A kind of audio recognition method and device
CN115145402A (en) Intelligent toy system with network interaction function and control method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190524