CN101141666B - Method of converting text note to voice broadcast in mobile phone - Google Patents

Method of converting text note to voice broadcast in mobile phone Download PDF

Info

Publication number
CN101141666B
CN101141666B CN2006100624763A CN200610062476A CN101141666B CN 101141666 B CN101141666 B CN 101141666B CN 2006100624763 A CN2006100624763 A CN 2006100624763A CN 200610062476 A CN200610062476 A CN 200610062476A CN 101141666 B CN101141666 B CN 101141666B
Authority
CN
China
Prior art keywords
phonetic
storehouse
dynamic
mobile phone
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2006100624763A
Other languages
Chinese (zh)
Other versions
CN101141666A (en
Inventor
江莉军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN2006100624763A priority Critical patent/CN101141666B/en
Publication of CN101141666A publication Critical patent/CN101141666A/en
Application granted granted Critical
Publication of CN101141666B publication Critical patent/CN101141666B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Mobile Radio Communication Systems (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The present invention discloses a method which is used to transform a text message into a voice playing in a cell phone. The method aims at the content of the characters in the existing message, to initialize the dynamic Pinyin base after starting up the cell phone; the dynamic Pinyin base is adjusted after receiving a new message; the characters in the text are transformed into the Pinyin and the tones, after the user pressing a voice playing key while reading the message, then each Pinyin data is called out from the dynamic Pinyin base, to form a Pinyin data flow, to be sent to a voice processing chip; the voice processing chip transforms the voice data flow into a simulating voice signal, and the simulating voice signal is sent to a speaker of the cell phone to play out. A big storage space of the cell phone is unnecessary by adopting the present invention, and the content of the message can be transformed into the voice to playing. The method is quite convenient for the cell phone user.

Description

In mobile phone, text SMS is converted to the method for speech play
Technical field
The present invention relates to the communication technology, particularly literal is to speech recognition and treatment technology.
Background technology
The SMS of text formatting must enter under the note reading model and could read.In life, have and many times be inconvenient to read, as driving, situations such as sleep; There is the part user can't read especially, as old man, children or Dyslexia personage.Be inconvenient to read under the situation of note, when checking that existing note or new message arrive, broadcasting, will bring great convenience the cellphone subscriber if can convert text SMS to voice by mobile phone.
Text-converted is become the technology of voice, i.e. " Text To Speech " is called for short TTS.Under the support of built-in chip,, literal is converted into natural-sounding stream intelligently by the design of neural net.The TTS speech synthesis technique of realizing on PC and the high-grade smart mobile phone of part is about to cover GB I and II Chinese character, has English interface, discerns Chinese and English automatically, supports Chinese and English to mix and reads.In addition, a kind of speech play technology of curing, for example bus stop reporting system is all ripe.Yet, for most mobile phones, both there be not enough resources to support tts system, also can't deal with the short message content of receiving that dynamic change with the speech play technology of solidifying.
As long as phonetic is arranged, how this reads just to know this Chinese character, and this is the basic principle of TTS technology.And present nearly all mobile phone all is with spelling input method, this means that Chinese character base in the mobile phone and phonetic storehouse have existed and corresponding relation arranged.This makes the realization of switch technology cheaply on mobile phone of Text To Speech become possibility just for the present invention saves a large amount of speech data memory spaces.
Summary of the invention
Technical problem to be solved by this invention is that a kind of method that realizes text SMS is converted to speech play in regular handset is provided.
Technical scheme of the present invention may further comprise the steps:
At the word content in the existing note, initialization is carried out in dynamic phonetic storehouse behind A, the mobile phone power-on; After receiving new message dynamic phonetic storehouse is adjusted; Described dynamic phonetic storehouse has statistical analysis of high frequency word and automatic adjustment capability, described dynamic phonetic storehouse comprises the higher Chinese character of frequency of utilization, Arabic numerals, punctuation mark and they the pairing phonetic that pronounces in Chinese, and English alphabet and their pronunciation, the higher Chinese character of described frequency of utilization upgrades in mobile phone initialization foundation and after at new message;
Dynamically the foundation in phonetic storehouse and the process of renewal are: free time behind mobile phone power-on, according to existing short message content, count high frequency Chinese, and from intrinsic phonetic storehouse, get access to corresponding phonetic and tone, they are saved in the dynamic phonetic storehouse; With the Arabic numerals in the intrinsic phonetic storehouse, punctuation mark and the pairing phonetic of their Chinese pronunciations and 26 English alphabets copy in the dynamic phonetic storehouse simultaneously; New message counts the high frequency word after coming again, and upgrades the dynamic voice storehouse;
B, user become phonetic and tone with the text conversion in the text press the speech play key when reading note after, each phonetic data are accessed from dynamic phonetic storehouse again, form the phonetic data flow, deliver to pronounciation processing chip;
C, pronounciation processing chip convert audio data stream to simulated audio signal and deliver in the mobile phone speaker and broadcast.
Word content in the described note comprises Chinese character, Arabic numerals, punctuation mark and English alphabet, and wherein the processing method to English is the simple letter-sound that is split as.
In such scheme, phonetic transcriptions of Chinese characters combination is added the voice of tone, the language of English alphabet, be put in the speech chip, form the correspondence table of a phonetic to phonetic code by the sequential write of phonetic code, speech chip is searched corresponding phonetic code according to phonetic stream, converts text SMS to audio data stream.
Method of the present invention does not need mobile phone that big memory space is arranged, and can solve the short message content that will receive and convert speech play to, and is concerning the cellphone subscriber, very convenient.
Description of drawings
Fig. 1 is a flow chart of the present invention.
Embodiment
Below in conjunction with Fig. 1 enforcement of the present invention is further described.
Behind A, the mobile phone power-on, initialization (promptly to the Chinese character part in the literal with counting the high frequency word, add that Arabic numerals, punctuation mark and English alphabet form dynamic phonetic storehouse) is carried out in dynamic phonetic storehouse in conjunction with Arabic numerals, punctuation mark and English alphabet; If do not receive new note, can directly carry out speech play to original note; If receive new message, then dynamic phonetic storehouse is adjusted;
B, user become phonetic and tone with the text conversion in the text press the speech play key when reading note after, each phonetic data are accessed from dynamic phonetic storehouse again, form the phonetic data flow, deliver to pronounciation processing chip;
C, pronounciation processing chip convert audio data stream to simulated audio signal and deliver in the mobile phone speaker and broadcast.
The present invention relates to an intrinsic phonetic storehouse and dynamic phonetic storehouse.This processing is the restriction for hardware memory resource that solves mobile phone and handset processes device chip arithmetic speed, improves the processing speed of whole system.Dynamically the phonetic stock is put in the volatile memory (such as RAM), and it is little to take storage space, and access speed is fast, no erasing times restriction.Dynamically the phonetic storehouse should have statistical analysis of high frequency word and automatic adjustment capability.
Intrinsic phonetic storehouse comprises Arabic numerals, punctuation mark and the pairing phonetic of their Chinese pronunciations and 26 English alphabets.Intrinsic phonetic storehouse is actually an expansion on the database that spelling input method carries in mobile phone, and the nonvolatile memory that takies (such as FLASH) space is minimum.The spelling input method phonetic storehouse that carries on the mobile phone is to obtain Chinese character by input Pinyin, and the present invention need change Chinese character to phonetic, and is with tone information.For the processing of English, be that the simple letter-sound that is split as broadcasts.
Dynamically the establishment and the method for updating in phonetic storehouse are:
The dynamically foundation in phonetic storehouse, be behind mobile phone power-on free time, according to existing short message content, count high frequency Chinese, from intrinsic phonetic storehouse, get access to the phonetic and the tone of correspondence, they are saved in the dynamic phonetic storehouse.With the Arabic numerals in the intrinsic phonetic storehouse, punctuation mark and the pairing phonetic of their Chinese pronunciations and 26 English alphabets copy in the dynamic phonetic storehouse simultaneously.New message counts the high frequency word after coming again, and upgrades the dynamic voice storehouse.
In the present Chinese mobile phone, spelling input method is the main input method of Chinese character.Except that the minority polyphone, be one-to-one relationship between Chinese character and the phonetic, from Chinese word library, choose a Chinese character, then there is a phonetic corresponding with it.This relation one to one can be passed through to improve the character library interface, thereby obtain easily.
To the processing of polyphone, from the context could the decision, in the present invention, we get frequency and the highest pronunciation occurs.
The following describes the processing procedure of text SMS to the phonetic data flow:
1, receives text SMS.
2, data if contain the character that dynamic phonetic storehouse does not have in the text note, are just extracted in the dynamic phonetic of contrast storehouse from intrinsic phonetic storehouse, and upgrade this dynamic phonetic storehouse.
3, Arabic numerals in the text SMS and punctuation mark, convert the Chinese character of Chinese pronunciations correspondence to, form the note of Chinese character and English alphabetic combination.
4, from dynamic sound bank, the phonetic and the tone of each character correspondence are combined into the phonetic data flow then in the note of search Chinese character and English alphabetic combination.
To the phonetic data transaction is become voice signal, can adopt the ISD4004 speech chip, this chip is deposited sound by section, every section sound, each sound 300ms.The phonetic transcriptions of Chinese characters combination adds that tone information amounts to 1311 voice (Chinese pronunciation that comprises the punctuation mark correspondence), add 26 English alphabets, totally 1337 voice, these 1337 voice can be put in the ISD4004 speech chip by the sequential write of phonetic code, form the correspondence table of a phonetic, as table 1 to phonetic code.After obtaining phonetic stream, phonetic stream is sent into speech chip, then can search corresponding phonetic code, convert audio data stream to by speech chip according to phonetic stream.
Table 1, the correspondence table from phonetic to the phonetic code
At different cell phone platforms, the execution mode difference that it is concrete, i.e. the method difference of embedded programming.The present invention is applicable to non intelligent mobile phone.To smart mobile phone, based on operating systems such as windows mobile,, then can call SDK easily if be equipped with the TTS speech engine of Microsoft, realize the process of speech conversion.

Claims (3)

1. method that text SMS is converted to speech play in mobile phone may further comprise the steps:
At the word content in the existing note, initialization is carried out in dynamic phonetic storehouse behind A, the mobile phone power-on; After receiving new message dynamic phonetic storehouse is adjusted; Described dynamic phonetic storehouse has statistical analysis of high frequency word and automatic adjustment capability, described dynamic phonetic storehouse comprises the higher Chinese character of frequency of utilization, Arabic numerals, punctuation mark and they the pairing phonetic that pronounces in Chinese, and English alphabet and their pronunciation, the higher Chinese character of described frequency of utilization upgrades in mobile phone initialization foundation and after at new message;
Dynamically the foundation in phonetic storehouse and the process of renewal are: free time behind mobile phone power-on, according to existing short message content, count high frequency Chinese, and from intrinsic phonetic storehouse, get access to corresponding phonetic and tone, they are saved in the dynamic phonetic storehouse; With the Arabic numerals in the intrinsic phonetic storehouse, punctuation mark and the pairing phonetic of their Chinese pronunciations and 26 English alphabets copy in the dynamic phonetic storehouse simultaneously; New message counts the high frequency word after coming again, and upgrades the dynamic voice storehouse;
B, user become phonetic and tone with the text conversion in the text press the speech play key when reading note after, each phonetic data are accessed from dynamic phonetic storehouse again, form the phonetic data flow, deliver to pronounciation processing chip;
C, pronounciation processing chip convert audio data stream to simulated audio signal and deliver in the mobile phone speaker and broadcast.
2. the described method that in mobile phone, text SMS is converted to speech play of claim 1, it is characterized in that, word content in the described note comprises Chinese character, Arabic numerals, punctuation mark and English alphabet, and wherein the processing method to English is the simple letter-sound that is split as.
3. the described method that in mobile phone, text SMS is converted to speech play of claim 2, it is characterized in that, the phonetic transcriptions of Chinese characters combination is added the voice of tone, the voice of English alphabet, sequential write by phonetic code is put in the speech chip, form the correspondence table of a phonetic to phonetic code, speech chip is searched corresponding phonetic code according to phonetic stream, converts text SMS to audio data stream.
CN2006100624763A 2006-09-05 2006-09-05 Method of converting text note to voice broadcast in mobile phone Expired - Fee Related CN101141666B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2006100624763A CN101141666B (en) 2006-09-05 2006-09-05 Method of converting text note to voice broadcast in mobile phone

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2006100624763A CN101141666B (en) 2006-09-05 2006-09-05 Method of converting text note to voice broadcast in mobile phone

Publications (2)

Publication Number Publication Date
CN101141666A CN101141666A (en) 2008-03-12
CN101141666B true CN101141666B (en) 2011-02-23

Family

ID=39193356

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006100624763A Expired - Fee Related CN101141666B (en) 2006-09-05 2006-09-05 Method of converting text note to voice broadcast in mobile phone

Country Status (1)

Country Link
CN (1) CN101141666B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103888611A (en) * 2014-03-20 2014-06-25 联想(北京)有限公司 Output method and communication devices

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103108092A (en) * 2011-11-15 2013-05-15 希姆通信息技术(上海)有限公司 Communication terminal and information transmitting method between communication terminals
CN102421072A (en) * 2011-12-22 2012-04-18 苏州巴米特信息科技有限公司 Data communication method of voice message
CN102695148B (en) * 2012-06-01 2015-01-21 上海车音网络科技有限公司 Methods and devices for sending and receiving short message, and short message sending and receiving system
CN103902600B (en) * 2012-12-27 2017-12-01 富士通株式会社 Lists of keywords forming apparatus and method and electronic equipment
CN103973442A (en) * 2013-02-01 2014-08-06 国民技术股份有限公司 Verification code transmitting and acquiring methods, mobile phone and electronic equipment
CN104035550B (en) * 2013-03-07 2017-12-22 腾讯科技(深圳)有限公司 Information provides mode switching method and device
CN103257787B (en) * 2013-05-16 2016-07-13 小米科技有限责任公司 The open method of a kind of voice assistant application and device
US20150142444A1 (en) * 2013-11-15 2015-05-21 International Business Machines Corporation Audio rendering order for text sources
CN104900226A (en) * 2014-03-03 2015-09-09 联想(北京)有限公司 Information processing method and device
CN103929534B (en) * 2014-03-19 2017-05-24 联想(北京)有限公司 Information processing method and electronic equipment
CN105677008A (en) * 2014-11-21 2016-06-15 联想(北京)有限公司 Information processing method and electronic equipment
CN105573978A (en) * 2015-12-10 2016-05-11 温州雏鹰科技有限公司 Short message service information processing method and device
CN106791078A (en) * 2016-12-18 2017-05-31 程在舒 The speech playing method and application of mobile terminal new information and Domestic News
CN109509464B (en) * 2017-09-11 2022-11-04 珠海金山办公软件有限公司 Method and device for recording text reading as audio
CN112069805A (en) * 2019-12-20 2020-12-11 北京来也网络科技有限公司 Text labeling method, device, equipment and storage medium combining RPA and AI

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1455386A (en) * 2002-11-01 2003-11-12 中国科学院声学研究所 Imbedded voice synthesis method and system
CN1713761A (en) * 2004-06-22 2005-12-28 乐金电子(中国)研究开发中心有限公司 Speech outputting method for short message
CN1731509A (en) * 2005-09-02 2006-02-08 清华大学 Mobile speech synthesis method
CN1747500A (en) * 2005-09-30 2006-03-15 熊猫电子集团有限公司 Method and device for setting mobile communication terminal for person with short-eyesight

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1455386A (en) * 2002-11-01 2003-11-12 中国科学院声学研究所 Imbedded voice synthesis method and system
CN1713761A (en) * 2004-06-22 2005-12-28 乐金电子(中国)研究开发中心有限公司 Speech outputting method for short message
CN1731509A (en) * 2005-09-02 2006-02-08 清华大学 Mobile speech synthesis method
CN1747500A (en) * 2005-09-30 2006-03-15 熊猫电子集团有限公司 Method and device for setting mobile communication terminal for person with short-eyesight

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JP特开2002-368885A 2002.12.20

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103888611A (en) * 2014-03-20 2014-06-25 联想(北京)有限公司 Output method and communication devices
CN103888611B (en) * 2014-03-20 2016-01-27 联想(北京)有限公司 A kind of output intent and communication equipment

Also Published As

Publication number Publication date
CN101141666A (en) 2008-03-12

Similar Documents

Publication Publication Date Title
CN101141666B (en) Method of converting text note to voice broadcast in mobile phone
CN101164102B (en) Methods and apparatus for automatically extending the voice vocabulary of mobile communications devices
EP2389672B1 (en) Method, apparatus and computer program product for providing compound models for speech recognition adaptation
CN111261144A (en) Voice recognition method, device, terminal and storage medium
CN101563683A (en) Method, apparatus and computer program product for providing flexible text based language identification
CN104050966A (en) Voice interaction method of terminal equipment and terminal equipment employing voice interaction method
CN112580335B (en) Method and device for disambiguating polyphone
CN101199122A (en) Using language models to expand wildcards
CN101542590A (en) Method, apparatus and computer program product for providing a language based interactive multimedia system
CN102543071A (en) Voice recognition system and method used for mobile equipment
CN103903621A (en) Method for voice recognition and electronic equipment
CN104202455A (en) Intelligent voice dialing method and intelligent voice dialing device
CN102831892A (en) Toy control method and system based on internet voice interaction
CN101174448A (en) Talking picture playing method and device, method for generating index file of talking picture
CN102847325A (en) Toy control method and system based on voice interaction of mobile communication terminal
CN101354886A (en) Apparatus for recognizing speech
CN1855223B (en) Audio font output device, font database, and language input front end processor
CN1455386A (en) Imbedded voice synthesis method and system
JP2009175630A (en) Speech recognition device, mobile terminal, speech recognition system, speech recognition device control method, mobile terminal control method, control program, and computer readable recording medium with program recorded therein
US20080091427A1 (en) Hierarchical word indexes used for efficient N-gram storage
CN1333501A (en) Dynamic Chinese speech synthesizing method
CN1972478A (en) A novel method for mobile phone reading short message
CN201788597U (en) Reading pen
CN117478239B (en) Method for generating and transmitting emergency information by using audio two-dimension code
CN1988462A (en) Method for sound quoted price for share information of action equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110223

Termination date: 20150905

EXPY Termination of patent right or utility model