TW200705222A - Method of synchronizing speech waveform playback and text display - Google Patents
Method of synchronizing speech waveform playback and text display
Info
- Publication number
- TW200705222A
- Authority
- TW
- Taiwan
- Prior art keywords
- feature vector
- speech waveform
- vector sequence
- getting
- text display
Landscapes
- Document Processing Apparatus (AREA)
- Electrically Operated Instructional Devices (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
A method of synchronizing speech waveform playback and text display is disclosed. The synchronization can be performed at the syllable, character, or word level. The method includes the following steps: getting input text, which includes multiple syllables, characters, and words; getting a reference feature vector sequence for the input text by concatenating the reference feature vector sequences of its linguistic units, taken from a database of feature vector sequences of all linguistic units (such as syllables, characters, and words) for the target language or languages; getting a speech waveform; extracting a feature vector sequence from the speech waveform; and searching for the syllable boundaries by aligning the extracted feature vector sequence with the reference feature vector sequence, where the alignment is performed using the Dynamic Time Warping (DTW) technique.
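The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`concatenate_references`, `dtw_path`, `unit_boundaries`), the toy one-dimensional feature vectors, the syllable labels, and the Euclidean frame distance are all assumptions made for illustration. The abstract does not specify the feature type, the distance measure, or the DTW step pattern.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def concatenate_references(units, reference_db):
    """Concatenate per-unit reference sequences; record each unit's last frame index."""
    reference, unit_ends = [], []
    for unit in units:
        reference.extend(reference_db[unit])
        unit_ends.append(len(reference) - 1)
    return reference, unit_ends


def dtw_path(observed, reference):
    """Return a minimum-cost DTW path as (observed_index, reference_index) pairs."""
    n, m = len(observed), len(reference)
    inf = float("inf")
    # cost[i][j]: best cumulative cost aligning the first i observed frames
    # with the first j reference frames.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(observed[i - 1], reference[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end, always moving to the cheapest predecessor cell.
    path, i, j = [], n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return path


def unit_boundaries(path, unit_ends):
    """For each unit, the last observed frame aligned to that unit's reference frames."""
    return [max(i for i, j in path if j <= end) for end in unit_ends]


# Toy data (hypothetical): two syllables with 1-D "feature vectors".
reference_db = {"ni": [[0.0], [0.0]], "hao": [[5.0], [5.0]]}
observed = [[0.1], [0.0], [0.2], [4.9], [5.1]]  # features extracted from speech

reference, unit_ends = concatenate_references(["ni", "hao"], reference_db)
boundaries = unit_boundaries(dtw_path(observed, reference), unit_ends)
print(boundaries)  # → [2, 4]
```

Each boundary is an index into the extracted frame sequence. Multiplying it by the frame hop (for example 10 ms, also an assumption) would give the playback time at which the text display could advance to the next syllable.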
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW094125461A TWI269191B (en) | 2005-07-27 | 2005-07-27 | Method of synchronizing speech waveform playback and text display |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW094125461A TWI269191B (en) | 2005-07-27 | 2005-07-27 | Method of synchronizing speech waveform playback and text display |
Publications (2)
Publication Number | Publication Date |
---|---|
TWI269191B (en) | 2006-12-21 |
TW200705222A (en) | 2007-02-01 |
Family
ID=38291478
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW094125461A TWI269191B (en) | Method of synchronizing speech waveform playback and text display | 2005-07-27 | 2005-07-27 |
Country Status (1)
Country | Link |
---|---|
TW (1) | TWI269191B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI470589B (en) * | 2011-08-12 | 2015-01-21 | Hwa Jiuh Digital Technology Ltd | Cloud digital speech recording system |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI402824B (en) * | 2009-10-15 | 2013-07-21 | Univ Nat Cheng Kung | A pronunciation variation generation method for spontaneous speech synthesis |
2005
- 2005-07-27: TW application TW094125461A, patent TWI269191B (en), not active (IP Right Cessation)
Also Published As
Publication number | Publication date |
---|---|
TWI269191B (en) | 2006-12-21 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
EP1675019B1 (en) | System and method for disambiguating non diacritized arabic words in a text | |
TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
Zair | Oscan in the Greek alphabet | |
DE602004018290D1 (en) | LANGUAGE RECOGNITION AND CORRECTION SYSTEM, CORRECTION DEVICE AND METHOD FOR GENERATING A LEXICON OF ALTERNATIVES | |
TW200707404A (en) | Speech recognition assisted autocompletion of composite characters | |
WO2006086511A3 (en) | Method and apparatus utilizing voice input to resolve ambiguous manually entered text input | |
Tachicart et al. | A hybrid approach to translate Moroccan Arabic dialect | |
WO2005089428A3 (en) | Language phonetic system and method thereof | |
Gulö | The Influence of Nias Language to Bahasa Indonesia | |
WO2004059461A3 (en) | Electronic dictionary with example sentences | |
WO2004114253A3 (en) | Method of teaching reading | |
TW200705222A (en) | Method of synchronizing speech waveform playback and text display | |
Tuisk | Tonal and temporal characteristics of disyllabic words in spontaneous Livonian | |
Dohlus | The role of phonology and phonetics in loanword adaptation: German and French front rounded vowels in Japanese | |
Yahalom | Palestinian Tradition | |
Tseng et al. | Prosodic differences between taiwanese L2 and North American L1 speakers—under-differentiation of lexical stress | |
Saikia et al. | Generating Manipuri English pronunciation dictionary using sequence labelling problem | |
Hui et al. | Adaptation of English stops into Mandarin Chinese. | |
Wang et al. | Rule-based korean grapheme to phoneme conversion using sound patterns | |
Youguang | The Chinese finger alphabet and the Chinese finger syllabary | |
TW200725505A (en) | System and method of dictation learning for correcting pronunciation | |
Inglis | Myanmar-based Khamti Shan orthography | |
Chirkova et al. | Běijīng, The Language of | |
Weng | Vernacular Language Movement | |
Chirkova et al. | Beijing Mandarin, the language of Beijing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM4A | Annulment or lapse of patent due to non-payment of fees | ||