CN104078044B - The method and apparatus of mobile terminal and recording search thereof - Google Patents

The method and apparatus of mobile terminal and recording search thereof Download PDF

Info

Publication number
CN104078044B
CN104078044B CN201410312543.7A CN201410312543A CN104078044B CN 104078044 B CN104078044 B CN 104078044B CN 201410312543 A CN201410312543 A CN 201410312543A CN 104078044 B CN104078044 B CN 104078044B
Authority
CN
China
Prior art keywords
content
text
voice
recording
key word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410312543.7A
Other languages
Chinese (zh)
Other versions
CN104078044A (en
Inventor
姚光华
张圣杰
谭焕清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nubia Technology Co Ltd
Original Assignee
Nubia Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nubia Technology Co Ltd filed Critical Nubia Technology Co Ltd
Priority to CN201410312543.7A priority Critical patent/CN104078044B/en
Publication of CN104078044A publication Critical patent/CN104078044A/en
Application granted granted Critical
Publication of CN104078044B publication Critical patent/CN104078044B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses the method and apparatus of mobile terminal and recording search thereof, belong to technical field of mobile terminals.The method of this recording search comprises: the voice content voice document with time attribute being divided into the free attribute of some length of tape; Respectively every section of voice content is converted to corresponding content of text, and all content of text are stored in text, described content of text has the mark be associated with described time attribute; Storaged voice content and text; When receiving the key word of input, the mark of the content of text at search key place in text; Play and the voice content identifying the time attribute place be associated.The invention enables more accurate corresponding with voice content of content of text, in follow-up search, effectively can improve recording and searching located efficiency.

Description

The method and apparatus of mobile terminal and recording search thereof
Technical field
The present invention relates to technical field of mobile terminals, be specifically related to the method and apparatus of mobile terminal and recording search thereof.
Background technology
In the occasion such as meeting, Training and Learning, people like the voice content by sound recordings meeting-place, to facilitate Day-after-recall and study.During due to live recording, in order to be afraid of to miss certain section of voice, the general mode adopting omnidistance recording, recording file will be caused so larger, and record length is very long, when only wanting again to listen to certain section of recording substance in recording file in the future, user just finds the information oneself wanted after often needing to listen a lot of irrelevant voice content, and want notice to concentrate, be not so easy to just miss.
In order to easily find the information of recording substance, generally adopting in prior art and converting recording substance to text, then by carrying out keyword search coupling to text, thus the recording substance wanting to listen to roughly being located.But the disadvantage of this mode there will be error when being speech conversion text, namely the meaning that comprises of recording non-textual can embody completely, the tone of such as speaking, different linguistic context, with a word, say by the different tone or in different linguistic context, the meaning of expression may be just completely different, thus cause the effectiveness comparison of recording search location poor.
Summary of the invention
The invention provides the method and apparatus of a kind of mobile terminal and recording search thereof, to reach the object accurately navigating to the record length of wanting to listen, overcome above-mentioned owing to there will be the defect that error causes recording locating effect poor during speech conversion text.
The technical scheme that the present invention solves the problems of the technologies described above is as follows.
According to an aspect of the present invention, the one recording searching method provided, the method is applied to mobile terminal, and the method comprises:
Voice document with time attribute is divided into the voice content of the free attribute of some length of tape;
Respectively every section of voice content is converted to corresponding content of text, and all content of text are stored in text, described content of text has the mark be associated with described time attribute;
Storaged voice content and text;
When receiving the key word of input, the mark of the content of text at search key place in text;
Play and the voice content identifying the time attribute place be associated.
Preferably, above-mentioned the voice document with time attribute is divided into the voice document of the free attribute of some length of tape before, the method also comprises: record the voice document with time attribute;
Preferably, for the above-mentioned voice content voice document with time attribute being divided into the free attribute of some length of tape, the method also comprises: with preset the time interval for segmentation according to or with the speech pause in voice document for segmentation foundation;
Preferably, for the above-mentioned key word receiving input, the input mode of key word comprises phonetic entry;
Preferably, above-mentioned mark comprise be associated with the time attribute of corresponding voice content time index;
Preferably, for the mark of the content of text at search key place in text, and play and the voice content identifying the time attribute place be associated, the method also comprises: obtain the time index in the content of text of key word place; Obtain the voice content at the time attribute place be associated with time index.
According to another aspect of the present invention, the one recording searcher provided, this device comprises:
Recording segmentation module, for being divided into the voice content of the free attribute of some length of tape by the voice document with time attribute;
Voice conversion module, for respectively every section of voice content being converted to corresponding content of text, and all content of text are stored in text, described content of text has the mark be associated with described time attribute;
Memory module, for storaged voice content and text;
Search module, for when receiving the key word of input, the mark of the content of text at search key place in text;
Playing module, for the voice content play with identify the time attribute place be associated.
Preferably, this device above-mentioned also comprises the recording module for recording the voice document with time attribute;
Preferably, above-mentioned playing module also comprises:
Time index acquisition module, for obtaining the time index in the content of text of key word place;
Voice content acquisition module, for obtaining the voice content at the time attribute place be associated with time index.
According to a further aspect of the invention, a kind of mobile terminal provided, this mobile terminal comprises above-mentioned recording searcher.
The invention provides a kind of mobile terminal and theft preventing method thereof and device, by by with time attribute voice document with preset the time interval for segmentation foundation, or with the speech pause in voice document for segmentation is according to being divided into multistage voice content, again multistage voice content is converted to respectively corresponding content of text, make more accurate corresponding with voice content of content of text, thus in follow-up search, the corresponding content of text section of a keyword is there will not be to search out multiple voice content section, improve recording search locating accuracy, and then improve the efficiency of recording search location, the present invention is when search key, not only can by traditional keyboard or handwriting input mode, also by phonetic entry, improve experience of the present invention, by arranging the time index of content of text, this time index is corresponding with the time attribute of voice content, further raising subsequent searches locating accuracy.
Accompanying drawing explanation
Fig. 1 is the process flow diagram of the recording searching method according to the first embodiment of the present invention;
Fig. 2 is the process flow diagram of the recording searching method according to second embodiment of the present invention;
Fig. 3 is the process flow diagram of the recording searching method according to the 3rd embodiment of the present invention;
Fig. 4 is recording searcher exemplary block diagram according to an embodiment of the invention;
Fig. 5 is the exemplary block diagram of playing module according to an embodiment of the invention;
Fig. 6 is the exemplary block diagram of mobile terminal according to an embodiment of the invention.
Embodiment
Be described principle of the present invention and feature below in conjunction with accompanying drawing, example, only for explaining the present invention, is not intended to limit scope of the present invention.
Fig. 1 is the process flow diagram of the recording searching method according to the first embodiment of the present invention, and describe the recording searching method in the present invention below in conjunction with Fig. 1, the method is applied to mobile terminal, and as shown in Figure 1, the method comprises the following steps:
S01, is divided into the voice content of the free attribute of some length of tape by the voice document with time attribute;
S02, respectively every section of voice content is converted to corresponding content of text, and all content of text are stored in text, described content of text has the mark be associated with described time attribute;
S03, stores described voice content and text;
S04, when receiving the key word of input, the mark of the content of text at search key place in text;
S05, plays and the voice content identifying the time attribute place be associated.
Wherein, described the voice document with time attribute is divided into the step of the voice document of the free attribute of some length of tape before, also comprise and record with the voice document of time attribute, as recorded a period of time longer voice document by mobile phone.
Wherein, the described content of text respectively every section of voice content being converted to correspondence, and all content of text are stored in the step of text, comprise two types, one is: respectively every section of voice content is converted to corresponding content of text, and all content of text are combined into text; Another is: respectively every section of voice content is converted to corresponding content of text, and is preserved respectively by content of text corresponding for every section of voice content, forms the text that every section of voice content is corresponding.
Wherein, for the above-mentioned key word receiving input, the input mode of key word includes but not limited to phonetic entry, text event detection.When the key word inputted is voice, now voiced keyword is converted to word key word, then in text, mates this key word.
For the above-mentioned voice content voice document with time attribute being divided into the free attribute of some length of tape, can with the time interval of presetting for segmentation be according to segmentation, also can with the speech pause in voice document for segmentation be according to segmentation.This time interval of presetting can be arranged, according to the actual requirements flexibly as being set to 1 minute, when carrying out segmentation to voice document, the time attribute of first paragraph voice content is 0-1 minute, the time attribute of second segment voice content is 1-2 minute, the like, until segmentation completes; Segmentation can also be carried out with the speech pause in voice document, in meeting or Training and Learning process, speaker is when speaking, there will be pause, in short can once pause as often finished, now using pause point as waypoint, carry out segmentation to voice document, the time attribute of each section of voice content is the time between two adjacent pauses; No matter carry out segmentation in any mode, carry out speech conversion respectively after segmentation, make voice content convert the content of text corresponding with voice content to, and mix the time index corresponding with time attribute for each section of content of text.
Wherein, above-mentioned mark comprises the time index be associated with the time attribute of corresponding voice content.
Wherein, for the mark of the content of text at above-mentioned search key place in text, and play and the voice content identifying the time attribute place be associated, the method also comprises:
Obtain the time index in the content of text of key word place;
Obtain the voice content at the time attribute place be associated with time index.
Fig. 2 is the process flow diagram of the recording searching method according to second embodiment of the present invention, on the basis of first embodiment of Fig. 1, adds step S00 before step S01, records voice document with time attribute; With step S021 replacement step S02, step S021, with the time interval of presetting for segmentation is according to respectively every section of voice content being converted to corresponding content of text, and all content of text are stored in text, content of text has the mark be associated with time attribute; Last with step S041 replacement step S04, step S041, by keyboard or handwriting input key word, according to the mark of key word content of text at search key place in text.
Fig. 3 is the process flow diagram of the recording searching method according to the 3rd embodiment of the present invention, on the basis of first embodiment of Fig. 1, before step S01, adds step S00, records the voice document with time attribute; With step S022 replacement step S02, step S022, with the speech pause in voice document for segmentation is according to respectively every section of voice content being converted to corresponding content of text, and all content of text are stored in text, content of text has the mark be associated with time attribute; Last with step S042 replacement step S04, step S042, by phonetic entry key word, and according to the mark of key word content of text at search key place in text, the key word content of wherein phonetic entry first converts corresponding word to by existing voice identification software.
Fig. 4 is according to recording searcher exemplary block diagram of the present invention; Describe the recording searcher 100 in the present invention below according to Fig. 4, as shown in Figure 4, this device comprises:
Recording segmentation module 01, for being divided into the voice content of the free attribute of some length of tape by the voice document with time attribute;
Voice conversion module 02, for respectively every section of voice content being converted to corresponding content of text, and all content of text are stored in text, described content of text has the mark be associated with described time attribute;
Memory module 03, for storaged voice content and text;
Search module 04, for when receiving the key word of input, the mark of the content of text at search key place in text;
Playing module 05, for the voice content play with identify the time attribute place be associated.
Wherein, above-mentioned voice conversion module 02 is also for making described content of text with the time index of the time attribute of corresponding voice content.
As shown in Figure 1, above-mentioned recording searcher 100 can also comprise:
Recording module 00, for recording the voice document with time attribute.
Above-mentioned recording module 00 can with the equipment of a microphone as sound signal typing.When search key, microphone input audio signal can be used at key word inputting interface, be the key word that playing module can identify by existing speech recognition software by the Content Transformation in this sound signal again, carry out subsequent searches again, in the environment being not easy to sounding, also by touch screen hand-writing or board input key word, or by touch-screen dummy keyboard or physical keyboard input key word.
Fig. 5 is the exemplary block diagram according to playing module of the present invention, and as shown in Figure 5, above-mentioned recording searcher 100 can also comprise:
Time index acquisition module 14, for obtaining the time index in the content of text of key word place;
Voice content acquisition module 24, for obtaining the voice content at the time attribute place be associated with time index.
Fig. 6 is the exemplary block diagram of mobile terminal according to an embodiment of the invention, a kind of mobile terminal 11 as shown in Figure 6, and this mobile terminal comprises above-mentioned recording searcher.
The present invention by by with time attribute voice document with preset the time interval for segmentation foundation, or with the speech pause in voice document for segmentation is according to being divided into multistage voice content, again multistage voice content is converted to respectively corresponding content of text, make more accurate corresponding with voice content of content of text, thus in follow-up search, the corresponding content of text section of a keyword is there will not be to search out multiple voice content section, improve recording search locating accuracy, and then improve the efficiency of recording search location, the present invention is when search key, not only can by traditional keyboard or handwriting input mode, also by phonetic entry, improve experience of the present invention, by arranging the time index of content of text, this time index is corresponding with the time attribute of voice content, the accuracy rate of further raising subsequent searches location.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (9)

1. a recording searching method, the method is applied to mobile terminal, it is characterized in that, the method comprises:
With preset the time interval or with the speech pause of a word in voice document for segmentation foundation, the voice document with time attribute is divided into the voice content of the free attribute of some length of tape;
Respectively every section of voice content is converted to corresponding content of text, and described all content of text are stored in text, described content of text has the mark be associated with described time attribute;
Store described voice content and described text;
When receiving the key word of input, in described text, mate described key word;
Obtain the mark of the content of text at described key word place;
Play the voice content at the time attribute place be associated with described mark.
2. one recording searching method according to claim 1, it is characterized in that, the method also comprises:
Record the voice document with time attribute.
3. one recording searching method according to claim 1, it is characterized in that, for the described key word receiving input, the input mode of described key word comprises phonetic entry.
4. the one recording searching method according to any one of claims 1 to 3, it is characterized in that, described mark comprises the time index be associated with the time attribute of corresponding voice content.
5. one recording searching method according to claim 4, it is characterized in that, the method also comprises:
Obtain the time index in the content of text of described key word place;
Obtain the voice content at the time attribute place be associated with described time index.
6. a recording searcher, is characterized in that, this device comprises:
Recording segmentation module, for preset the time interval or with the speech pause of a word in voice document for segmentation foundation, the voice document with time attribute is divided into the voice content of the free attribute of some length of tape;
Voice conversion module, for respectively every section of voice content being converted to corresponding content of text, and described all content of text are stored in text, described content of text has the mark be associated with described time attribute;
Memory module, for storing described voice content and described text;
Search module, for when receiving the key word of input, mates described key word in described text, and obtains the mark of the content of text at described key word place;
Playing module, for playing the voice content at the time attribute place be associated with described mark.
7. one recording searcher according to claim 6, it is characterized in that, this device also comprises:
Recording module, for recording the voice document with time attribute.
8. one recording searcher according to claim 6, it is characterized in that, this device also comprises:
Time index acquisition module, for obtaining the time index in the content of text of described key word place;
Voice content acquisition module, for obtaining the voice content at the time attribute place be associated with described time index.
9. a mobile terminal, is characterized in that: comprise the device according to any one of the claims 6 to 8.
CN201410312543.7A 2014-07-02 2014-07-02 The method and apparatus of mobile terminal and recording search thereof Active CN104078044B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410312543.7A CN104078044B (en) 2014-07-02 2014-07-02 The method and apparatus of mobile terminal and recording search thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410312543.7A CN104078044B (en) 2014-07-02 2014-07-02 The method and apparatus of mobile terminal and recording search thereof

Publications (2)

Publication Number Publication Date
CN104078044A CN104078044A (en) 2014-10-01
CN104078044B true CN104078044B (en) 2016-03-30

Family

ID=51599267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410312543.7A Active CN104078044B (en) 2014-07-02 2014-07-02 The method and apparatus of mobile terminal and recording search thereof

Country Status (1)

Country Link
CN (1) CN104078044B (en)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104408162B (en) * 2014-12-05 2017-10-31 国家电网公司 A kind of multimedia system and processing method for being used to form text index
CN105787496A (en) * 2014-12-23 2016-07-20 联想(北京)有限公司 Data collection method and electronic device
US20160379641A1 (en) * 2015-06-29 2016-12-29 Microsoft Technology Licensing, Llc Auto-Generation of Notes and Tasks From Passive Recording
CN106558311B (en) * 2015-09-30 2020-11-27 北京奇虎科技有限公司 Voice content prompting method and device
CN105653729B (en) * 2016-01-28 2019-10-08 努比亚技术有限公司 A kind of device and method of recording file index
CN106057193A (en) * 2016-07-13 2016-10-26 深圳市沃特沃德股份有限公司 Conference record generation method based on telephone conference and device
CN106128460A (en) * 2016-08-04 2016-11-16 周奇 A kind of record labels method and device
CN106504773B (en) * 2016-11-08 2023-08-01 上海贝生医疗设备有限公司 Wearable device and voice and activity monitoring system
CN106357929A (en) * 2016-11-10 2017-01-25 努比亚技术有限公司 Previewing method based on audio file and mobile terminal
CN108874815A (en) * 2017-05-10 2018-11-23 北京国双科技有限公司 The search method and device of audio-video
CN107291676B (en) * 2017-06-20 2021-11-19 广东小天才科技有限公司 Method for cutting off voice file, terminal equipment and computer storage medium
CN110019923A (en) * 2017-07-18 2019-07-16 北京国双科技有限公司 The lookup method and device of speech message
CN109559764A (en) * 2017-09-27 2019-04-02 北京国双科技有限公司 The treating method and apparatus of audio file
CN108287930A (en) * 2018-03-08 2018-07-17 珠海格力电器股份有限公司 A kind of recording searching method, device and electronic equipment
CN110489589A (en) * 2018-05-11 2019-11-22 深圳市诚壹科技有限公司 A kind of recording file store method, device and terminal device
CN108874904B (en) * 2018-05-24 2022-04-29 平安科技(深圳)有限公司 Voice message searching method and device, computer equipment and storage medium
CN109274586A (en) * 2018-11-14 2019-01-25 深圳市云歌人工智能技术有限公司 Storage method, device and the storage medium of chat message
CN110287364B (en) * 2019-06-28 2021-10-08 合肥讯飞读写科技有限公司 Voice search method, system, device and computer readable storage medium
CN110636369A (en) * 2019-09-27 2019-12-31 维沃移动通信有限公司 Multimedia file playing method and mobile terminal
CN111092996A (en) * 2019-10-31 2020-05-01 国网山东省电力公司信息通信公司 Centralized scheduling recording system and control method
CN113724735A (en) * 2021-09-01 2021-11-30 广州博冠信息科技有限公司 Voice stream processing method and device, computer readable storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7039585B2 (en) * 2001-04-10 2006-05-02 International Business Machines Corporation Method and system for searching recorded speech and retrieving relevant segments
CN1783073A (en) * 2004-09-01 2006-06-07 创新科技有限公司 A search system
CN101351838A (en) * 2005-12-30 2009-01-21 坦德伯格电信公司 Searchable multimedia stream
CN103065659A (en) * 2012-12-06 2013-04-24 广东欧珀移动通信有限公司 Multi-media recording method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7039585B2 (en) * 2001-04-10 2006-05-02 International Business Machines Corporation Method and system for searching recorded speech and retrieving relevant segments
CN1783073A (en) * 2004-09-01 2006-06-07 创新科技有限公司 A search system
CN101351838A (en) * 2005-12-30 2009-01-21 坦德伯格电信公司 Searchable multimedia stream
CN103065659A (en) * 2012-12-06 2013-04-24 广东欧珀移动通信有限公司 Multi-media recording method

Also Published As

Publication number Publication date
CN104078044A (en) 2014-10-01

Similar Documents

Publication Publication Date Title
CN104078044B (en) The method and apparatus of mobile terminal and recording search thereof
CN107016994B (en) Voice recognition method and device
US11037553B2 (en) Learning-type interactive device
CN108074576B (en) Speaker role separation method and system under interrogation scene
US8209171B2 (en) Methods and apparatus relating to searching of spoken audio data
US10489451B2 (en) Voice search system, voice search method, and computer-readable storage medium
WO2020043123A1 (en) Named-entity recognition method, named-entity recognition apparatus and device, and medium
WO2019148586A1 (en) Method and device for speaker recognition during multi-person speech
US8909525B2 (en) Interactive voice recognition electronic device and method
CN109256152A (en) Speech assessment method and device, electronic equipment, storage medium
US20120271631A1 (en) Speech recognition using multiple language models
CN105975569A (en) Voice processing method and terminal
JPWO2008114811A1 (en) Information search system, information search method, and information search program
CN101593519B (en) Method and device for detecting speech keywords as well as retrieval method and system thereof
US20130253932A1 (en) Conversation supporting device, conversation supporting method and conversation supporting program
CN109686383B (en) Voice analysis method, device and storage medium
Moore Automated transcription and conversation analysis
US20120035919A1 (en) Voice recording device and method thereof
CN103123644A (en) Voice data retrieval system and program product therefor
KR20140123369A (en) Question answering system using speech recognition and its application method thereof
KR20130086971A (en) Question answering system using speech recognition and its application method thereof
WO2014203328A1 (en) Voice data search system, voice data search method, and computer-readable storage medium
CN113782026A (en) Information processing method, device, medium and equipment
KR102536944B1 (en) Method and apparatus for speech signal processing
CN108364655A (en) Method of speech processing, medium, device and computing device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: A District No. 9018 Han innovation building in Nanshan District high tech Zone in Shenzhen city of Guangdong Province, North Central Avenue, 518000 floor 10

Applicant after: Nubian Technologies Ltd.

Address before: A District No. 9018 Han innovation building in Nanshan District high tech Zone in Shenzhen city of Guangdong Province, North Central Avenue, 518000 floor 10

Applicant before: Shenzhen ZTE Mobile Tech Co., Ltd.

COR Change of bibliographic data
C14 Grant of patent or utility model
GR01 Patent grant