CN106128467A - Method of speech processing and device - Google Patents

Method of speech processing and device Download PDF

Info

Publication number
CN106128467A
CN106128467A CN201610395662.2A CN201610395662A CN106128467A CN 106128467 A CN106128467 A CN 106128467A CN 201610395662 A CN201610395662 A CN 201610395662A CN 106128467 A CN106128467 A CN 106128467A
Authority
CN
China
Prior art keywords
user
recommendation service
content recommendation
age
characteristic information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610395662.2A
Other languages
Chinese (zh)
Inventor
黄宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Yunzhisheng Information Technology Co Ltd
Original Assignee
Beijing Yunzhisheng Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yunzhisheng Information Technology Co Ltd filed Critical Beijing Yunzhisheng Information Technology Co Ltd
Priority to CN201610395662.2A priority Critical patent/CN106128467A/en
Publication of CN106128467A publication Critical patent/CN106128467A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • G10L17/14Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • G06F16/435Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Game Theory and Decision Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention is that wherein, method includes about a kind of method of speech processing and device: receive the voice messaging of user's input;Described voice messaging is carried out Application on Voiceprint Recognition, to determine the characteristic information of described user;Described voice messaging is carried out voice and semantics recognition, to obtain text identification result;According to described characteristic information and described text identification result, determine the object content recommendation service corresponding with described voice messaging.By this technical scheme, the voice messaging of user's input is carried out Application on Voiceprint Recognition and voice, semantics recognition respectively, determine characteristic information and the recognition result of user, wherein, characteristic information can be sex and the age etc. of user, recognition result is the word content that the voice messaging identified is corresponding, and then determines the object content recommendation service of correspondence according to user's characteristic information and recognition result, thus meets the different recommended requirements of different user.

Description

Method of speech processing and device
Technical field
The present invention relates to voice processing technology field, particularly relate to a kind of method of speech processing and device.
Background technology
Speech recognition is a cross discipline.Recent two decades comes, and speech recognition technology obtains marked improvement, starts from experiment Market is moved towards in room.It is contemplated that, in coming 10 years, speech recognition technology will enter industry, household electrical appliances, communication, automotive electronics, doctor The every field such as treatment, home services, consumption electronic product.The application in some fields of the speech recognition dictation machine is by US News circle It is chosen as one of ten major issues of development of computer in 1997.A lot of experts think that speech recognition technology is between 2000 to 2010 One of development in science and technology technology that areas of information technology ten are the most important.Field involved by speech recognition technology includes: signal processing, Pattern recognition, theory of probability and theory of information, sound generating mechanism and hearing mechanism, artificial intelligence etc..
Summary of the invention
The embodiment of the present invention provides a kind of method of speech processing and device, in order to realize the different demands according to different user For user's recommendation service.
First aspect according to embodiments of the present invention, it is provided that a kind of method of speech processing, including:
Receive the voice messaging of user's input;
Described voice messaging is carried out Application on Voiceprint Recognition, to determine the characteristic information of described user;
Described voice messaging is carried out voice and semantics recognition, to obtain text identification result;
According to described characteristic information and described text identification result, determine that the object content corresponding with described voice messaging pushes away Recommend service.
In this embodiment, the voice messaging of user's input is carried out Application on Voiceprint Recognition and voice, semantics recognition respectively, determines The characteristic information of user and recognition result, wherein, characteristic information can be sex and the age etc. of user, and recognition result is to identify The word content that voice messaging out is corresponding, and then the object content of correspondence is determined according to user's characteristic information and recognition result Recommendation service, thus meet the different recommended requirements of different user.
In one embodiment, described characteristic information include following at least one:
The sex of described user and age.
The voice messaging of user's input is carried out Application on Voiceprint Recognition, it may be determined that the sex of user and age etc..
In one embodiment, described method also includes:
Export described object content recommendation service.
In this embodiment it is possible to output object content recommendation service, push away so that user can normally view content Recommend service, promote the experience of user.
In one embodiment, described according to described characteristic information with described text identification result, determine and described voice The object content recommendation service that information is corresponding, including:
The Keyword Tag of each content recommendation service is obtained from preset content recommendation service data base, wherein, described Keyword Tag includes COS, is suitable for the range of age and applicable sex;
According to described Keyword Tag, determine and mate with described text identification result, and mate with described characteristic information Object content recommendation service.
In this embodiment, in preset content recommendation service data base each content recommendation service with crucial sign Signing, wherein, Keyword Tag can be COS, such as amusement and recreation service, study class service etc., specifically, can have song Song, reading matter, cross-talk etc..Keyword Tag can also include being suitable for the range of age, if content recommendation service is to be suitable for old man, green grass or young crops less Year or child, it is, of course, also possible to include being suitable for sex.As such, it is possible to according to the feature of user and the voice content of input be User finds the content recommendation service being best suitable for its feature and voice content, recommends to meet its content required for user Recommendation service, promotes the experience of user.
In one embodiment, described according to described Keyword Tag, determine and mate with described text identification result, and with The object content recommendation service of described characteristic information coupling, including:
Determine that the target belonging to the age of user in described characteristic information is suitable for the range of age, and/or belonging to user's sex Target be suitable for sex;
Determine the destination service type comprised in described text identification result;
According to described Keyword Tag, determine and the destination service type matching in described text identification result, and with institute State target be suitable for the range of age and/or be suitable for the object content recommendation service of gender matched with described target.
In this embodiment it is possible to arrange multiple applicable the range of age, as arrange 1-6 year be child's section, 7-17 year be blue or green Juvenile section, 18-50 year for growing up section, within more than 50 years old, be section in old age, and for different age brackets, the commending contents that correspondence is different Service, as serviced for song class, recommendations song corresponding to child's section can be that recommendation song corresponding to nursery rhymes etc., youth's section is Cartoon theme songs etc., recommendation song corresponding to section of growing up is popular song etc., and song corresponding to old section be classics old song etc.. Simultaneously for different sexes, content recommendation service can also, as schoolgirl, corresponding recommendation song can be all man The song of singer, for boy student, corresponding recommendation song can be all the song of female singer.So, age of user institute is first determined The target belonged to is suitable for the range of age and/or target is suitable for sex, and then is user's content recommendation recommendation service according to these.
So, according to the demand of the user of different age group, for its recommendation service, so that the service recommended is more satisfied The requirement of user, is more suitable for user, promotes the experience of user.
Second aspect according to embodiments of the present invention, it is provided that a kind of voice processing apparatus, including:
Receiver module, for receiving the voice messaging of user's input;
First determines module, for described voice messaging being carried out Application on Voiceprint Recognition, to determine the characteristic information of described user;
Identification module, for carrying out voice and semantics recognition to described voice messaging, to obtain text identification result;
Second determines module, for according to described characteristic information and described text identification result, determines and believes with described voice The object content recommendation service that breath is corresponding.
In one embodiment, described device also includes:
Output module, is used for exporting described object content recommendation service.
In one embodiment, described characteristic information include following at least one:
The sex of described user and age.
In one embodiment, described second determines that module includes:
Obtain submodule, for obtaining the keyword of each content recommendation service from preset content recommendation service data base Label, wherein, described Keyword Tag includes COS, is suitable for the range of age and applicable sex;
Determine submodule, for according to described Keyword Tag, determine and mate with described text identification result, and with described The object content recommendation service of characteristic information coupling.
In one embodiment, described determine submodule for:
Determine that the target belonging to the age of user in described characteristic information is suitable for the range of age, and/or belonging to user's sex Target be suitable for sex;
Determine the destination service type comprised in described text identification result;
According to described Keyword Tag, determine and the destination service type matching in described text identification result, and with institute State target be suitable for the range of age and/or be suitable for the object content recommendation service of gender matched with described target.
It should be appreciated that it is only exemplary and explanatory, not that above general description and details hereinafter describe The present invention can be limited.
Other features and advantages of the present invention will illustrate in the following description, and, partly become from description Obtain it is clear that or understand by implementing the present invention.The purpose of the present invention and other advantages can be by the explanations write Structure specifically noted in book, claims and accompanying drawing realizes and obtains.
Below by drawings and Examples, technical scheme is described in further detail.
Accompanying drawing explanation
Accompanying drawing herein is merged in description and constitutes the part of this specification, it is shown that meet the enforcement of the present invention Example, and for explaining the principle of the present invention together with description.
Fig. 1 is the flow chart according to a kind of method of speech processing shown in an exemplary embodiment.
Fig. 2 is the flow chart according to the another kind of method of speech processing shown in an exemplary embodiment.
Fig. 3 is according to the flow chart of step S104 in a kind of method of speech processing shown in an exemplary embodiment.
Fig. 4 is according to the flow chart of step S302 in a kind of method of speech processing shown in an exemplary embodiment.
Fig. 5 is the block diagram according to a kind of voice processing apparatus shown in an exemplary embodiment.
Fig. 6 is the block diagram according to the another kind of voice processing apparatus shown in an exemplary embodiment.
Fig. 7 is to determine the block diagram of module according in the another kind of voice processing apparatus shown in an exemplary embodiment second.
Detailed description of the invention
Here will illustrate exemplary embodiment in detail, its example represents in the accompanying drawings.Explained below relates to During accompanying drawing, unless otherwise indicated, the same numbers in different accompanying drawings represents same or analogous key element.Following exemplary embodiment Described in embodiment do not represent all embodiments consistent with the present invention.On the contrary, they are only with the most appended The example of the apparatus and method that some aspects that described in detail in claims, the present invention are consistent.
Fig. 1 is the flow chart according to a kind of method of speech processing shown in an exemplary embodiment.This method of speech processing Being applied in terminal unit, this terminal unit can be mobile phone, computer, digital broadcast terminal, messaging devices, trip Play control station, tablet device, armarium, body-building equipment, arbitrary equipment with language process function such as personal digital assistant.
As it is shown in figure 1, the method comprising the steps of S101-S104:
In step S101, receive the voice messaging of user's input;
In step s 102, described voice messaging is carried out Application on Voiceprint Recognition, to determine the characteristic information of described user;
So-called vocal print (Voiceprint), is the sound wave spectrum carrying verbal information that shows of electricity consumption acoustic instrument.The mankind The generation of language is a complicated physiology physical process between Body Languages maincenter and phonatory organ, and people uses when speech Phonatory organ--tongue, tooth, larynx, lung, nasal cavity everyone widely different in terms of size and form, so any two people Vocal print collection of illustrative plates the most variant.Everyone existing relative stability of Speech acoustics feature, has again variability, be not absolute, Unalterable.This variation may be from physiology, pathology, psychology, simulates, pretends, also relevant with environmental disturbances.While it is true, Owing to everyone phonatory organ are not quite similar, the most in the ordinary course of things, remain to distinguish sound or the judgement of different people It is whether the sound of same people.
And by voice messaging being carried out Application on Voiceprint Recognition, the specific features of user can be identified, age of such as user, Sex etc..
In step s 103, described voice messaging is carried out voice and semantics recognition, to obtain text identification result;
Voice messaging is carried out voice and semantics recognition, then can identify the concrete content of text described in user, such as User has said " I wants to listen a first song ", then, after voice and speech recognition, the text identification result obtained is that " I wants to listen one First song ", corresponding word content will be changed into by the voice content described in user.
In step S104, according to described characteristic information and described text identification result, determine and described voice messaging pair The object content recommendation service answered.
In this embodiment, the voice messaging of user's input is carried out Application on Voiceprint Recognition and voice, semantics recognition respectively, determines The characteristic information of user and recognition result, wherein, characteristic information can be sex and the age etc. of user, and recognition result is to identify The word content that voice messaging out is corresponding, and then the object content of correspondence is determined according to user's characteristic information and recognition result Recommendation service, thus meet the different recommended requirements of different user.
In one embodiment, described characteristic information include following at least one:
The sex of described user and age.
The voice messaging of user's input is carried out Application on Voiceprint Recognition, it may be determined that the sex of user and age etc..
Fig. 2 is the flow chart according to the another kind of method of speech processing shown in an exemplary embodiment.
As in figure 2 it is shown, in one embodiment, said method also includes step S201:
In step s 201, described object content recommendation service is exported.
In this embodiment it is possible to output object content recommendation service, push away so that user can normally view content Recommend service, promote the experience of user.
Fig. 3 is according to the flow chart of step S104 in a kind of method of speech processing shown in an exemplary embodiment.
As it is shown on figure 3, in one embodiment, above-mentioned steps S104 can include step S301-S302:
In step S301, from preset content recommendation service data base, obtain the crucial sign of each content recommendation service Signing, wherein, described Keyword Tag includes COS, is suitable for the range of age and applicable sex;
In step s 302, according to described Keyword Tag, determine and mate with described text identification result, and with described spy Levy the object content recommendation service of information matches.
In this embodiment, in preset content recommendation service data base each content recommendation service with crucial sign Signing, wherein, Keyword Tag can be COS, such as amusement and recreation service, study class service etc., specifically, can have song Song, reading matter, cross-talk etc..Keyword Tag can also include being suitable for the range of age, if content recommendation service is to be suitable for old man, green grass or young crops less Year or child, it is, of course, also possible to include being suitable for sex.As such, it is possible to according to the feature of user and the voice content of input be User finds the content recommendation service being best suitable for its feature and voice content, recommends to meet its content required for user Recommendation service, promotes the experience of user.
Fig. 4 is according to the flow chart of step S302 in a kind of method of speech processing shown in an exemplary embodiment.
As shown in Figure 4, in one embodiment, above-mentioned steps S302 can include step S401-S403:
In step S401, determine that the target belonging to the age of user in described characteristic information is suitable for the range of age, and/or Target belonging to user's sex is suitable for sex;
In step S402, determine the destination service type comprised in described text identification result;
In step S403, according to described Keyword Tag, determine and the destination service class in described text identification result Type mates, and is suitable for the range of age with described target and/or is suitable for the object content recommendation service of gender matched with described target.
In this embodiment it is possible to arrange multiple applicable the range of age, as arrange 1-6 year be child's section, 7-17 year be blue or green Juvenile section, 18-50 year for growing up section, within more than 50 years old, be section in old age, and for different age brackets, the commending contents that correspondence is different Service, as serviced for song class, recommendations song corresponding to child's section can be that recommendation song corresponding to nursery rhymes etc., youth's section is Cartoon theme songs etc., recommendation song corresponding to section of growing up is popular song etc., and song corresponding to old section be classics old song etc.. Simultaneously for different sexes, content recommendation service can also, as schoolgirl, corresponding recommendation song can be all man The song of singer, for boy student, corresponding recommendation song can be all the song of female singer.So, age of user institute is first determined The target belonged to is suitable for the range of age and/or target is suitable for sex, and then is user's content recommendation recommendation service according to these.
Such as, when user input voice information: " I wants to listen a first song ".Equipment this voice messaging is carried out Application on Voiceprint Recognition and By voice semantics recognition, voice, semantics recognition, determined that user wants to listen a first song, determined the age of user by Application on Voiceprint Recognition And/or sex, if identifying age of user is 5 years old, then it belongs to child's section, and equipment is from preset content recommendation service data base Middle search key label is the content recommendation service of song and child, if the content recommendation service found has one, the most defeated Going out this content recommendation service, the content recommendation service as found has multiple, then can randomly select, such as the commending contents clothes found Business has many first children's songs, can be with the first children's song of shuffle one.It is similar to, if identifying age of user is 15 years old, then It belongs to teenager section, then equipment can be with the first cartoon theme song of shuffle one.If identifying user is 30 years old, then it belongs to In adult section, then equipment can be with the first popular song of shuffle one.If identifying age of user is 60 years old, then it belongs to old Section, then equipment can be with the first old song of shuffle one.
The most such as, user input voice information " my talking book to be listened ", equipment carries out Application on Voiceprint Recognition to this voice messaging With voice, semantics recognition, determine that user wants to listen talking book by voice semantics recognition, determined the year of user by Application on Voiceprint Recognition Age and/or sex, if identifying age of user is 5 years old, then it belongs to child's section, and equipment is from preset content recommendation service data In storehouse, search key label is the content recommendation service of talking book and child, if the content recommendation service found has one Individual, then export this content recommendation service, the content recommendation service as found has multiple, then can randomly select, in finding Hold recommendation service and have multiple children stories, then can be with one children stories of shuffle.It is similar to, if identifying age of user Be 15 years old, then it belongs to teenager section, then equipment can be with one section of prose of shuffle.If identifying user is 30 years old, then its Belong to adult section, then equipment can be with one story of pursuing a goal with determination of shuffle.If identifying age of user is 60 years old, then it belongs to old Year section, then equipment can be with one cross-talk of shuffle.
So, according to the demand of the user of different age group, for its recommendation service, so that the service recommended is more satisfied The requirement of user, is more suitable for user, promotes the experience of user.
Following for apparatus of the present invention embodiment, may be used for performing the inventive method embodiment.
Fig. 5 is the block diagram according to a kind of voice processing apparatus shown in an exemplary embodiment, and this device can be by soft Part, hardware or both be implemented in combination with become the some or all of of terminal unit.As it is shown in figure 5, this voice processing apparatus Including:
Receiver module 51, for receiving the voice messaging of user's input;
First determines module 52, for described voice messaging is carried out Application on Voiceprint Recognition, to determine that the feature of described user is believed Breath;
Identification module 53, for carrying out voice and semantics recognition to described voice messaging, to obtain text identification result;
Second determines module 54, for according to described characteristic information and described text identification result, determines and described voice The object content recommendation service that information is corresponding.
In this embodiment, the voice messaging of user's input is carried out Application on Voiceprint Recognition and voice, semantics recognition respectively, determines The characteristic information of user and recognition result, wherein, characteristic information can be sex and the age etc. of user, and recognition result is to identify The word content that voice messaging out is corresponding, and then the object content of correspondence is determined according to user's characteristic information and recognition result Recommendation service, thus meet the different recommended requirements of different user.
Fig. 6 is the block diagram according to the another kind of voice processing apparatus shown in an exemplary embodiment.
As shown in Figure 6, in one embodiment, said apparatus also includes:
Output module 61, is used for exporting described object content recommendation service.
In this embodiment it is possible to output object content recommendation service, push away so that user can normally view content Recommend service, promote the experience of user.
In one embodiment, described characteristic information include following at least one:
The sex of described user and age.
Fig. 7 is to determine the block diagram of module according in the another kind of voice processing apparatus shown in an exemplary embodiment second.
As it is shown in fig. 7, in one embodiment, described second determines that module 54 includes:
Obtain submodule 71, for obtaining the key of each content recommendation service from preset content recommendation service data base Sign label, wherein, described Keyword Tag includes COS, is suitable for the range of age and applicable sex;
Determine submodule 72, for according to described Keyword Tag, determine and mate with described text identification result, and with institute State the object content recommendation service of characteristic information coupling.
In this embodiment, in preset content recommendation service data base each content recommendation service with crucial sign Signing, wherein, Keyword Tag can be COS, such as amusement and recreation service, study class service etc., specifically, can have song Song, reading matter, cross-talk etc..Keyword Tag can also include being suitable for the range of age, if content recommendation service is to be suitable for old man, green grass or young crops less Year or child, it is, of course, also possible to include being suitable for sex.As such, it is possible to according to the feature of user and the voice content of input be User finds the content recommendation service being best suitable for its feature and voice content, recommends to meet its content required for user Recommendation service, promotes the experience of user.
In one embodiment, described determine submodule 72 for:
Determine that the target belonging to the age of user in described characteristic information is suitable for the range of age, and/or belonging to user's sex Target be suitable for sex;
Determine the destination service type comprised in described text identification result;
According to described Keyword Tag, determine and the destination service type matching in described text identification result, and with institute State target be suitable for the range of age and/or be suitable for the object content recommendation service of gender matched with described target.
In this embodiment it is possible to arrange multiple applicable the range of age, as arrange 1-6 year be child's section, 7-17 year be blue or green Juvenile section, 18-50 year for growing up section, within more than 50 years old, be section in old age, and for different age brackets, the commending contents that correspondence is different Service, as serviced for song class, recommendations song corresponding to child's section can be that recommendation song corresponding to nursery rhymes etc., youth's section is Cartoon theme songs etc., recommendation song corresponding to section of growing up is popular song etc., and song corresponding to old section be classics old song etc.. Simultaneously for different sexes, content recommendation service can also, as schoolgirl, corresponding recommendation song can be all man The song of singer, for boy student, corresponding recommendation song can be all the song of female singer.So, age of user institute is first determined The target belonged to is suitable for the range of age and/or target is suitable for sex, and then is user's content recommendation recommendation service according to these.
Such as, when user input voice information: " I wants to listen a first song ".Equipment this voice messaging is carried out Application on Voiceprint Recognition and By voice semantics recognition, voice, semantics recognition, determined that user wants to listen a first song, determined the age of user by Application on Voiceprint Recognition And/or sex, if identifying age of user is 5 years old, then it belongs to child's section, and equipment is from preset content recommendation service data base Middle search key label is the content recommendation service of song and child, if the content recommendation service found has one, the most defeated Going out this content recommendation service, the content recommendation service as found has multiple, then can randomly select, such as the commending contents clothes found Business has many first children's songs, can be with the first children's song of shuffle one.It is similar to, if identifying age of user is 15 years old, then It belongs to teenager section, then equipment can be with the first cartoon theme song of shuffle one.If identifying user is 30 years old, then it belongs to In adult section, then equipment can be with the first popular song of shuffle one.If identifying age of user is 60 years old, then it belongs to old Section, then equipment can be with the first old song of shuffle one.
The most such as, user input voice information " my talking book to be listened ", equipment carries out Application on Voiceprint Recognition to this voice messaging With voice, semantics recognition, determine that user wants to listen talking book by voice semantics recognition, determined the year of user by Application on Voiceprint Recognition Age and/or sex, if identifying age of user is 5 years old, then it belongs to child's section, and equipment is from preset content recommendation service data In storehouse, search key label is the content recommendation service of talking book and child, if the content recommendation service found has one Individual, then export this content recommendation service, the content recommendation service as found has multiple, then can randomly select, in finding Hold recommendation service and have multiple children stories, then can be with one children stories of shuffle.It is similar to, if identifying age of user Be 15 years old, then it belongs to teenager section, then equipment can be with one section of prose of shuffle.If identifying user is 30 years old, then its Belong to adult section, then equipment can be with one story of pursuing a goal with determination of shuffle.If identifying age of user is 60 years old, then it belongs to old Year section, then equipment can be with one cross-talk of shuffle.
So, according to the demand of the user of different age group, for its recommendation service, so that the service recommended is more satisfied The requirement of user, is more suitable for user, promotes the experience of user.
Those skilled in the art are it should be appreciated that embodiments of the invention can be provided as method, system or computer program Product.Therefore, the reality in terms of the present invention can use complete hardware embodiment, complete software implementation or combine software and hardware Execute the form of example.And, the present invention can use at one or more computers wherein including computer usable program code The shape of the upper computer program implemented of usable storage medium (including but not limited to disk memory and optical memory etc.) Formula.
The present invention is with reference to method, equipment (system) and the flow process of computer program according to embodiments of the present invention Figure and/or block diagram describe.It should be understood that can the most first-class by computer program instructions flowchart and/or block diagram Flow process in journey and/or square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided Instruction arrives the processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device to produce A raw machine so that the instruction performed by the processor of computer or other programmable data processing device is produced for real The device of the function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame now.
These computer program instructions may be alternatively stored in and computer or other programmable data processing device can be guided with spy Determine in the computer-readable memory that mode works so that the instruction being stored in this computer-readable memory produces and includes referring to Make the manufacture of device, this command device realize at one flow process of flow chart or multiple flow process and/or one square frame of block diagram or The function specified in multiple square frames.
These computer program instructions also can be loaded in computer or other programmable data processing device so that at meter Perform sequence of operations step on calculation machine or other programmable devices to produce computer implemented process, thus at computer or The instruction performed on other programmable devices provides for realizing at one flow process of flow chart or multiple flow process and/or block diagram one The step of the function specified in individual square frame or multiple square frame.
Obviously, those skilled in the art can carry out various change and the modification essence without deviating from the present invention to the present invention God and scope.So, if these amendments of the present invention and modification belong to the scope of the claims in the present invention and equivalent technologies thereof Within, then the present invention is also intended to comprise these change and modification.

Claims (10)

1. a method of speech processing, it is characterised in that including:
Receive the voice messaging of user's input;
Described voice messaging is carried out Application on Voiceprint Recognition, to determine the characteristic information of described user;
Described voice messaging is carried out voice and semantics recognition, to obtain text identification result;
According to described characteristic information and described text identification result, determine that the object content corresponding with described voice messaging recommends clothes Business.
Method the most according to claim 1, it is characterised in that described method also includes:
Export described object content recommendation service.
Method the most according to claim 1, it is characterised in that described characteristic information include following at least one:
The sex of described user and age.
Method the most according to claim 3, it is characterised in that described tie according to described characteristic information and described text identification Really, determine the object content recommendation service corresponding with described voice messaging, including:
The Keyword Tag of each content recommendation service, wherein, described key is obtained from preset content recommendation service data base Sign label include COS, are suitable for the range of age and applicable sex;
According to described Keyword Tag, determine and mate with described text identification result, and the target mated with described characteristic information Content recommendation service.
Method the most according to claim 4, it is characterised in that described according to described Keyword Tag, determines and described literary composition This recognition result mates, and the object content recommendation service mated with described characteristic information, including:
Determine that the target belonging to the age of user in described characteristic information is suitable for the range of age, and/or the mesh belonging to user's sex Mark is suitable for sex;
Determine the destination service type comprised in described text identification result;
According to described Keyword Tag, determine and the destination service type matching in described text identification result, and with described mesh Mark is suitable for the range of age and/or is suitable for the object content recommendation service of gender matched with described target.
6. a voice processing apparatus, it is characterised in that including:
Receiver module, for receiving the voice messaging of user's input;
First determines module, for described voice messaging being carried out Application on Voiceprint Recognition, to determine the characteristic information of described user;
Identification module, for carrying out voice and semantics recognition to described voice messaging, to obtain text identification result;
Second determines module, for according to described characteristic information and described text identification result, determines and described voice messaging pair The object content recommendation service answered.
Device the most according to claim 6, it is characterised in that described device also includes:
Output module, is used for exporting described object content recommendation service.
Device the most according to claim 6, it is characterised in that described characteristic information include following at least one:
The sex of described user and age.
Device the most according to claim 8, it is characterised in that described second determines that module includes:
Obtain submodule, for obtaining the crucial sign of each content recommendation service from preset content recommendation service data base Signing, wherein, described Keyword Tag includes COS, is suitable for the range of age and applicable sex;
Determine submodule, for according to described Keyword Tag, determine and mate with described text identification result, and with described feature The object content recommendation service of information matches.
Device the most according to claim 9, it is characterised in that described determine submodule for:
Determine that the target belonging to the age of user in described characteristic information is suitable for the range of age, and/or the mesh belonging to user's sex Mark is suitable for sex;
Determine the destination service type comprised in described text identification result;
According to described Keyword Tag, determine and the destination service type matching in described text identification result, and with described mesh Mark is suitable for the range of age and/or is suitable for the object content recommendation service of gender matched with described target.
CN201610395662.2A 2016-06-06 2016-06-06 Method of speech processing and device Pending CN106128467A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610395662.2A CN106128467A (en) 2016-06-06 2016-06-06 Method of speech processing and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610395662.2A CN106128467A (en) 2016-06-06 2016-06-06 Method of speech processing and device

Publications (1)

Publication Number Publication Date
CN106128467A true CN106128467A (en) 2016-11-16

Family

ID=57270826

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610395662.2A Pending CN106128467A (en) 2016-06-06 2016-06-06 Method of speech processing and device

Country Status (1)

Country Link
CN (1) CN106128467A (en)

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107170456A (en) * 2017-06-28 2017-09-15 北京云知声信息技术有限公司 Method of speech processing and device
CN107230397A (en) * 2017-07-26 2017-10-03 绮语(北京)文化传媒有限公司 A kind of parent-offspring's children education audio generation and processing method and device
CN107274900A (en) * 2017-08-10 2017-10-20 北京灵隆科技有限公司 Information processing method and its system for control terminal
CN107705793A (en) * 2017-09-22 2018-02-16 百度在线网络技术(北京)有限公司 Information-pushing method, system and its equipment based on Application on Voiceprint Recognition
CN107825433A (en) * 2017-10-27 2018-03-23 安徽硕威智能科技有限公司 A kind of card machine people of children speech instruction identification
CN107844586A (en) * 2017-11-16 2018-03-27 百度在线网络技术(北京)有限公司 News recommends method and apparatus
CN107886949A (en) * 2017-11-24 2018-04-06 科大讯飞股份有限公司 A kind of content recommendation method and device
CN108021622A (en) * 2017-11-21 2018-05-11 北京金山安全软件有限公司 Information determination method and device, electronic equipment and storage medium
CN108182946A (en) * 2017-12-25 2018-06-19 广州势必可赢网络科技有限公司 Vocal music mode selection method and device based on voiceprint recognition
WO2018108080A1 (en) * 2016-12-13 2018-06-21 北京奇虎科技有限公司 Voiceprint search-based information recommendation method and device
TWI638352B (en) * 2017-06-02 2018-10-11 元鼎音訊股份有限公司 Electronic device capable of adjusting output sound and method of adjusting output sound
CN108786127A (en) * 2018-06-25 2018-11-13 王芳 Parachute-type lifting body manoeuvring platform
CN108882032A (en) * 2018-06-08 2018-11-23 百度在线网络技术(北京)有限公司 Method and apparatus for output information
CN108922525A (en) * 2018-06-19 2018-11-30 Oppo广东移动通信有限公司 Method of speech processing, device, storage medium and electronic equipment
CN109237740A (en) * 2018-07-31 2019-01-18 珠海格力电器股份有限公司 Control method and device of electric appliance, storage medium and electric appliance
CN109545232A (en) * 2019-01-21 2019-03-29 美的集团武汉制冷设备有限公司 Information-pushing method, information push-delivery apparatus and interactive voice equipment
CN109582822A (en) * 2018-10-19 2019-04-05 百度在线网络技术(北京)有限公司 A kind of music recommended method and device based on user speech
CN109640142A (en) * 2018-12-21 2019-04-16 咪咕数字传媒有限公司 Content recommendation method and device, equipment and storage medium
WO2019177102A1 (en) * 2018-03-14 2019-09-19 株式会社ウフル Ai speaker system, method for controlling ai speaker system, and program
WO2020052135A1 (en) * 2018-09-10 2020-03-19 珠海格力电器股份有限公司 Music recommendation method and apparatus, computing apparatus, and storage medium
CN111683181A (en) * 2020-04-27 2020-09-18 平安科技(深圳)有限公司 Voice-based user gender and age identification method and device and computer equipment
CN111862947A (en) * 2020-06-30 2020-10-30 百度在线网络技术(北京)有限公司 Method, apparatus, electronic device, and computer storage medium for controlling smart device
CN111933151A (en) * 2020-08-16 2020-11-13 云知声智能科技股份有限公司 Method, device and equipment for processing call data and storage medium
CN112333550A (en) * 2020-06-19 2021-02-05 深圳Tcl新技术有限公司 Program query method, device, equipment and computer storage medium
CN112333542A (en) * 2020-08-20 2021-02-05 深圳Tcl新技术有限公司 Program recommendation page determining method, device and equipment and readable storage medium
CN112884423A (en) * 2019-11-29 2021-06-01 北京国双科技有限公司 Information processing method and device, electronic equipment and storage medium
CN113488037A (en) * 2020-07-10 2021-10-08 青岛海信电子产业控股股份有限公司 Speech recognition method
CN113538048A (en) * 2021-07-12 2021-10-22 深圳市明源云客电子商务有限公司 Demand information obtaining method and device, terminal equipment and storage medium
CN113539274A (en) * 2021-06-15 2021-10-22 复旦大学附属肿瘤医院 Voice processing method and device
CN114726635A (en) * 2022-04-15 2022-07-08 北京三快在线科技有限公司 Authority verification method, device, electronic equipment and medium

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104299148A (en) * 2013-07-15 2015-01-21 武汉好气质科技有限公司 System and method for publishing advertisements on waterfall-type webpage
CN104731917A (en) * 2015-03-25 2015-06-24 百度在线网络技术(北京)有限公司 Recommendation method and device
CN104795067A (en) * 2014-01-20 2015-07-22 华为技术有限公司 Voice interaction method and device
CN104992706A (en) * 2015-05-15 2015-10-21 百度在线网络技术(北京)有限公司 Voice-based information pushing method and device
CN105095406A (en) * 2015-07-09 2015-11-25 百度在线网络技术(北京)有限公司 Method and apparatus for voice search based on user feature
CN105095427A (en) * 2015-07-17 2015-11-25 小米科技有限责任公司 Search recommendation method and device
CN105306815A (en) * 2015-09-30 2016-02-03 努比亚技术有限公司 Shooting mode switching device, method and mobile terminal
CN105391730A (en) * 2015-12-02 2016-03-09 北京云知声信息技术有限公司 Information feedback method, device and system
CN105426508A (en) * 2015-11-30 2016-03-23 百度在线网络技术(北京)有限公司 Webpage generation method and apparatus
CN105444332A (en) * 2014-08-19 2016-03-30 青岛海尔智能家电科技有限公司 Equipment voice control method and device
CN105635795A (en) * 2015-12-30 2016-06-01 小米科技有限责任公司 Collection method and apparatus of television user behavior information

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104299148A (en) * 2013-07-15 2015-01-21 武汉好气质科技有限公司 System and method for publishing advertisements on waterfall-type webpage
CN104795067A (en) * 2014-01-20 2015-07-22 华为技术有限公司 Voice interaction method and device
CN105444332A (en) * 2014-08-19 2016-03-30 青岛海尔智能家电科技有限公司 Equipment voice control method and device
CN104731917A (en) * 2015-03-25 2015-06-24 百度在线网络技术(北京)有限公司 Recommendation method and device
CN104992706A (en) * 2015-05-15 2015-10-21 百度在线网络技术(北京)有限公司 Voice-based information pushing method and device
CN105095406A (en) * 2015-07-09 2015-11-25 百度在线网络技术(北京)有限公司 Method and apparatus for voice search based on user feature
CN105095427A (en) * 2015-07-17 2015-11-25 小米科技有限责任公司 Search recommendation method and device
CN105306815A (en) * 2015-09-30 2016-02-03 努比亚技术有限公司 Shooting mode switching device, method and mobile terminal
CN105426508A (en) * 2015-11-30 2016-03-23 百度在线网络技术(北京)有限公司 Webpage generation method and apparatus
CN105391730A (en) * 2015-12-02 2016-03-09 北京云知声信息技术有限公司 Information feedback method, device and system
CN105635795A (en) * 2015-12-30 2016-06-01 小米科技有限责任公司 Collection method and apparatus of television user behavior information

Cited By (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018108080A1 (en) * 2016-12-13 2018-06-21 北京奇虎科技有限公司 Voiceprint search-based information recommendation method and device
TWI638352B (en) * 2017-06-02 2018-10-11 元鼎音訊股份有限公司 Electronic device capable of adjusting output sound and method of adjusting output sound
CN107170456A (en) * 2017-06-28 2017-09-15 北京云知声信息技术有限公司 Method of speech processing and device
CN107230397A (en) * 2017-07-26 2017-10-03 绮语(北京)文化传媒有限公司 A kind of parent-offspring's children education audio generation and processing method and device
CN107274900B (en) * 2017-08-10 2020-09-18 北京京东尚科信息技术有限公司 Information processing method for control terminal and system thereof
CN107274900A (en) * 2017-08-10 2017-10-20 北京灵隆科技有限公司 Information processing method and its system for control terminal
CN107705793A (en) * 2017-09-22 2018-02-16 百度在线网络技术(北京)有限公司 Information-pushing method, system and its equipment based on Application on Voiceprint Recognition
CN107705793B (en) * 2017-09-22 2023-01-31 百度在线网络技术(北京)有限公司 Information pushing method, system and equipment based on voiceprint recognition
CN107825433A (en) * 2017-10-27 2018-03-23 安徽硕威智能科技有限公司 A kind of card machine people of children speech instruction identification
CN107844586A (en) * 2017-11-16 2018-03-27 百度在线网络技术(北京)有限公司 News recommends method and apparatus
CN108021622A (en) * 2017-11-21 2018-05-11 北京金山安全软件有限公司 Information determination method and device, electronic equipment and storage medium
CN107886949A (en) * 2017-11-24 2018-04-06 科大讯飞股份有限公司 A kind of content recommendation method and device
CN108182946B (en) * 2017-12-25 2021-04-13 广州势必可赢网络科技有限公司 Vocal music mode selection method and device based on voiceprint recognition
CN108182946A (en) * 2017-12-25 2018-06-19 广州势必可赢网络科技有限公司 Vocal music mode selection method and device based on voiceprint recognition
WO2019177102A1 (en) * 2018-03-14 2019-09-19 株式会社ウフル Ai speaker system, method for controlling ai speaker system, and program
CN108882032A (en) * 2018-06-08 2018-11-23 百度在线网络技术(北京)有限公司 Method and apparatus for output information
CN108922525A (en) * 2018-06-19 2018-11-30 Oppo广东移动通信有限公司 Method of speech processing, device, storage medium and electronic equipment
WO2019242414A1 (en) * 2018-06-19 2019-12-26 Oppo广东移动通信有限公司 Voice processing method and apparatus, storage medium, and electronic device
CN108786127A (en) * 2018-06-25 2018-11-13 王芳 Parachute-type lifting body manoeuvring platform
CN109237740A (en) * 2018-07-31 2019-01-18 珠海格力电器股份有限公司 Control method and device of electric appliance, storage medium and electric appliance
WO2020052135A1 (en) * 2018-09-10 2020-03-19 珠海格力电器股份有限公司 Music recommendation method and apparatus, computing apparatus, and storage medium
CN109582822A (en) * 2018-10-19 2019-04-05 百度在线网络技术(北京)有限公司 A kind of music recommended method and device based on user speech
CN109640142A (en) * 2018-12-21 2019-04-16 咪咕数字传媒有限公司 Content recommendation method and device, equipment and storage medium
CN109640142B (en) * 2018-12-21 2021-08-06 咪咕数字传媒有限公司 Content recommendation method and device, equipment and storage medium
CN109545232A (en) * 2019-01-21 2019-03-29 美的集团武汉制冷设备有限公司 Information-pushing method, information push-delivery apparatus and interactive voice equipment
CN112884423A (en) * 2019-11-29 2021-06-01 北京国双科技有限公司 Information processing method and device, electronic equipment and storage medium
CN111683181A (en) * 2020-04-27 2020-09-18 平安科技(深圳)有限公司 Voice-based user gender and age identification method and device and computer equipment
CN111683181B (en) * 2020-04-27 2022-04-12 平安科技(深圳)有限公司 Voice-based user gender and age identification method and device and computer equipment
CN112333550A (en) * 2020-06-19 2021-02-05 深圳Tcl新技术有限公司 Program query method, device, equipment and computer storage medium
CN112333550B (en) * 2020-06-19 2024-01-19 深圳Tcl新技术有限公司 Program query method, device, equipment and computer storage medium
CN111862947A (en) * 2020-06-30 2020-10-30 百度在线网络技术(北京)有限公司 Method, apparatus, electronic device, and computer storage medium for controlling smart device
CN113488037A (en) * 2020-07-10 2021-10-08 青岛海信电子产业控股股份有限公司 Speech recognition method
CN113488037B (en) * 2020-07-10 2024-04-12 海信集团控股股份有限公司 Speech recognition method
CN111933151A (en) * 2020-08-16 2020-11-13 云知声智能科技股份有限公司 Method, device and equipment for processing call data and storage medium
CN112333542A (en) * 2020-08-20 2021-02-05 深圳Tcl新技术有限公司 Program recommendation page determining method, device and equipment and readable storage medium
CN113539274A (en) * 2021-06-15 2021-10-22 复旦大学附属肿瘤医院 Voice processing method and device
CN113538048A (en) * 2021-07-12 2021-10-22 深圳市明源云客电子商务有限公司 Demand information obtaining method and device, terminal equipment and storage medium
CN114726635A (en) * 2022-04-15 2022-07-08 北京三快在线科技有限公司 Authority verification method, device, electronic equipment and medium
CN114726635B (en) * 2022-04-15 2023-09-12 北京三快在线科技有限公司 Authority verification method and device, electronic equipment and medium

Similar Documents

Publication Publication Date Title
CN106128467A (en) Method of speech processing and device
US11302337B2 (en) Voiceprint recognition method and apparatus
US11705096B2 (en) Autonomous generation of melody
US9547471B2 (en) Generating computer responses to social conversational inputs
Storkel Learning new words
CN105895105B (en) Voice processing method and device
US11264006B2 (en) Voice synthesis method, device and apparatus, as well as non-volatile storage medium
CN107170456A (en) Method of speech processing and device
CN103730032B (en) Multi-medium data control method and system
Rettberg Hand signs for lip-syncing: The emergence of a gestural language on musical. ly as a video-based equivalent to emoji
CN109801527B (en) Method and apparatus for outputting information
CN106847284A (en) Electronic equipment, computer-readable recording medium and voice interactive method
CN108900612A (en) Method and apparatus for pushed information
Gibson Sociophonetics of popular music: Insights from corpus analysis and speech perception experiments
CN107908743A (en) Artificial intelligence application construction method and device
CN111243604B (en) Training method for speaker recognition neural network model supporting multiple awakening words, speaker recognition method and system
CN106686226A (en) Method and system for playing audio of terminal
CN106847273B (en) Awakening word selection method and device for voice recognition
CN108847066A (en) A kind of content of courses reminding method, device, server and storage medium
Ramati et al. Use this sound: Networked ventriloquism on Yiddish TikTok
CN106098057A (en) Play word speed management method and device
CN111339352B (en) Audio generation method, device and storage medium
CN114297354B (en) Bullet screen generation method and device, storage medium and electronic device
US20140113264A1 (en) Emotion exchange apparatus and method for providing thereof
KR20140088327A (en) Method for studying language apply to dynamic conversation, system and apparatus thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20161116

RJ01 Rejection of invention patent application after publication