Interactive telephone voice information service system and method
The present invention relates to telephone information service technology, and in particular to an interactive telephone voice information service system and method.
In existing telephone information service technology, Fig. 1 is a schematic diagram of a voice information service system typified by the 168 automatic information desk. Its characteristic features are code-based retrieval of information and storage of the information content as recordings. Such a system generally consists of three parts: the subscriber telephone, the telephone network, and the urban node. The urban node mainly comprises an access and switching module, a service management module, a voice file library module, and a database module. To query information, the user enters a fixed information code with the numeric keys of the subscriber telephone, and the code is sent to the urban node through the telephone network. The access and switching module of the urban node receives the information code, retrieves the recording file corresponding to it, and plays the file through a sound card; the voice signal is sent to the subscriber telephone through the telephone network, and the user hears the information content. Such systems have the following features:
The subscriber telephone is the information inquiry terminal, and query requests are made by pressing the numeric keys on the telephone set;
When the user makes a query request, what the subscriber telephone transmits to the urban node over the telephone network is dual-tone multi-frequency (DTMF) signals representing the digits 0-9;
Information is stored as recordings, and the stored content is recording files;
The voice information system of each city operates in isolation; the systems are not interconnected and data are not shared.
The traditional voice information service system typified by the 168 automatic information desk has the following deficiencies:
Query requests must be made as digital codes, which requires the user to memorize the information codes or carry a code book. Voice information services, however, cover a wide and varied range of content, each item has its own digital code, and it is hardly possible for users to remember the codes. Moreover, because users tend to use voice information services on the spur of the moment, they are unlikely to be carrying a code book. The lack of a convenient way to respond to user queries is the first of the main bottlenecks hindering the rapid development of traditional voice information systems;
Information is stored as recordings, so every update of the information content must go through recording, editing, and replacement, and the content cannot be kept up to date in real time. This is the second of the main bottlenecks hindering the rapid development of traditional voice information systems;
The urban node must store a huge number of recording files and therefore needs a very large storage capacity. The huge volume of speech data must also be managed by a large and complex database, which makes the system expensive. The investment for a traditional voice information service system in a medium-sized city is generally close to ten million yuan. In addition, because every update of every item must be recorded and replaced, the system maintenance cost is also high.
Isolated systems lead to a great deal of duplicated investment. Isolated systems also restrict the rapid enrichment of information sources. This is the third of the main bottlenecks hindering the rapid development of traditional voice information systems.
The object of the present invention is to provide an information service system using interactive telephone voice, referred to as the "Telephone Internet".
Another object of the present invention is to provide an information service method using interactive telephone voice.
With the system and method of the present invention, people can use dumb terminals such as telephone sets (including mobile phones) or fax machines to obtain and send information by voice (supplemented by a small number of key presses).
To achieve the above objects, the present invention adopts the following measures:
An interactive telephone voice information service system of the present invention comprises an urban node, the urban node comprising: a message control module, a business data module, an access and switching module, and a service management module; the modules are connected by a computer network;
It is characterized in that: the urban node further comprises a remote communication module connected to the computer network.
Wherein: the urban node further comprises a speech recognition service module connected to the computer network;
The speech recognition service module performs automatic gain control, adaptive noise filtering, and acoustic pattern recognition on the telephone speech signal.
Wherein: the urban node further comprises a speech synthesis service module connected to the computer network;
The speech synthesis service module provides text-to-speech conversion: the text undergoes syntactic analysis and word segmentation, then pitch selection and concatenation, and finally simulation of oral articulation, to synthesize the speech signal.
Another interactive telephone voice information service system of the present invention comprises urban nodes, each urban node comprising: a message control module, a business data module, an access and switching module, and a service management module; the modules are connected by a local area network (LAN);
It is characterized in that: it further comprises a network connection between the urban nodes.
Wherein: the urban node further comprises a remote communication module connected to the local area network.
Wherein: the urban node further comprises a WWW service module connected to the local area network.
Wherein: the urban node further comprises a speech recognition service module connected to the local area network.
Wherein: the urban node further comprises a speech synthesis service module connected to the local area network.
Wherein: the network connection between the urban nodes conforms to the TCP/IP protocol.
Wherein: the network connecting the urban nodes is an IP network.
An interactive telephone voice information service method of the present invention comprises the following steps:
a, the user dials a short code on a terminal device and accesses the urban node through the telephone network and the access trunk;
b, the voice signal of the service name spoken by the user is received by the access and switching module, and the speech recognition service module identifies the service category, so that the service category and its location are determined; if the service is not local,
c, the call is forwarded to the destination urban node through the remote communication module and the wide area network;
d, at the urban node of the destination city, after the speech recognition service module identifies the service name, the service flow of that service is entered through the message control module;
e, the user interacts further with the system.
Wherein: in step b, if the speech recognition service module identifies the service category as local, the following step is carried out:
The service flow of the corresponding service is entered through the message control module, and the user interacts further with the system.
Wherein: if the query result (including received e-mail) is in text format, the following steps are carried out: the speech synthesis service module converts the text content into a voice signal, and the access and switching module then outputs the voice signal.
Wherein: in step b, the system can record the mail content dictated by the user through the telephone microphone, save it in a certain voice format, and send it as the content of an e-mail.
The specific structural features and method features of the present invention are described in detail below with reference to the accompanying drawings and embodiments:
Brief description of the drawings:
Fig. 1: circuit block diagram of the urban node in the existing automatic information desk system;
Fig. 2: circuit block diagram of the urban node of the present invention;
Fig. 3: networking schematic diagram of the present invention;
Fig. 4: flow chart of telephone speech recognition in the urban node of the present invention;
Fig. 5: flow chart of telephone speech synthesis in the urban node of the present invention;
Fig. 6: schematic flow chart of the method of the present invention.
The interactive telephone voice information service system of the present invention comprises two major parts: the urban nodes (see Fig. 2) and the networking scheme (see Fig. 3). As shown in Fig. 2, an urban node mainly consists of message control module 1, business data module 2, access and switching module 3, speech recognition service module 4, speech synthesis service module 5, service management module 6, WWW service module 7, and remote communication module 8. Logically, each module performs its own function and the modules work together. Structurally, the modules can all run on one machine, or on two or more machines connected by local area network 9. To show the working process of the urban node completely, Fig. 2 also includes subscriber telephone (or mobile phone) 10, fax machine 11, external automatic station 12, external manual station 13, telephone network 14, and access trunk 15.
Different services have different service flows; for example, the service flow of the enterprise information inquiry service differs greatly from that of the voice e-mail service. The service flows of all services are loaded on message control module 1. A service flow usually has branches. Based on the events reported by access and switching module 3 (for example, the user has issued a voice command or pressed a numeric key), message control module 1 decides the next action, such as playing a prompt tone, receiving the user's input signal, or querying the database. Message control module 1 supports dynamic loading and unloading of services without affecting the operation of other services.
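By way of illustration only, the event-driven behaviour of message control module 1 can be sketched as a registry of service flows that can be loaded and unloaded at run time. The class name, the event tuples, and the action strings below are all hypothetical; the specification does not prescribe any particular data structure.

```python
from typing import Callable, Dict, Tuple

# Hypothetical event shape: (kind, value), e.g. ("voice", "stock quotes") or ("key", "1").
Event = Tuple[str, str]
# Hypothetical flow shape: (state, event) -> (next_state, action).
Flow = Callable[[str, Event], Tuple[str, str]]


class MessageControl:
    """Holds the loaded service flows and picks the next action for each event."""

    def __init__(self) -> None:
        self._flows: Dict[str, Flow] = {}

    def load(self, service: str, flow: Flow) -> None:
        # Loading one service does not touch the flows of other services.
        self._flows[service] = flow

    def unload(self, service: str) -> None:
        self._flows.pop(service, None)

    def dispatch(self, service: str, state: str, event: Event) -> Tuple[str, str]:
        flow = self._flows.get(service)
        if flow is None:
            return state, "play_prompt:service_unavailable"
        return flow(state, event)


def stock_flow(state: str, event: Event) -> Tuple[str, str]:
    """A two-branch flow: prompt for a stock name, then query the database."""
    kind, value = event
    if state == "start":
        return "await_stock", "play_prompt:say_stock_name"
    if state == "await_stock" and kind == "voice":
        return "done", f"query_database:{value}"
    return state, "play_prompt:please_repeat"
```

In this sketch a service is unloaded simply by removing its entry, so calls in progress on other services keep their own flow functions.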
Business data module 2 stores the various data of the urban node, including service data, user data, and billing data. Business data module 2 supports large-scale concurrent access and has a real-time backup mechanism.
Access and switching module 3 can be regarded as a powerful switch formed by combining a computer with voice/fax cards. Its main functions are as follows:
It connects to the trunk through voice/fax cards and has signaling processing capability.
It can identify the calling number, recognize the user's key presses, call speech recognition service module 4 to recognize the user's voice commands, detect call events, and report them to message control module 1.
It can control the call process according to the instructions of message control module 1.
It can actively make outgoing calls, or transfer calls to external automatic station 12, external manual station 13, and the like.
Speech recognition service module 4 can identify, from the telephone speech, which command in a list of candidate commands the user has spoken, thereby changing the telephone user's input method from key presses to voice input. Its flow is shown in Fig. 4: the telephone speech signal first undergoes automatic gain control, then adaptive noise filtering, and finally acoustic pattern recognition.
Speech synthesis service module 5 provides text-to-speech conversion, which removes the restriction that information must be pre-recorded and makes it possible to query real-time, dynamic, and massive information. In an intelligent network, the speech synthesis module falls into the category of intelligent peripherals (IP). Its flow is shown in Fig. 5: the text first undergoes syntactic analysis and word segmentation, then pitch selection and concatenation, and finally simulation of oral articulation, to synthesize speech.
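The three stages of Fig. 4 can be illustrated, under strong simplifying assumptions, by the following sketch. RMS normalization stands in for automatic gain control, a fixed-threshold gate stands in for adaptive noise filtering, and nearest-template matching on frame energies stands in for acoustic pattern recognition; none of these is stated in the specification to be the technique actually used, and all function names and parameter values are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def automatic_gain_control(samples: Sequence[float], target_rms: float = 0.1) -> List[float]:
    """Scale the signal so that its RMS level equals target_rms."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples)) if samples else 0.0
    if rms == 0.0:
        return list(samples)
    gain = target_rms / rms
    return [s * gain for s in samples]


def noise_gate(samples: Sequence[float], threshold: float = 0.02) -> List[float]:
    """Crude stand-in for adaptive noise filtering: zero out low-level samples."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


def frame_energies(samples: Sequence[float], frame: int = 4) -> List[float]:
    """Toy feature vector: the energy of each fixed-length frame."""
    return [sum(s * s for s in samples[i:i + frame]) for i in range(0, len(samples), frame)]


def extract_features(samples: Sequence[float]) -> List[float]:
    return frame_energies(noise_gate(automatic_gain_control(samples)))


def recognize(samples: Sequence[float], templates: Dict[str, List[float]]) -> str:
    """Return the candidate command whose template is closest to the input features."""
    features = extract_features(samples)

    def distance(name: str) -> float:
        t = templates[name]
        n = min(len(t), len(features))
        return sum((features[i] - t[i]) ** 2 for i in range(n)) + abs(len(t) - len(features))

    return min(templates, key=distance)
```

Because the gain stage normalizes the level first, a quiet and a loud utterance of the same command yield nearly identical features in this sketch, which is the purpose of placing gain control ahead of recognition.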
Service management module 6 is used to create, load, and unload services, to monitor service operation, and to manage and compile statistics on the various service data, user data, and billing data.
WWW service module 7 allows users to query the information in business data module 2 of the urban node through a Web browser as well; it also allows other urban nodes to query the information in business data module 2 during their service flows.
Remote communication module 8 is the bridge from the urban node to external information sources (including websites on the Internet) and to other urban nodes; it also includes functions such as a firewall and a router.
The urban nodes are interconnected with one another, and with the information sources, through the TCP/IP protocol. Physically, they can be connected through public networks such as ChinaNet and ChinaGBN, or through dedicated lines. In this way, all information is stored in distributed fashion across the urban nodes and information sources yet can be fully shared, together forming one huge distributed information base. For frequently used information, a mirror copy can be kept at each urban node, trading some redundant storage for faster access.
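The trade-off between mirror copies and remote access described above can be sketched as a lookup that prefers local data, then the local mirror, and only then a remote node. The function name, the dictionary-based stores, and the `remote_fetch` callable are hypothetical; the transport over TCP/IP is deliberately left out.

```python
from typing import Callable, Dict, Tuple


def fetch(
    key: str,
    local_store: Dict[str, str],
    mirror: Dict[str, str],
    remote_fetch: Callable[[str], str],
) -> Tuple[str, str]:
    """Return (value, source), trying the cheapest source first."""
    if key in local_store:
        # Information owned by this urban node.
        return local_store[key], "local"
    if key in mirror:
        # Redundant copy of frequently used information from elsewhere.
        return mirror[key], "mirror"
    # Fall back to the owning node or information source over the network.
    return remote_fetch(key), "remote"
```

How mirror copies are kept up to date is not addressed by this sketch.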
As shown in Fig. 6, the overall workflow comprises the following steps. Step 601: the user, using a terminal device such as telephone set (or mobile phone) 10 or fax machine 11, dials a short code (a nationally unified number, e.g. 17999) and accesses the system through telephone network 14 and access trunk 15. Step 602: the system plays a welcome announcement through the sound card. Step 603: the user speaks a service name (e.g. "stock quotes"). Step 604: speech recognition service module 4 identifies the service name. Step 605: message control module 1 determines where the service is located. If the service is not located locally, step 606 is carried out: the call is transferred to the urban node where the service is located, and the unified service number is transmitted at the same time. Step 607: interaction with the user continues according to the service flow; for example, the user is prompted to speak an enterprise name, and after the speech recognition service module identifies the spoken enterprise name, the enterprise information is retrieved, synthesized by speech synthesis service module 5, and played to the user. If the service is located locally, step 607 is carried out directly.
In step 607, the user can interact further with the system through voice commands or key presses. Depending on the service and on the user's input, the system queries business data module 2 of this node, or queries or even modifies a remote information source through remote communication module 8, or sends information (such as e-mail) to a remote target. When information is queried and the query result is in text format, speech synthesis service module 5 converts the text into clear and natural speech, which is then played to the user through access and switching module 3. If the user terminal is fax machine 11, the system can also fax text and graphics to the user. For some services, the user's call is transferred by access and switching module 3 to external automatic station 12 (such as the telephone inquiry system of the civil aviation authority) or external manual station 13 (such as a telephone clinic hotline), so as to make full use of existing resources and extend the range of services.
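The routing decision of steps 605 and 606 can be sketched as follows. The directory structure, the city labels, and the service numbers are hypothetical placeholders; the specification states only that the service location is determined and that a unified service number accompanies a forwarded call.

```python
from typing import Dict, NamedTuple, Optional


class ServiceEntry(NamedTuple):
    service_id: str  # unified service number (hypothetical format)
    city: str        # urban node where the service is located


class Routing(NamedTuple):
    action: str       # "enter_local_flow" or "forward"
    target_city: str
    service_id: str


def route_call(
    local_city: str,
    service_name: str,
    directory: Dict[str, ServiceEntry],
) -> Optional[Routing]:
    """Decide whether a recognized service runs locally or is forwarded."""
    entry = directory.get(service_name)
    if entry is None:
        # Unknown service name: the caller would be prompted again.
        return None
    action = "enter_local_flow" if entry.city == local_city else "forward"
    return Routing(action, entry.city, entry.service_id)
```

In both outcomes the call then proceeds to step 607; the only difference is the urban node at which the service flow runs.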
The services that can be offered at present include: real-time stock quotes, personal address book, voice mail, nationwide industrial and commercial enterprise inquiry, flight inquiry, quotes for some commodity markets, and so on.
The specific workflow of each service item of the present invention is described below with reference to embodiments:
1, the user queries real-time stock quotes
The user dials the short code on telephone set 10 and accesses the system through telephone network 14 and access trunk 15. The user speaks the service name "stock quotes". After speech recognition service module 4 identifies the service name "stock quotes", the service flow of this service is entered through message control module 1. The user can then speak a stock name. After speech recognition service module 4 identifies the stock name, the latest quote is obtained from the business data module, the quote data are synthesized into speech by the speech synthesis service module, and the speech is played to the user through the access and switching module.
2, personal address book
The user dials the system short code on telephone set 10, accesses the system through telephone network 14 and access trunk 15, and says "address book". Speech recognition service module 4 identifies the service name "address book", and the system prompts: "Please enter your account number". After the user enters the account number, the system opens the user's personal address book and prompts: "Whom are you looking for?" The user answers: "So-and-so". After speech recognition service module 4 identifies the contact's name, the contact's telephone number is obtained from the business data module, synthesized into speech by the speech synthesis service module, and played through the access and switching module: "So-and-so, telephone number: ******. Connect or choose again?" If the user says "connect", the access and switching module transfers the user's call to that number.
3, sending voice mail
The user dials the system short code, accesses the system through telephone network 14 and access trunk 15, and says: "send mail". After speech recognition service module 4 identifies the service name, the system asks: "Send to whom?" The user answers: "So-and-so". The system prompts: "Please dictate the mail content". The user then dictates the mail content and presses a key when finished. The system records the dictated mail content, saves it in a certain voice format (such as ADPCM), and sends it as the content of an e-mail.
4, receiving voice mail
The user dials the system short code, accesses the system through telephone network 14 and access trunk 15, and says: "receive mail". After speech recognition service module 4 identifies the service name, the system retrieves mail from the mail server set by the user. If the mail content is text, it is synthesized into speech by the speech synthesis service module and played to the user; if the mail content is voice in a certain format, it is converted into a format the sound card can play and then played to the user.
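As a minimal sketch of the last step of embodiment 3, the recorded audio can be attached to an e-mail using Python's standard `email` package. The function name, the subject line, the file name, and the choice of the MIME type `audio/32kadpcm` are assumptions made for illustration; the specification says only that the recording is saved in a certain voice format such as ADPCM. Recording, encoding, and actual delivery to a mail server are omitted.

```python
from email.message import EmailMessage


def build_voice_mail(sender: str, recipient: str, adpcm_bytes: bytes) -> EmailMessage:
    """Wrap already-encoded voice data as the attachment of an e-mail."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = "Voice mail"  # hypothetical subject line
    msg.set_content("This message contains a dictated voice recording.")
    # The MIME subtype below is an assumption, not taken from the specification.
    msg.add_attachment(
        adpcm_bytes,
        maintype="audio",
        subtype="32kadpcm",
        filename="message.adpcm",
    )
    return msg
```

The resulting message object could then be handed to any mail transport; which transport the system uses is not stated in the specification.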
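The branching in embodiment 4 between text mail and voice mail can be sketched by inspecting the MIME parts of a received message. The function name and the action labels are hypothetical, and the speech synthesis and audio conversion steps themselves are represented only by those labels.

```python
from email import message_from_bytes, policy
from email.message import EmailMessage
from typing import List, Tuple


def plan_playback(raw_mail: bytes) -> List[Tuple[str, str]]:
    """List the playback action needed for each part of a received mail."""
    msg = message_from_bytes(raw_mail, policy=policy.default)
    actions: List[Tuple[str, str]] = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # containers carry no playable content themselves
        if part.get_content_maintype() == "text":
            # Text is handed to the speech synthesis service module.
            actions.append(("synthesize", part.get_content().strip()))
        elif part.get_content_maintype() == "audio":
            # Voice is converted to a format the sound card can play.
            actions.append(("convert_and_play", part.get_content_type()))
    return actions


# Usage example with a locally built message (addresses and content are made up).
sample = EmailMessage()
sample["From"] = "a@example.com"
sample["To"] = "b@example.com"
sample.set_content("Meeting at ten.")
sample.add_attachment(b"\x00\x01", maintype="audio", subtype="32kadpcm", filename="m.adpcm")
planned = plan_playback(sample.as_bytes())
```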
Compared with the prior art, the present invention has the following effects:
Because the present invention provides a speech recognition service module and a speech synthesis service module in each urban node, it fully enables people to communicate with the system in natural language and thereby retrieve remote information.
Users of different ages and sexes, in different places and using different telephones, who dial the same special service number (e.g. 17999) can all converse with the system in natural language;
Under average operating conditions, the first-pass recognition rate of the system can exceed 90%, and the second-pass recognition rate reaches 98%;
The time from the user issuing a retrieval request to hearing the retrieval result is 1 to 3 seconds;
The naturalness and clarity of the synthesized speech score 4 points (5 points being announcer level);
In summary, the present invention creates a method by which people and the system communicate in natural language to realize information inquiry. The system achieves the following breakthroughs:
1, the user makes query requests in natural language instead of digital codes, which breaks through the awkward bottleneck of the conventional technology;
2, information is stored as text and converted from text to speech in real time by speech synthesis technology, so that information updates take effect in real time;
3, an information file stored as text occupies less than 1% of the hard disk space of the corresponding recording file. At the same time, speech synthesis technology converts the information from text to speech automatically, eliminating the tedious process of recording, editing, and replacement and significantly reducing the maintenance cost of the system.
4, the urban nodes are networked nationwide through the TCP/IP protocol, and the information sources are shared in the form of a distributed information base. This structure ensures that a large number of high-quality information sources can be attached as external sources and shared by systems throughout the country. The voice information field belongs to the information service field, and the key to success in the information service field is rich and practical information sources. On the national network, the information sources are attached externally and maintained by their respective professional information providers, which also ensures that the information is accurate and up to date.
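The storage figure in item 3 above can be checked with a rough estimate. All parameter values in the sketch below are assumptions chosen for illustration (a speaking rate of 4 characters per second, 2 bytes per character, and 8 kHz audio at 4 bits per sample); the specification does not give the basis for its figure.

```python
def text_to_audio_ratio(
    chars_per_second: float = 4.0,  # assumed speaking rate
    bytes_per_char: int = 2,        # assumed two-byte character encoding
    sample_rate_hz: int = 8000,     # assumed telephone-band sampling rate
    bits_per_sample: int = 4,       # assumed 4-bit ADPCM
) -> float:
    """Ratio of text storage to audio storage for the same spoken content."""
    text_bytes_per_second = chars_per_second * bytes_per_char
    audio_bytes_per_second = sample_rate_hz * bits_per_sample / 8
    return text_bytes_per_second / audio_bytes_per_second


ratio = text_to_audio_ratio()
```

Under these assumptions, one second of speech corresponds to 8 bytes of text against 4000 bytes of audio, a ratio of 0.2%, which is consistent with the stated figure of less than 1%.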