CN108831434A - voice interactive system and method - Google Patents

voice interactive system and method Download PDF

Info

Publication number
CN108831434A
CN108831434A CN201810529253.6A CN201810529253A CN108831434A CN 108831434 A CN108831434 A CN 108831434A CN 201810529253 A CN201810529253 A CN 201810529253A CN 108831434 A CN108831434 A CN 108831434A
Authority
CN
China
Prior art keywords
voice
intelligent agent
customer end
simple customer
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810529253.6A
Other languages
Chinese (zh)
Inventor
尹绍华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201810529253.6A priority Critical patent/CN108831434A/en
Publication of CN108831434A publication Critical patent/CN108831434A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

This application involves a kind of voice interactive system and methods, including:Multiple simple customer ends;Intelligent agent;The intelligent agent is connect with each simple customer end respectively;The simple customer end is used to receiving and/or playing voice, and voice transfer is carried out between the intelligent agent;The intelligent agent is used to carry out the relevant treatment that intelligent sound interacts with Cloud Server, and obtains relevant information, and the relevant information is converted to voice signal.By the way that intelligent agent is arranged, simple customer end only needs to carry out voice input, output, completes voice communication with intelligent agent, it can allows user to complete the interactive voice of intelligence, therefore the simplification of family terminal can be used, expand application scenarios, reduce production cost.

Description

Voice interactive system and method
Technical field
This application involves technical field of voice interaction more particularly to a kind of voice interactive system and methods.
Background technique
Artificial Intelligence Development quickly, is applied more and more extensive in many occasions at present.Wherein, based on the people of interactive voice Work intellectual product has more application and new product to occur at present.
Its implementation is typically all the interaction framework based on SmartClient and cloud server, complicated calculating collection In in cloud server, so as to simplify client, save the cost of client.
In the related technology, although simplifying client, still certain intelligence requires client, for example, client If end will be interacted with cloud server, need to realize basic interaction logic, the realization of these interaction logics is to equipment It there are certain requirements, such as the ability in operation third party's intelligent sound processing terminal library, the energy of related resource obtained by network Power etc. realizes that the expense of above-mentioned terminal is not low;Some traditional audio frequency apparatuses simultaneously such as simulate intercommunication terminal, can not Using, or if transformation can improve cost.
Summary of the invention
To be overcome the problems, such as present in the relevant technologies at least to a certain extent, the application provides a kind of voice interactive system And method.
According to the embodiment of the present application in a first aspect, provide a kind of voice interactive system, including:Multiple simple customer ends; Intelligent agent;The intelligent agent is connect with each simple customer end respectively;The simple customer end is for receiving and/or playing Voice, and voice transfer is carried out between the intelligent agent;The intelligent agent is used to carry out intelligent language with Cloud Server The relevant treatment of sound interaction, and relevant information is obtained, the relevant information is converted into voice signal.
Optionally, it is connected between the intelligent agent and each simple customer end using analog communication links, alternatively, Using TCP/IP link connection.
Optionally, each simple customer end is deployed in user local, for completing and the interaction of user.
Optionally, intelligent agent is deployed in user local, manages multiple simple customer ends;Alternatively, deployment is beyond the clouds.
Optionally, the simple customer end includes:Voice acquisition module, for receiving the voice of user;Voice plays mould Block, for playing voice to user;Transmission module, for by the received voice transfer of voice acquisition module to intelligent agent, with And the voice that intelligent agent is sent is received, and received voice is sent to voice playing module.
Optionally, the simple customer end further includes following one or more:Voice coding module, for institute's predicate The received voice of sound acquisition module is encoded, and the voice after coding is sent to the transmission module;Tone decoding module, For receiving the encoded voice of intelligent agent transmission, it is sent to voice after being decoded to encoded voice and plays mould Block;External control interface for receiving the phonetic order converted by voice, and is transferred to third party.
Optionally, the intelligent sound interaction process and obtain the ability that user needs resource that the intelligent agent is used for, Including the one or more of following item:Speech recognition, speech synthesis, semantic understanding, resource acquisition, speech detection, user make by oneself Justice processing logic.
According to the second aspect of the embodiment of the present application, a kind of voice interactive method is provided, including:Simple customer end and intelligence Voice transfer is carried out between agency, wherein the simple customer end is when receiving the voice of user's sending, by received voice It is sent to the intelligent agent, alternatively, the simple customer end receives the voice that the intelligent agent is sent and plays to user; The intelligent agent carries out the relevant treatment that intelligent sound interacts with Cloud Server, and obtains relevant information, by the correlation Information is converted to voice signal;Wherein, each intelligent agent is connect with multiple simple customer ends.
Optionally, the relevant treatment further includes:The intelligent agent is by speech recognition as a result, processing refers to for control It enables, returns to simple customer end.
Technical solution provided by the present application can include the following benefits:
By the way that intelligent agent is arranged, simple customer end only needs to carry out voice input, output, completes audio with intelligent agent Communication, it can allow user to complete the interactive voice of intelligence, therefore the simplification of family terminal can be used, expand application scenarios, drop Low production cost.
It should be understood that above general description and following detailed description be only it is exemplary and explanatory, not The application can be limited.
Detailed description of the invention
The drawings herein are incorporated into the specification and forms part of this specification, and shows the implementation for meeting the application Example, and together with specification it is used to explain the principle of the application.
Fig. 1 is the structural schematic diagram for the voice interactive system that the application one embodiment provides;
Fig. 2 is the structural schematic diagram at the simple customer end for the voice interactive system that another embodiment of the application provides;
Fig. 3 is the flow chart for the voice interactive method that another embodiment of the application provides;
Fig. 4 is the flow chart for the voice interactive method that another embodiment of the application provides;
Fig. 5 is the flow chart for the voice interactive method that another embodiment of the application provides;
Fig. 6 is the flow chart for the voice interactive method that another embodiment of the application provides.
Specific embodiment
Example embodiments are described in detail here, and the example is illustrated in the accompanying drawings.Following description is related to When attached drawing, unless otherwise indicated, the same numbers in different drawings indicate the same or similar elements.Following exemplary embodiment Described in embodiment do not represent all embodiments consistent with the application.On the contrary, they be only with it is such as appended The example of the consistent device and method of some aspects be described in detail in claims, the application.
Fig. 1 is the structural schematic diagram for the voice interactive system that the application one embodiment provides.As shown in Figure 1, with two For simple customer end.The voice interactive system includes:Multiple simple customer ends 1;Intelligent agent 2;The intelligent agent 2 is distinguished It is connect with each simple customer end 1;The simple customer end 1 for receiving and/or play voice, and with the intelligent agent Voice transfer is carried out between 2;The intelligent agent 2 is used to carry out the relevant treatment that intelligent sound interacts with Cloud Server 3, and The information that user needs is obtained, the resource is converted into voice signal.
What above-mentioned simple customer end can have audio outputs and inputs function and transmitting function, simple customer end It is deployed on local, to interact with user.Further, simple customer end can also have the function of audio encoding and decoding.
Above-mentioned intelligent agent can connect multiple simple customer ends, can collect and send audio-frequency information, can also complete With the interaction of Cloud Server, such as:Speech recognition, speech synthesis, semantics recognition, resource acquisition etc..Pass through multiple simple customers End connects the same intelligent agent, can be the service of multiple simple customer ends by an intelligent agent, simple without one Client corresponds to an intelligent agent, to realize simple, saving cost.
Above-mentioned relevant treatment may include speech recognition, speech synthesis, semantics recognition, resource acquisition etc.;Above-mentioned related letter Breath may include speech answering, control instruction etc..
In the present embodiment, by the way that intelligent agent is arranged, simple customer end only needs to carry out voice input, output, with intelligence Agency completes voice communication, it can allows user to complete the interactive voice of intelligence, therefore the simplification of family terminal can be used, expand Application scenarios reduce production cost.
In some embodiments, connected between the intelligent agent 2 and each simple customer end 1 using analog communication links It connects, alternatively, using transmission control protocol/Internet Protocol (Transmission Control Protocol/ Internet Protocol, TCP/IP) link connection.
It should be noted that analogue communication is to utilize the variation of the amplitude, frequency or phase of sine wave, or utilize pulse Amplitude, width or change in location simulate original signal, to achieve the purpose that communication, therefore referred to as analogue communication.TCP/IP chain Road connection is the digital audio and video signals that coding is transmitted by TCP/IP.Two kinds of communication connection modes are all made of the prior art, this Place is no longer described in detail.
Transmission control protocol/Internet Protocol is the most basic agreement in internet, because of the base of special Internet Plinth is made of the IP agreement of network layer and the Transmission Control Protocol of transport layer.
When using TCP/IP link connection mode, it can increase between simple customer end and intelligent agent some simple Interactive instruction plays some specified prompt tone etc., for example, voice transmission is completed to simplify voice input/output control System.
In the present embodiment, by using analog communication links connection or TCP/IP between intelligent agent and simple customer end Link connection, convenient for carrying out information transmission, interactive voice between both ends, in order to save bandwidth, simple customer end can also increase Audio coding or decoded function.
It should be noted that each simple customer end is deployed in user local, for completing and the interaction of user.
In some embodiments, intelligent agent 2 is deployed in the multiple simple customer ends of user's local management;Alternatively, being deployed in cloud End.
In the present embodiment, intelligent agent can be deployed in user local and cloud, can be with management and service in multiple simple Client, to reduce the cost of implementation of whole system.
As shown in Fig. 2, in some embodiments, the simple customer end 1 includes:Voice acquisition module 11 is used for receiving The voice at family;Voice playing module 12, for playing voice to user;Transmission module 13, for connecing voice acquisition module 11 The voice transfer of receipts to intelligent agent 2, and, receive the voice that intelligent agent 2 is sent, and received voice be sent to voice Playing module 12.
It should be noted that above-mentioned voice acquisition module can be microphone, the voice playing module can be loudspeaking Device.
In the present embodiment, by the way that voice acquisition module, voice playing module, transmission module is arranged, meet simple customer end Basic function need, carry out voice collecting, play, and the communication interaction with intelligent agent.
As shown in Fig. 2, the simple customer end further includes following one or more in some embodiments:
Voice coding module 14, for being encoded to the received voice of the voice acquisition module 11, and will be after coding Voice be sent to the transmission module 13;
Tone decoding module 15 carries out encoded voice for receiving the encoded voice of the transmission of intelligent agent 2 Voice playing module 12 is sent to after decoding.
External control interface for receiving the phonetic order converted by voice, and is transferred to third party.
It should be noted that the purpose that voice is encoded be in order to by voice digitization and using people voiced process Present in the auditory properties of redundancy and people reduce encoding rate.
Further, above-mentioned voice acquisition module can be microphone, and above-mentioned voice playing module can be loudspeaker.
It is understood that voice coding and tone decoding can realize that this will not be detailed here using the relevant technologies.
In the present embodiment, by setting voice coding module and tone decoding module, the transmission of voice signal can be compressed Bandwidth increases the efficiency of transmission of channel;External control interface is set, and simple customer end can receive and transmit the control of voice conversion System instruction, realizes the third-party human-computer interaction function of voice control.
In some embodiments, the intelligent sound interaction process and acquisition user that the intelligent agent 2 is used for need resource Ability, including the one or more of following item:Speech recognition, speech synthesis, semantic understanding, resource acquisition, speech detection, use The customized processing logic in family.
It should be noted that intelligent agent can be with the interaction of multiple cloud resource servers, such as speech recognition server, sound Sound synthesis server, semantic understanding and answering server, Resource Server.
In the present embodiment, multiple function is integrated in one by intelligent agent, reduces the function at simple customer end, reduces simple visitor Family end cost.
As shown in figure 3, the application also provides a kind of voice interactive method, include the following steps:
S31:Received voice is sent to intelligent agent when receiving the voice of user's sending by simple customer end;
S32:Intelligent agent carries out the relevant treatment that intelligent sound interacts with Cloud Server, and obtains relevant information, will The relevant information is converted to voice signal;
Wherein, each intelligent agent is connect with multiple simple customer ends.
S33:Simple customer end receives the voice that intelligent agent is sent and plays to user.
In the present embodiment, by intelligent agent, speech recognition, synthesis, semantics recognition etc. can be carried out, and client only needs Voice input, output are carried out, is communicated with intelligent agent, and audio is encoded or decoded, simplifies client, Reduce production cost.
As shown in figure 4, including the following steps the embodiment of the present application also provides a kind of process of voice response:
S41:The connection for arriving intelligent agent is initiated after user's triggering is spoken in simple customer end;
S42:Simple customer end constantly sends the voice input " what is your name " being collected into and arrives intelligent agent;
S43:Intelligent agent collects voice input, detects after the completion of speaking, and sends voice and is input to speech-recognition services, And waiting voice input results;
S44:Intelligent agent receives speech recognition result, sends the text identified to semantic understanding and answer service, and Awaiting reply;
S45:It after intelligent agent receives a reply, is replied if it is text, then sends text and reply to speech synthesis service, and Waiting voice composite result;
S46:Intelligent agent receives speech synthesis as a result, and speech synthesis result is sent back simple customer end;
S47:Simple customer end plays speech synthesis result to user, answers " I is red bean " herein.
In the present embodiment, by the interaction of intelligent agent and Cloud Server, it can be carried out speech recognition, semantics recognition, go forward side by side Voice response process is completed in row speech synthesis sequence of operations, reduces the requirement to client, reduces client cost of manufacture.
As shown in figure 5, including the following steps the embodiment of the present application also provides a broadcasting music process:
S51:The connection for arriving intelligent agent is initiated after user's triggering is spoken in simple customer end;
S52:Simple customer end constantly sends the voice input " I wants to listen lustily water " being collected into and arrives intelligent agent;
S53:Intelligent agent collects voice input, detects after the completion of speaking, and sends voice and is input to speech-recognition services, And waiting voice input results;
S54:Intelligent agent receives speech recognition result, sends the text identified to semantic understanding and answer service, and Awaiting reply;
S55:After intelligent agent receives a reply, if it is the answer of playing music, then it transmit a request to music service Device requests MP3 music;
S56:After intelligent agent receives MP3 stream, the ability whether simple customer end has decoding MP3 is judged, if it is not, Intelligent agent first decodes MP3 stream, if so, then without decoding step;
S57:MP3 circulation is issued simple customer end by intelligent agent;
S58:Simple customer end plays music.
In the present embodiment, by the interaction of intelligent agent and Cloud Server, it can be carried out speech recognition, semantics recognition, go forward side by side Row speech synthesis sequence of operations completes control music process, reduces the requirement to client, reduces client and is fabricated to This.
As shown in fig. 6, including the following steps the embodiment of the present application also provides a voice control process:
S61:Simple customer end can initiate the connection of intelligent agent after user sets out and speaks;
S62:After user speaks, simple customer end can constantly send the sound being collected into intelligent agent, use at this time Say " opening TV " in family;
S63:Intelligent agent collects voice input, detects completion of speaking, that is, sends voice to speech-recognition services, and Awaiting reply;
S64:After intelligent agent receives a reply, if it is a control instruction, i.e., control instruction is sent to simple visitor Family end;
S65:Control instruction is sent to third party by simple customer end, opens electricity here by serial ports control infrared transmitter Depending on machine.
In the present embodiment, by the interaction of intelligent agent and Cloud Server, speech recognition, semantics recognition can be carried out, complete Voice control TV is controlled, the requirement to client is reduced, reduces the cost of manufacture of remote controler.
About the device in above-described embodiment, wherein modules execute the concrete mode of operation in related this method Embodiment in be described in detail, no detailed explanation will be given here.
It is understood that same or similar part can mutually refer in the various embodiments described above, in some embodiments Unspecified content may refer to the same or similar content in other embodiments.
It should be noted that term " first ", " second " etc. are used for description purposes only in the description of the present application, without It can be interpreted as indication or suggestion relative importance.In addition, in the description of the present application, unless otherwise indicated, the meaning of " multiple " Refer at least two.
Any process described otherwise above or method description are construed as in flow chart or herein, and expression includes It is one or more for realizing specific logical function or process the step of executable instruction code module, segment or portion Point, and the range of the preferred embodiment of the application includes other realization, wherein can not press shown or discussed suitable Sequence, including according to related function by it is basic simultaneously in the way of or in the opposite order, to execute function, this should be by the application Embodiment person of ordinary skill in the field understood.
Those skilled in the art are understood that realize all or part of step that above-described embodiment method carries It suddenly is that relevant hardware can be instructed to complete by program, the program can store in a kind of computer-readable storage medium In matter, which when being executed, includes the steps that one or a combination set of embodiment of the method.
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show The description of example " or " some examples " etc. means specific features, structure, material or spy described in conjunction with this embodiment or example Point is contained at least one embodiment or example of the application.In the present specification, schematic expression of the above terms are not Centainly refer to identical embodiment or example.Moreover, particular features, structures, materials, or characteristics described can be any One or more embodiment or examples in can be combined in any suitable manner.
Although embodiments herein has been shown and described above, it is to be understood that above-described embodiment is example Property, it should not be understood as the limitation to the application, those skilled in the art within the scope of application can be to above-mentioned Embodiment is changed, modifies, replacement and variant.

Claims (9)

1. a kind of voice interactive system, which is characterized in that including:
Multiple simple customer ends;
Intelligent agent;
The intelligent agent is connect with each simple customer end respectively;
The simple customer end is used to receiving and/or playing voice, and voice transfer is carried out between the intelligent agent;
The intelligent agent is used to carry out the relevant treatment that intelligent sound interacts with Cloud Server, and obtains relevant information, will The relevant information is converted to voice signal.
2. system according to claim 1, which is characterized in that between the intelligent agent and each simple customer end It is connected using analog communication links, alternatively, using TCP/IP link connection.
3. system according to claim 1, which is characterized in that each simple customer end is deployed in user local, has been used for At the interaction with user.
4. system according to claim 3, which is characterized in that intelligent agent is deployed in user local, manages multiple simple Client;Alternatively, deployment is beyond the clouds.
5. system according to claim 1, which is characterized in that the simple customer end includes:
Voice acquisition module, for receiving the voice of user;
Voice playing module, for playing voice to user;
Transmission module is used for the received voice transfer of voice acquisition module to intelligent agent, and, it receives intelligent agent and sends Voice, and received voice is sent to voice playing module.
6. system according to claim 5, which is characterized in that the simple customer end further includes such as the next item down or more ?:
Voice coding module is sent out for encoding to the received voice of the voice acquisition module, and by the voice after coding Give the transmission module;
Tone decoding module, for receiving the encoded voice of intelligent agent transmission, after being decoded to encoded voice It is sent to voice playing module;
External control interface for receiving the phonetic order converted by voice, and is transferred to third party.
7. system according to claim 1, which is characterized in that the intelligent sound interaction process that the intelligent agent is used for And the ability that user needs resource is obtained, including the one or more of following item:Speech recognition, speech synthesis, semantic understanding, money Source acquisition, speech detection, the customized processing logic of user.
8. a kind of voice interactive method, which is characterized in that including:
Voice transfer is carried out between simple customer end and intelligent agent, wherein the simple customer end is receiving user's sending Voice when, received voice is sent to the intelligent agent, alternatively, the simple customer end receives intelligent agent hair The voice that send simultaneously plays to user;
The intelligent agent carries out the relevant treatment that intelligent sound interacts with Cloud Server, and obtains relevant information, will be described Relevant information is converted to voice signal;
Wherein, each intelligent agent is connect with multiple simple customer ends.
9. according to the method described in claim 8, it is characterized in that, the relevant treatment further includes:The intelligent agent is by language Sound identification as a result, processing be control instruction, return to simple customer end.
CN201810529253.6A 2018-05-29 2018-05-29 voice interactive system and method Pending CN108831434A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810529253.6A CN108831434A (en) 2018-05-29 2018-05-29 voice interactive system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810529253.6A CN108831434A (en) 2018-05-29 2018-05-29 voice interactive system and method

Publications (1)

Publication Number Publication Date
CN108831434A true CN108831434A (en) 2018-11-16

Family

ID=64146083

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810529253.6A Pending CN108831434A (en) 2018-05-29 2018-05-29 voice interactive system and method

Country Status (1)

Country Link
CN (1) CN108831434A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110136712A (en) * 2019-05-07 2019-08-16 甄十信息科技(上海)有限公司 Realize the method and device of intelligent sound AI function
CN111128193A (en) * 2019-12-27 2020-05-08 科大讯飞股份有限公司 Voice interaction method, network analysis end and client
CN112185394A (en) * 2019-07-02 2021-01-05 北京小鸟听听科技有限公司 Playing method, device and playing system of equipment group
CN112201240A (en) * 2020-09-27 2021-01-08 上汽通用五菱汽车股份有限公司 Vehicle control method, vehicle-mounted screenless device, server and readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1989733A (en) * 2004-05-21 2007-06-27 电缆优势软件有限公司 Remote access system and method and intelligent agent therefor
WO2016127550A1 (en) * 2015-02-13 2016-08-18 百度在线网络技术(北京)有限公司 Method and device for human-machine voice interaction
CN106027806A (en) * 2016-07-25 2016-10-12 乐视控股(北京)有限公司 Smart home device control method and mobile terminal
CN106710594A (en) * 2016-11-17 2017-05-24 北京中科汇联科技股份有限公司 Intelligent speech interaction system based on cloud end
CN107256708A (en) * 2016-12-13 2017-10-17 佛山市小鲜互联电器科技有限公司 A kind of intelligent man-machine interaction system based on refrigerator
CN107966910A (en) * 2017-11-30 2018-04-27 深圳Tcl新技术有限公司 Method of speech processing, intelligent sound box and readable storage medium storing program for executing

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1989733A (en) * 2004-05-21 2007-06-27 电缆优势软件有限公司 Remote access system and method and intelligent agent therefor
CN101272384A (en) * 2004-05-21 2008-09-24 语音移动有限公司 Remote access system and method and intelligent agent therefor
WO2016127550A1 (en) * 2015-02-13 2016-08-18 百度在线网络技术(北京)有限公司 Method and device for human-machine voice interaction
CN106027806A (en) * 2016-07-25 2016-10-12 乐视控股(北京)有限公司 Smart home device control method and mobile terminal
CN106710594A (en) * 2016-11-17 2017-05-24 北京中科汇联科技股份有限公司 Intelligent speech interaction system based on cloud end
CN107256708A (en) * 2016-12-13 2017-10-17 佛山市小鲜互联电器科技有限公司 A kind of intelligent man-machine interaction system based on refrigerator
CN107966910A (en) * 2017-11-30 2018-04-27 深圳Tcl新技术有限公司 Method of speech processing, intelligent sound box and readable storage medium storing program for executing

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110136712A (en) * 2019-05-07 2019-08-16 甄十信息科技(上海)有限公司 Realize the method and device of intelligent sound AI function
CN112185394A (en) * 2019-07-02 2021-01-05 北京小鸟听听科技有限公司 Playing method, device and playing system of equipment group
CN111128193A (en) * 2019-12-27 2020-05-08 科大讯飞股份有限公司 Voice interaction method, network analysis end and client
CN112201240A (en) * 2020-09-27 2021-01-08 上汽通用五菱汽车股份有限公司 Vehicle control method, vehicle-mounted screenless device, server and readable storage medium
CN112201240B (en) * 2020-09-27 2023-03-14 上汽通用五菱汽车股份有限公司 Vehicle control method, vehicle-mounted screenless device, server and readable storage medium

Similar Documents

Publication Publication Date Title
CN108831434A (en) voice interactive system and method
CN108777751A (en) A kind of call center system and its voice interactive method, device and equipment
CN111739553B (en) Conference sound collection, conference record and conference record presentation method and device
CN103081004B (en) For the method and apparatus providing input to voice-enabled application program
US20040267527A1 (en) Voice-to-text reduction for real time IM/chat/SMS
CN109413286A (en) A kind of intelligent customer service voice response system and method
WO2010000161A1 (en) Voice conversation method and apparatus based on instant communication system
CN104883720A (en) Object network access method, object network access guiding and control methods, and corresponding terminals
JP2012137777A (en) Method and apparatus for processing input speech signal during presentation of output audio signal
JP2006528804A (en) Methods, systems, and computer programs to enable telephone users to participate in instant messaging-based meetings (access to extended conferencing services using telechat systems)
CN107767879A (en) Audio conversion method and device based on tone color
CN113784163B (en) Live wheat-connecting method and related equipment
US20100211389A1 (en) System of communication employing both voice and text
KR102549204B1 (en) Device, server and method for providing speech recognition service
CN108922528A (en) Method and apparatus for handling voice
CN105094641A (en) Question-answer interaction method and device based on voice
CN102420897B (en) Mobile phone communication information transmitting method and device
US20050042581A1 (en) Communication service system and method based on open application programming interface for disabled persons
CN108304153A (en) Voice interactive method and device
CN106603669A (en) Control method and system for distributed type main equipment and auxiliary equipment
US20030120492A1 (en) Apparatus and method for communication with reality in virtual environments
TW504921B (en) Ringer for satellite user terminals
CN110225287A (en) Audio-frequency processing method and device
KR102605178B1 (en) Device, method and computer program for generating voice data based on family relationship
US6498834B1 (en) Speech information communication system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181116

RJ01 Rejection of invention patent application after publication