CA2481080A1 - Method and system for detecting and extracting named entities from spontaneous communications

Info

Publication number: CA2481080A1
Application number: CA002481080A
Authority: CA
Inventors: Allen Louis Gorin; Frederic Bechet; Jeremy Huntley Wright; Dilek Z. Hakkani-Tur
Original assignee: Individual
Current assignee: Nuance Communications Inc
Priority date: 2002-04-05
Filing date: 2003-04-07
Publication date: 2003-10-23
Anticipated expiration: 2023-04-07
Also published as: CA2481080C; EP1497751A4; AU2003224846A1; WO2003088080A1; EP1497751A1

Abstract

The invention concerns a method and system for detecting and extracting named entities from spontaneous communications (Fig. 1). The method may include recognizing input communications from a user (150), detecting contextual named entities (160) from the recognized input communications (150), and outputting the contextual named entities to a language understanding unit (170).
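The three steps named in the abstract (recognize, detect, output) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `NamedEntity`, `PATTERNS`, `recognize`, `detect_named_entities` and `process` are hypothetical, the recognizer is a stub that only normalizes text, and simple regular expressions stand in for a named entity language model. The entity record uses the tag, context and value fields mentioned in claim 15.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class NamedEntity:
    tag: str      # entity type, e.g. "PHONE"
    context: str  # text surrounding the entity in the utterance
    value: str    # the matched entity text


# Assumption: one regex per tag stands in for a named entity language model.
PATTERNS = {
    "PHONE": re.compile(r"\b\d{3}[- ]\d{3}[- ]\d{4}\b"),
    "AMOUNT": re.compile(r"\b\d+ dollars\b"),
}


def recognize(utterance: str) -> str:
    """Stub recognizer: the input is already text, so only normalize it."""
    return utterance.lower().strip()


def detect_named_entities(text: str, window: int = 15) -> List[NamedEntity]:
    """Find entities and keep up to `window` characters of context per side."""
    entities = []
    for tag, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            start = max(0, match.start() - window)
            context = text[start:match.end() + window]
            entities.append(NamedEntity(tag, context, match.group()))
    return entities


def process(utterance: str, understand: Callable[[List[NamedEntity]], object]):
    """Recognize, detect, then hand the entities to the understanding unit."""
    text = recognize(utterance)
    entities = detect_named_entities(text)
    return understand(entities)
```

Here `understand` is any callable playing the role of the language understanding unit; for example, `process("I was charged 20 dollars on 555-123-4567", len)` passes the detected entities to `len`.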

Claims

1. A method for processing input communications with a user, comprising:
recognizing input communications from the user;
detecting contextual named entities from the recognized input communications; and
outputting the contextual named entities to a language understanding unit.

2. The method of claim 1, further comprising:
producing a lattice from the recognized communications, wherein the contextual named entities are detected from the lattice.

3. The method of claim 1, wherein the contextual named entities are detected using a named entity language model.

4. The method of claim 1, further comprising:
inserting named entity tags into the detected contextual named entities.

5. The method of claim 1, further comprising:
classifying the input communications according to confidence scores.

6. The method of claim 1, further comprising:
performing a composition function using a named entity language model.

7. The method of claim 6, wherein the composition function is a matching technique.

8. The method of claim 1, further comprising:
determining N-best values for each named entity detected.

9. The method of claim 1, wherein the outputting step outputs N-best named entity values to the language understanding unit.

10. The method of claim 1, wherein the input communications include communications in one or more languages.

11. The method of claim 1, wherein the input communications include at least one of verbal and non-verbal speech.

12. The method of claim 11, wherein the non-verbal speech includes the use of at least one of gestures, body movements, head movements, non-responses, text, keyboard entries, keypad entries, mouse clicks, DTMF codes, pointers, stylus, cable set-top box entries, graphical user interface entries and touchscreen entries.

13. The method of claim 1, wherein the input communications include multimodal speech.

14. The method of claim 1, further comprising:
making processing decisions based on the detected contextual named entities.

15. The method of claim 1, wherein the named entities are represented by at least one of a tag, a context and a value.

16. A system that processes input communication with a user, comprising:
a recognizer that recognizes input communications from the user; and
a named entity detector that detects contextual named entities from the recognized input communication, and outputs the contextual named entities to a language understanding unit.

17. The system of claim 16, wherein the recognizer produces a lattice from the recognized communications, and the named entity detector detects the contextual named entities from the lattice.

18. The system of claim 16, wherein the named entity detector detects the contextual named entities using a named entity language model.

19. The system of claim 16, further comprising:
a named entity tagger that inserts named entity tags into the detected contextual named entities.

a task classification processor that makes task classification decisions based on the detected contextual named entities.

32. The task classification system of claim 31, wherein the task classification processor includes a dialogue manager that conducts dialogue with the user based on the detected named entities.

33. The task classification system of claim 31, wherein the task classification processor includes a language understanding unit that computes a confidence function to determine whether the user's input communication can be classified according to task.

34. The task classification system of claim 33, wherein if the task cannot be classified, the dialogue manager conducts dialogue with the user based on the detected contextual named entities.
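The behavior described in claims 33 and 34 (classify by task when a confidence function allows it, otherwise continue the dialogue using the detected named entities) might be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the threshold value, the function names `classify_task` and `next_action`, and the string-based action format are all hypothetical, and the per-task confidence scores are assumed to be supplied by some upstream classifier.

```python
from typing import Dict, List, Optional

# Assumption: a fixed threshold; the claims do not specify how confidence is used.
THRESHOLD = 0.6


def classify_task(scores: Dict[str, float],
                  threshold: float = THRESHOLD) -> Optional[str]:
    """Return the best-scoring task if it clears the threshold, else None."""
    if not scores:
        return None
    task, confidence = max(scores.items(), key=lambda item: item[1])
    return task if confidence >= threshold else None


def next_action(scores: Dict[str, float], entity_tags: List[str]) -> str:
    """Route when the task is classified; otherwise ask a follow-up
    that is driven by the named entity tags detected so far."""
    task = classify_task(scores)
    if task is not None:
        return f"route:{task}"
    if entity_tags:
        return "clarify:" + ",".join(sorted(entity_tags))
    return "clarify:open"
```

In this sketch a low-confidence utterance that still contained, say, a phone number would lead to a clarifying prompt keyed to that entity, rather than an open-ended re-prompt.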