US20140365206A1 - Method and system for idea spotting in idea-generating social media platforms - Google Patents

Method and system for idea spotting in idea-generating social media platforms Download PDF

Info

Publication number
US20140365206A1
US20140365206A1 US13/911,508 US201313911508A US2014365206A1 US 20140365206 A1 US20140365206 A1 US 20140365206A1 US 201313911508 A US201313911508 A US 201313911508A US 2014365206 A1 US2014365206 A1 US 2014365206A1
Authority
US
United States
Prior art keywords
idea
core
submission
identified
identifying
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/911,508
Inventor
Gregorio Convertino
Agnes Sandor
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Priority to US13/911,534 priority Critical patent/US9400779B2/en
Priority to US13/911,508 priority patent/US20140365206A1/en
Assigned to XEROX CORPORATION reassignment XEROX CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CONVERTINO, GREGORIO, SANDOR, AGNES
Publication of US20140365206A1 publication Critical patent/US20140365206A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06F17/2765
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/174Form filling; Merging
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/10Requirements analysis; Specification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/20Software design
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code

Definitions

  • the exemplary embodiment relates to idea generation and finds particular application in connection with a system and method for identifying the core of an idea submission.
  • Idea-generating social media platforms include Idea Management Systems (IMS's), such as employee and customer suggestion systems, Q&A sites, argumentation systems, eGovernment sites and other similar computer-implemented systems where ideas are shared, discussed, and selected by a community of users. Examples of such platforms are those used by a community for the purpose of innovation.
  • the innovation community may include the employees, partners, or customers of a company or the members of a civic community, city, region, or nation.
  • the ideas submitted may be proposed solutions to an existing problem or proposals for a new item. For example, the innovation community proposes and selects new services or products for an organization with which it is associated.
  • idea submissions in IMSs usually include two parts, a title and an idea description or body.
  • the title intended to be a concise formulation of the core of the idea and the description is intended to include a fuller description of the idea that is summarized in the title.
  • the idea submissions are often submitted by people who are not familiar with the process of idea submissions and thus the title is often not reflective of the core of the idea that is expressed in the description.
  • the titles are often not well formulated, the actual idea cannot be understood on the basis of the title alone.
  • the submitter may draft a title before formulating the description, and thus it may not be reflective of the idea which emerges in the course of writing the description.
  • the title may be too general or too specific to be helpful.
  • the description may contain a sometimes lengthy description of the idea and it may take the recipient of such a submission considerable time to identify the core of the idea hidden within the description. Additionally, some idea submissions may include more than one core idea in the description, while the title may focus on one of them.
  • Opinion mining and opinion detection systems are disclosed, for example, in U.S. Pub. No. 20120245923, published on Sep. 27, 2012, entitled CORPUS-BASED SYSTEM AND METHOD FOR ACQUIRING POLAR ADJECTIVES, by Caroline Brun; U.S. Pub. No. 20130096909, published on Apr. 18, 2013, entitled SYSTEM AND METHOD FOR SUGGESTION MINING, by Caroline Brun et al.; U.S. application Ser. No. 13/400,263, filed on Feb. 20, 2012, entitled SYSTEM AND METHOD FOR PROVIDING RECOMMENDATIONS BASED ON INFORMATION EXTRACTED FROM REVIEWERS' COMMENTS, by Anna Stavrianou, et al; U.S. application Ser.
  • 20040158454 entitled SYSTEM AND METHOD FOR DYNAMICALLY DETERMINING THE ATTITUDE OF AN AUTHOR OF A NATURAL LANGUAGE DOCUMENT, by Livia Polanyi, et al.; Caroline Brun, “Detecting Opinions Using Deep Syntactic Analysis,” Proc. Recent Advances in Natural Language Processing (RANLP), Hissar, Bulgaria (2011); Moghaddam, et al., “Opinion Digger: An Unsupervised Opinion miner from Unstructured Product Reviews,” in Proc. 19 th Conf. on Information and Knowledge Management (CIKM'10), 2010.
  • a method for identifying the core of an idea includes receiving an idea submission which includes a textual description of an idea, natural language processing the textual description of the idea to identify dependencies in at least a part of the textual description, and providing for identifying directive illocutionary acts in the textual description, based on the identified dependencies.
  • the method further includes providing for identifying an idea core of the idea submission, based on an identified directive illocutionary act and outputting information based on the identified idea core. At least one of the natural language processing, identifying directive illocutionary acts, identifying the idea core, and outputting information may be performed with a computer processor.
  • a system for identifying the core of an idea includes a linguistic processing component which natural language processes an idea submission which includes a textual description of an idea to identify dependencies in the textual description.
  • An idea processing component identifies directive illocutionary acts in the textual description, based on the identified dependencies and identifies an idea core of the idea submission, based on an identified directive illocutionary act.
  • An output component outputs information based on the identified core.
  • a processor implements the linguistic processing component, idea processing component, and output component.
  • a method for generating a system for idea processing includes storing a set of terms in non-transitory memory for each of a set of domain dependent concepts and a set of terms for each of a plurality of domain-independent concepts.
  • a plurality of patterns is stored for identifying a core of an idea based on the occurrence of a specified syntactic relation in a sentence of an idea submission description, at least some of the patterns specifying that a term in one of the domain-dependent or domain-independent concepts be a syntactic relation with another term in the comment.
  • the patterns are stored in non-transitory memory for application to sentences of an idea submission description by an idea processing component. At least one of the storing terms and storing patterns may be performed with a computer processor.
  • FIG. 1 is a functional block diagram of an environment in which an idea management system operates in accordance with one aspect of the exemplary embodiment
  • FIG. 2 illustrates a template idea submission form and a completed submission with comments
  • FIG. 3 is a flow chart illustrating a method for identifying the core of an idea submission in accordance with another aspect of the exemplary embodiment
  • FIG. 4 is a flow chart illustrating a method for forming the system of FIG. 1 ;
  • FIG. 5 illustrates a screen shot of a graphical user interface which displays a pop up when the system determines that the comment is concerning additional information
  • FIG. 6 illustrates a message to the author of the idea submission proposing a modification to the idea description
  • FIG. 7 is a screenshot illustrating the automatic highlighting of an identified idea in an idea submission.
  • FIG. 8 is a screenshot illustrating user modifications to the highlighting of an identified idea in an idea submission.
  • aspects of the exemplary embodiment relate to an idea management system which analyses idea submissions, in particular, by identifying the core of the ideas submitted by members of an innovation community.
  • the innovation community proposes and selects new services or products for an organization with which it is associated.
  • the community may include the organization's employees, partners, or customers.
  • other idea-generating social media platforms that have similar idea creation and deliberation functionalities can also be enhanced by the methods and systems proposed herein.
  • FIG. 1 is a block diagram illustrating an environment 10 for supporting sharing information and core identification, in accordance with one embodiment.
  • a data repository 12 stores shared content, such as submitted ideas (suggestions) and optionally also comments on the ideas.
  • the ideas and respective comments are generated by submitters on associated user devices 14 , 16 , etc.
  • An idea management system (IMS) 18 processes the shared content and outputs information based thereon.
  • the idea management system 18 automatically identifies the core of the idea submission, based on textual content of the idea submission, other than the title.
  • the idea management system may also provide an analysis of the comments on the ideas submitted, as described in above-mentioned copending application Ser. No. 13/______ entitled METHOD AND SYSTEM FOR CLASSIFYING REVIEWERS' COMMENTS AND RECOMMENDING RELATED ACTIONS IN IDEA-GENERATING SOCIAL MEDIA PLATFORMS.
  • each user device 14 , 16 , etc. is a Web-enabled device that executes a Web browser and/or email program, which supports interfacing tools and information exchange with a Web server that is linked to the system 18 .
  • a wired or wireless digital data communications network 20 such as the Internet or a local intranet, provides an infrastructure for exchange of digital information.
  • the network 20 provides interconnectivity between the user devices 14 , 16 , data repository 12 , and system 18 . Users can access and upload content, such as idea submissions 22 and comments 24 on the submitted ideas, and the like to the network 20 , which are then stored in the data repository 12 and processed by the system 18 .
  • User devices 14 , 16 can include desktop, laptop, or tablet computers, handheld devices, such as mobile telephones and mobile Internet devices, and other computing devices capable of communicating via the network.
  • the users form an innovation community. Some of the users serve as submitters of ideas. One or more of the users may be tasked with providing comments on submissions of other users, or all users may serve as commenters.
  • the ideas submitted may include suggestions for solving an existing problem, suggestions for modifications to an item, such as a product, service, or company procedure, suggestions for a new item, such as a product service, or company procedure, or the like.
  • the comments 24 may relate to any aspect of the submission.
  • FIG. 1 shows a submitter 26 , who generates an idea submission 22 and a commenter 28 , who provides comments 24 on the submission, i.e., the comments are provided by a different person 28 from the person 26 who generated the idea submission. It is to be appreciated that in an innovation community there may be a large number of submitters and a large number of commenters. In some cases submitters may serve as commenters on the ideas of others, and vice versa.
  • the idea submissions 22 A, 22 B may be submitted in response to a request for submissions on a given topic or may be submitted without any specific request.
  • Submitters 26 may generate their submission by adding information to one or more fields of a template submission form 30 .
  • One of the fields may be a title field 32 , in which the submitter provides a title 34 for the idea, if this has not been decided in advance.
  • Another of the fields may be a suggestion field 36 in which the user enters a textual description 38 of the idea.
  • commenters 28 may add comments to the same form 30 , e.g., the form may include one or more comments field(s) 40 , 42 for one or more commenters to add their textual comments 44 , 46 .
  • commenters may enter their textual comments on a separate comments form.
  • the commenter may also provide a rating 48 of the idea (on a scale of, for example, 1 to 5 stars, where 5 represents a highly positive rating and 1 represents a highly negative rating).
  • the idea management system 18 includes memory 50 which stores instructions 52 for performing the exemplary method and a processor 54 in communication with the memory for executing the instructions.
  • the system 18 may be hosted by a suitable computing device or devices, such as the illustrated server computer 56 .
  • the system communicates with the network 20 via a network interface 58 .
  • Hardware components 50 , 54 , 58 of the system may communicate via a data control bus 60 .
  • system 18 is hosted by a computing device within the organization that requests the idea submissions.
  • processing of the submissions may be performed by an external web service and the results returned to the requesting organization.
  • the host computer 56 may be a PC, such as a desktop, a laptop, palmtop computer, portable digital assistant (PDA), server computer, cellular telephone, tablet computer, pager, combination thereof, or other computing device capable of executing instructions for performing the exemplary method.
  • the memory 50 may represent any type of non-transitory computer readable medium such as random access memory (RAM), read only memory (ROM), magnetic disk or tape, optical disk, flash memory, or holographic memory. In one embodiment, the memory 50 comprises a combination of random access memory and read only memory. In some embodiments, the processor 54 and memory 50 may be combined in a single chip.
  • the network interface 58 allows the computer to communicate with other devices via the computer network 20 , and may comprise a modulator/demodulator (MODEM) a router, a cable, and and/or Ethernet port.
  • MODEM modulator/demodulator
  • Memory 50 stores processed data as well as instructions for performing the exemplary method described below.
  • the digital processor 54 can be variously embodied, such as by a single-core processor, a dual-core processor (or more generally by a multiple-core processor), a digital processor and cooperating math coprocessor, a digital controller, or the like.
  • the digital processor 54 in addition to controlling the operation of the computer 56 , executes instructions stored in memory 50 for performing the method outlined in FIG. 3 .
  • the instructions 52 may include some or all of the following: a natural language linguistic processing component 62 , an idea processing component 64 for identifying a core of the idea, a comment classification component (classifier) 66 , a comment interpretation component 68 , and an information output component 68 .
  • the linguistic processing component 62 may include a natural language parser which processes at least one of the title 34 , the free text description 38 of the suggestion and the free text comments 44 , 46 to identify words and dependencies (syntactic and/or semantic relations) between words of the text.
  • the idea processing component 64 identifies a probable core 80 of the idea from the suggestion 38 and optionally compares this with the title 34 of the idea.
  • the comment classification component 66 classifies the comments based on the processed text and assigns one or more class labels 82 to the comments.
  • the classes may be selected from a predefined set of classes which express the commenter's reaction to the idea.
  • Comment classes may include some or all of the following: a first class for comments conveying additional ideas; a second class for evaluative comments (agree/disagree, positive/ negative, pros and cons), and a third class for comments that concern the idea status within the idea management system.
  • Classes may have subclasses. For example, the evaluative comments class may have a ‘positive’ comments subclass and a ‘negative’ comments subclass, etc.
  • the comment interpretation component 68 may suggest or implement one of two or more different actions. For example, evaluative comments may be turned into votes, and/or comments conveying additional ideas may be appended to the idea description. In one embodiment, the actions are recommended to a reviewer 84 on an associated user interface 86 , e.g., a display device of a client computing device analogous to devices 14 , 16 .
  • the comment interpretation component 68 may include a tool for refining the content, which is called on when the comment is assigned to the first comment class, a tool for assessing the value of the idea, which is called on when the comment is assigned to the second comment class, and a tool for managing the process, when the comment is assigned to the third comment class. Comments classed as unclassified may be ignored.
  • the classification component 66 is rule based, i.e., applies a set of grammar rules to the processed text and depending on the output of the applied rules, determines a class.
  • the rules may include patterns which when met by text of a comment, return a class and/or subclass for the textual content.
  • the classification component employs a probabilistic classifier model (or models) trained to output a class based on the processed text. Such a classifier model may have been trained on a set of training comments that are each manually labeled with a respective one of the set of class labels.
  • the information output component 58 outputs information 60 based on one or more of the identified core, classification of the comments and/or the output of the processing of the comments.
  • the term “software,” as used herein, is intended to encompass any collection or set of instructions executable by a computer or other digital system so as to configure the computer or other digital system to perform the task that is the intent of the software.
  • the term “software” as used herein is intended to encompass such instructions stored in storage medium such as RAM, a hard disk, optical disk, or so forth, and is also intended to encompass so-called “firmware” that is software stored on a ROM or so forth.
  • Such software may be organized in various ways, and may include software components organized as libraries, Internet-based programs stored on a remote server or so forth, source code, interpretive code, object code, directly executable code, and so forth. It is contemplated that the software may invoke system-level code or calls to other software residing on a server or other location to perform certain functions.
  • FIG. 1 is a high level functional block diagram of only a portion of the components which are incorporated into a computer system 18 . Since the configuration and operation of programmable computers are well known, they will not be described further.
  • the method begins at S 100 .
  • idea submissions and/or associated comments on the idea submissions are received by the system.
  • the submissions and/or comments are natural language processed with component 52 .
  • a core of the idea may be identified with the idea processing component 64 .
  • the natural language processed comments may be classified with comment classification component 66 .
  • the classified comments may be processed with interpretation component 68 , and actions to be recommended generated, based on the assigned class.
  • information based on the identified core and/or classification/processing of the comments is output from the system by the output component.
  • the method ends at S 114 .
  • the steps of the method need not all proceed in the order illustrated and fewer, more, or different steps may be performed. Additionally, the identification of the core S 106 and classification/interpretation of comments S 108 /S 110 need not all be performed in the method. For example, in one method, S 108 and S 110 are omitted, the core is identified at S 106 and the information output may be the identified core and/or a proposed title which is based on the identified core. In another method, S 106 is omitted and the information output is based on the classifications of the comments and/or their interpretation.
  • the method illustrated in FIG. 3 may be implemented in a computer program product that may be executed on a computer.
  • the computer program product may comprise a non-transitory computer-readable recording medium on which a control program is recorded (stored), such as a disk, hard drive, or the like.
  • a non-transitory computer-readable recording medium such as a disk, hard drive, or the like.
  • Common forms of non-transitory computer-readable media include, for example, floppy disks, flexible disks, hard disks, magnetic tape, or any other magnetic storage medium, CD-ROM, DVD, or any other optical medium, a RAM, a PROM, an EPROM, a FLASH-EPROM, or other memory chip or cartridge, or any other non-transitory medium from which a computer can read and use.
  • the computer program product may be integral with the computer 56 , (for example, an internal hard drive of RAM), or may be separate (for example, an external hard drive operatively connected with the computer 56 ), or may be separate and accessed via a digital data network such as a local area network (LAN) or the Internet (for example, as a redundant array of inexpensive of independent disks (RAID) or other network server storage that is indirectly accessed by the computer 56 , via a digital network).
  • LAN local area network
  • RAID redundant array of inexpensive of independent disks
  • the method may be implemented in transitory media, such as a transmittable carrier wave in which the control program is embodied as a data signal using transmission media, such as acoustic or light waves, such as those generated during radio wave and infrared data communications, and the like.
  • transitory media such as a transmittable carrier wave
  • the control program is embodied as a data signal using transmission media, such as acoustic or light waves, such as those generated during radio wave and infrared data communications, and the like.
  • the exemplary method may be implemented on one or more general purpose computers, special purpose computer(s), a programmed microprocessor or microcontroller and peripheral integrated circuit elements, an ASIC or other integrated circuit, a digital signal processor, a hardwired electronic or logic circuit such as a discrete element circuit, a programmable logic device such as a PLD, PLA, FPGA, Graphical card CPU (GPU), or PAL, or the like.
  • any device capable of implementing a finite state machine that is in turn capable of implementing the flowchart shown in FIG. 3 , can be used to implement the idea management method.
  • the steps of the method may all be computer implemented, in some embodiments one or more of the steps may be at least partially performed manually.
  • FIG. 4 illustrates a method for generating the system for identifying the core of an idea and/or for classifying comments. The method begins at S 200 .
  • a set of terms is stored in memory 50 for each of a set of domain dependent concepts and a set of terms is stored for each of a plurality of domain-independent concepts.
  • one or more patterns is generated for assigning a textual comment on an idea submission to the class based on the occurrence of a specified syntactic relation in the comment. At least some of the patterns specify that a term in one of the domain-dependent or domain-independent concepts is a syntactic relation with another term in the comment for the pattern to fire on the comment.
  • the patterns are stored in memory 50 for application to textual comments by the comment classification component 66 .
  • a classifier model is trained on a set of manually labeled sample comments (each label corresponding to one of the comment classes). Any suitable machine learning method can be used, such as support vector machines, linear regression, or the like.
  • the classifier model is stored in memory 50 at S 206 , for applying by the comment classification component.
  • rules may be generated and stored in memory for interpretation of some or all of the comments, which are dependent on the class which is assigned to the comment.
  • a linguistic processing component, output component, processor, and any other components of the system that are needed may be provided.
  • the system may be tested on a set of comments/idea submissions to determine if the respective patterns/vocabulary are providing a desired level of precision, and if not, additional and/or different patterns and/or concept terms may be developed or patterns modified and the system reevaluated.
  • the method ends at S 218 .
  • While part of the method of FIG. 4 may be performed manually, part may be computer implemented, as for the method of FIG. 3 , such as the storing terms and storing patterns and/or classifier model learning.
  • the method need not include those steps which are related to identifying the core for a system which identifies the core but does not classify comments, and vice versa in the case of a system for classifying comments which does not identify the idea core.
  • the idea submissions may be requested by someone in the organization, e.g., in the case where the submitters are employees.
  • the request may be limited to a subset of the employees, such as the employees in a particular department.
  • the entire workforce may be asked to provide their suggestions.
  • a facilitator may communicate a request to employees for their ideas about a new product, process, or service that the company is considering introducing.
  • the request may be limited to a particular product or service or may be open to any idea that the employees may want to propose.
  • customers When customers are the submitters, they may be invited to propose ideas for new products or services or for ideas on existing products and services. In some cases, the customers may be limited to those who have purchased a product or service on which they are being asked for suggestions. In other embodiments, they may be contacted through the company website, in stores where the company's goods/services are sold, or through other marketing channels. In the case of a Q&A website, provision is made so that one user may submit a description of a problem and other users may provide answers (submissions) and in some cases, comments on those answers.
  • the number of commenters who provide suggestions may be up to the number of other members of the community. In other embodiments, only a limited subset of the community is permitted to make comments.
  • submissions and/or comments may be provided in one or more different formats, such as wiki pages, posts, emails, text messages, instant messages, tweets, voice messages, faxes, or the like.
  • a plurality of commenters may provide their comments on the same idea suggestion using electronic form 30 .
  • the linguistic processing component 62 processes the text of the submission and/or comments.
  • the linguistic processing component 62 may be based on a general syntactic dependency parser with additional grammar rules added to identify the patterns useful herein.
  • the parser annotates the text strings of the document with tags (labels) which correspond to grammar rules, such as lexical rules and syntactic and/or semantic dependency rules, such as SUBJ (a dependency between the subject of the sentence and the predicate verb) and OBJ (a dependency between the object of the sentence and the predicate verb).
  • the lexical rules define features of terms such as words and multi-word expressions.
  • Syntactic rules describe the grammatical relationships between the words, such as subject-verb, object-verb relationships.
  • Semantic rules include rules for extracting semantic relations such as co-reference links. The application of the rules may proceed incrementally, with the option to return to an earlier rule when further information is acquired.
  • the labels applied by the parser may be in the form of tags, e.g., XML tags, metadata, log files, or the like.
  • the syntactic analysis may include the construction of a set of syntactic relations from an input text by application of a set of parser rules.
  • Exemplary methods are developed from dependency grammars, as described, for example, in Mel' ⁇ hacek over (c) ⁇ uk I., “Dependency Syntax,” State University of New York, Albany (1988) and in Tesberger L., “Elements de Syntaxe Structurale” (1959) Klincksiek Eds. (Corrected edition, Paris 1969).
  • An exemplary parser is the Xerox Incremental Parser (XIP).
  • the linguistic processing component may apply rules which identify and tag domain-dependent and domain-independent concepts and apply a set of patterns which specify semantic relations involving these concepts.
  • a set of patterns adapted to core analysis or comment classification is employed for identifying syntactic and/or semantic relations (dependencies) between particular grammatical structures and/or morpho-syntactic forms and words representing particular domain-independent or domain-dependent concepts. Exemplary patterns are described in more detail below.
  • the domain-dependent concepts may be customized for each application and may be stored in a vocabulary 90 , such as lexicon.
  • each concept may be associated with a set of terms which represent it. Each term can be a word or phrase.
  • One domain-dependent concept may relate to the organization, such as a company, to which the suggestions are addressed.
  • the name of the company may be used as the title of the concept and the terms representing it may include some or all of: variants of its name, its divisions, its products, its employees, and its activities.
  • terms in the domain-specific lexicon 90 are identified in the text being processed, they are labeled with the appropriate concept label (e.g., COMPANY or PRODUCT).
  • Domain-independent concepts may also be stored in the lexicon 90 , or a separate lexicon, which relate to the task being performed by the IMS.
  • the domain-independent concepts may include a PERFORMATIVE concept, which includes terms often verbs, which are often found in combination with an action to be performed, and an IDEA concept which is associated with terms relating to an idea assertion.
  • Examples of the PERFORMATIVE concept may include verbs such as suggest, propose, and recommend.
  • Examples of the IDEA concept include terms such as idea, suggestion, proposition, proposal, and the like.
  • domain-independent concepts may include some or all of the following concepts and terms:
  • Some of these concepts may also be used for identifying the idea core.
  • the creation of the sets of concept terms can be performed manually or semi-automatically, for example using a glossary of terms used within the company for the ORGANIZATION concept. Manually selected terms can be supplemented with synonyms from a thesaurus. Lexical items (individual words) may also be stored.
  • the example patterns described below include those which identify a semantic relation, i.e., a specified relationship between two text elements in a sentence (such as SUBJ or OBJ) where at least one of the text elements is tagged with one of the concepts.
  • Each text element can include one or more words.
  • Other useful patterns identify syntactic relations in which specific verb forms are used.
  • the identification of the core of an idea involves automatically detecting the idea core(s) within idea descriptions.
  • only a part of the description is analyzed for identifying the core of the idea, such as the first one or two sentences.
  • the entire description is analyzed.
  • the sentences are analyzed in turn until one of the sentences is determined to include a core.
  • idea spotting can be based on analyzing idea descriptions 38 in terms of speech act theory (first developed in John Langshaw Austin: How to Do Things With Words. Cambridge Mass. (1962).
  • speech act theory speakers perform illocutionary acts by utterances using special linguistic structures. Conveying ideas is performed by directive speech acts, which are defined as acts that cause the hearer to take a particular action.
  • a “linguistic structure” refers generally to a word or phrase and the linguistic tags that have been applied by the parser, such as part of speech, gender, tense, etc.
  • Some linguistic structures are ambiguous with respect to the illocutionary act that they convey, i.e., the same structure can convey different illocutionary acts depending on the communicative situation. For example a question, such as “Could the new printer include a scanner?” may be an inquiry, a request or a suggestion. In the case of detecting the core(s) of an idea, such ambiguity is resolved by the predefined communicative situation of the IMS as a space for proposing ideas. As a consequence, once a particular linguistic structure is identified in the context of the participants as conveying a directive speech-act, it is considered that the identified linguistic structure conveys a suggestion to be acted on, i.e., a core of the idea of an idea submission.
  • patterns are structured (modeled) to identify grammatical indicators of directive speech-acts in text, in particular, to identify when someone wants someone else to do something, such as in the form of a request, order, proposal, assertion, or question.
  • detecting the core of an idea includes detecting sentences that use particular linguistic patterns for expressing directive illocutionary acts in the context of the participants of the illocutionary act, i.e., the idea owner and/or the organization.
  • Some of the patterns require a dependency with a domain-specific word, while others identify a dependency in which “I” or “me” is the subject.
  • Imperative in which a verb beginning a sentence or clause has a domain-dependent concept (e.g., COMPANY) as its object.
  • a domain-dependent concept e.g., COMPANY
  • conditional pronoun expression e.g., conditional verb in a dependency with a pronoun
  • a domain-dependent concept e.g., COMPANY
  • the system may employ two, three four, five or all of these exemplary patterns.
  • These linguistic structures can be encoded by rules in rule-based NLP systems such as in the form of additional rules on top of a parser as described above.
  • training examples can be annotated for training a machine learning system.
  • idea descriptions which may have undergone some initial linguistic processing, have their core idea(s) manually labeled.
  • SVM support vector machines
  • linear regression linear regression
  • the context of the idea-owner can be indicated by any expression where the author of the idea refers to herself or himself (e.g., the pronoun “I”, “my”, “we”, “ours”, “our company”, etc.).
  • the context of the organization can be indicated by the organization-specific domain-dependent vocabulary 90 .
  • This vocabulary can be obtained by various methods such as calculating term frequency or using glossaries.
  • An idea core is detected if the words conveying the linguistic pattern and the domain vocabulary are in dependency relationships, like the underlined words in the sentences above.
  • more than one idea core may be detected in a single idea submission, in which case, one, such as the first, may be adopted as the idea core, or the user may be asked to select one.
  • candidate cores may be ranked, with the ranking being based on the order in which the cores were identified in the description and/or on the type of pattern which identified them. In one embodiment, there may be a predefined precedence established among the patterns.
  • the first linguistic pattern matched in the description which is of the performative type or insertion type (direct expressions) may be used as the (main) core, with other identified cores optionally being considered as supplemental core(s). If there is no core identified which is of a direct expression type, then a first idea core identified with one of the other patterns may be selected.
  • the idea core can be the entire sentence identified as containing the core, or may be less than a sentence, such as the clause of the sentence on which the pattern fired.
  • the idea processing component 64 may output a decision, such as a binary decision, on whether or not the sentence includes a core, based on whether any of the patterns (or classifier) fired on the sentence.
  • the exemplary system provides automatic linguistic analysis of idea descriptions 38 and free-text comments 44 , 46 .
  • the automatic analysis supports both facilitators and content-providing users.
  • the comment classifier 54 automatically labels comments according to their type of reaction to the idea. This facilitates using the classified comments for various analytical purposes and for enhancing the effectiveness of the idea management system 18 .
  • the classified comments are each interpreted, when applicable, as the action that the comment represents, and its execution may be supported by the user interface 86 of the system 18 . This approach is consistent with the language-action perspective (Winograd, T., “A language/action perspective on the design of cooperative work,” Human Computer Interaction, 3(1) 3-30(1987)).
  • this class allows the facilitators of the IMS 18 to enhance the idea submission with additional propositions put forth in the comments, or to categorize the content of the idea submission as existing, e.g., it refers to an idea, product, or service that already exists in the organization (based on the comment).
  • a rating may be computed based the comments, e.g., as positive neutral, or negative.
  • the computed rating is compared with an actual rating of the idea given by the commenter, e.g., the commenter may be requested/permitted to provide a rating 48 of the idea submission. If the computed and actual ratings agree, the actual rating may be labeled as reliable. If the computed rating and commenter's actual rating do not agree, the actual rating may be modified or flagged as unreliable. For example, in FIG.
  • Meta-reaction reaction relative to the idea generation workflow: this category can help provide information about the commenter's reactions regarding the idea status within the IMS workflow.
  • each reaction type is defined in terms of a set of linguistic patterns or rules, each rule including linguistic structures and/or associated lexical items/expressions.
  • the linguistic structures mainly convey the comment classes, whereas the lexical items mostly refer to the topic of the comment.
  • the list is definitely not complete but in our experiment, where we tested our proof-of-concept system, the proposed list accounted for a large part of all the reactions.
  • Each reaction type can be further categorized into fine-grained classes (categories) that depend on particular systems.
  • the fine-grained categories in the list apply to a given organization's IMS. The following notation is used in the descriptions:
  • the words in capital letters represent CONCEPTS that are instantiated by lists of domain-specific and general vocabulary words. Whenever two concepts are mentioned, the words that instantiate them should be in a syntactic dependency relation. The words in italics are lexical items. The content underlined is conveyed by linguistic structures (tense, mood, voice, etc.).
  • This class of comments identifies the commenter's view of the content of idea (e.g., whether it already exists or is similar to an existing idea, or whether it could be changed, e.g., amplified) and can include the following subclasses:
  • IDEA/PRODUCT in which the concept IDEA, PRODUCT, or COMPANY (or other ORGANIZATION concept) is the subject of the verb “exist.”
  • a pattern can be of the form:
  • SIMILAR IDEA an IDEA is similar to something existing.
  • a pattern can be of the form:
  • IDEA/COMPANY needs: in which IDEA or COMPANY is in any syntactic relationship with a term from the domain-independent concept LACK. It often suggests something missing from the idea submission, or company, or that the idea submission is not understood. For example:
  • I SUGGEST my SUGGESTION: in which the first person pronoun is in any syntactic relationship with a term from the domain-independent concept SUGGESTION. For example:
  • This class can include different types of evaluative comments.
  • a sentiment vocabulary classes certain words as being positive or negative and may assign a value on a scale of positive/negative. For example, fabulous, may be classed more positive than nice.
  • a set of positive terms in the sentiment vocabulary is collected into a domain independent concept GOOD and optionally a set of negative terms is collected in a domain independent concept BAD.
  • the company may only be interested in judgments that are supportive of the idea (e.g., using the concepts GOOD, AGREE), and thus negative judgments may be ignored (or vice versa).
  • Words of negation usually reverse the polarity, for example, not very good, is classed as negative.
  • Example judgment subclasses may be as follows, although more or less refined subclasses are also considered, such as simply positive and negative.
  • AGREE adverb beginning the sentence.
  • An adverb in the domain-independent concept AGREE is present at or near the beginning of the sentence. For example:
  • the comment includes an expression from a lexicon of specific expressions that are recognized as connoting a positive attitude, e.g., Sounds good/makes sense.
  • the pattern fires when the specific words themselves are used. For example:
  • BAD IDEA A word in the domain independent concept BAD precedes a word in the domain independent concept IDEA. For example:
  • the comment concerns the management of the process for implementing a product/process that is the focus of the idea (e.g., “Rating—Hold until 2010”). Depending on the stage at which the process is currently, the idea may need to wait to be implemented, could be implemented on a trial basis, or could be ready for implementing now.
  • a pattern identifies when a term in the domain-specific concept PROCESS is in the sentence.
  • the concept PROCESS is a vocabulary of company-specific processes for ideas.
  • the comment classification component 66 can be implemented by applying pattern-matching rules. Alternatively a collection of training examples can be used as annotated examples for training a machine-learning system.
  • Comment sentences which satisfy one of the patterns are labeled with the class corresponding to that pattern, and optionally, more fine-grained tags corresponding to the particular pattern(s) that fired.
  • comment classes are described by way of example, it is to be appreciated that fewer or more comment classes may be employed for identifying different types of comments.
  • the number of different comment classes may depend, in part on the available interpretation methods.
  • the system may employ two, three four, five, or more, or all of the exemplary patterns described above, such as at least one for each class.
  • the comment classification may have a specific goal within the community, which depends on the type of innovation community. For example, a community of employees typically deliberates on innovative projects that the company should invest in, while a customer community typically deliberates on the next line of products that they want.
  • the set of fine-grained reaction types may be modified to be appropriate to the particular community goal.
  • one aspect of the comment classification is to identify prior art, if there is any, in the idea content.
  • patterns for identifying existing products/services may not be needed.
  • two of the three main reaction types, “Reaction to the content of the idea” and “Expression of the commenter's judgment of the idea” are largely system independent.
  • the “Meta-reaction” type it may be present whenever formal idea-management steps are defined in the system.
  • the set of fine-grained classes may be customizable for the specific community, since the reaction types depend on the community goals. Thus, some comment classes/subclasses can change. That is, as a new corpus is given, the classes and list of reactions can be adapted based on the goal and content of this system.
  • the classification of the comments, as reactions to ideas, into three classes, as described herein, has several benefits.
  • content generation and management tools e.g., wikis, document editors, tagging tools
  • the IMS may include tools for judging the value of ideas (e.g., tools for voting, deliberation, sentiment analysis)
  • the IMS may
  • the classification of three basic comment types logically maps onto three basic actions types that an idea-generating social media platform can recommend. For example, detecting the reaction to the idea content leads to recommending actions aimed at content generation and management. Detecting the commenter's attitude towards the idea value leads to recommending actions aimed at voting and deliberating on what idea to select. Detecting reaction to the process leads to recommending actions aimed at managing the idea state or process with respect to a given workflow.
  • ADDITIONAL INFORMATION aims at refining the idea by adding new content or a clarification.
  • PRIOR ART aims at appending a reference to prior work or ideas to the content of the current idea. Accordingly, after classifying each comment, the interpretation component 68 may recommend, to the user, e.g., commenter 28 or reviewer 84 , the inception or full execution of a corresponding action.
  • FIG. 5 illustrates a screenshot of an exemplary user interface 92 , which may be displayed on the commenter's display device.
  • the comment classifier has classified the comment as a case of an ADDITIONAL INFORMATION (ADDID).
  • ADDID ADDITIONAL INFORMATION
  • the commenter submits the comment to the idea “Automation of Creating . . . ”
  • the comment is classified by the classification component as an ADDID.
  • the interpretation component 68 then asks the commenter via a dialog box 94 if the user would like to execute the action of appending the content of the comment to the description of that idea. If the user clicks the button “Yes, append”, then the system may let the user choose the point of the description where the comment should be appended.
  • the illustrated dialog box is an embodiment of the action recommendation with which the user interacts by accepting or rejecting it.
  • the system sends a request 96 ( FIG. 6 ) to the author of the idea (and/or moderator) to either accept or reject the edit or addition proposed.
  • a request 96 FIG. 6
  • the author of the idea and/or moderator
  • Various ways for requesting confirmations of the proposed change can be implemented, e.g., through a version manager such as in the github.com code management system.
  • a similar procedure can be implemented for a comment classified as PRIOR ART: if related ideas are mentioned, the system can suggest that the two ideas are related. This is another embodiment of action recommendation, which the user can accept or reject.
  • the commenter may be given the option of giving a positive vote (e.g., a thumbs up or a positive value on a rating sale) to the idea and for NEGATIVE ATTITUDE, a negative vote (e.g., a thumbs down or negative value on a rating sale).
  • a positive vote e.g., a thumbs up or a positive value on a rating sale
  • a negative vote e.g., a thumbs down or negative value on a rating sale
  • the vote may automatically be generated.
  • the vote is added to the votes previously cast by other commenters and may be displayed in a voting region the interface, as illustrated at 98 .
  • the voting information may be forwarded to a person/department associated with evaluating the comments.
  • each of a plurality of the PROCESS terms may be associated with a respective specific action, for example, comments including the PROCESS term “implement now” may be forwarded to a department tasked with the process of implementing the product or service.
  • the information output may be an assigned comment class for each comment (where none of the three example classes is assigned, the comment may be assigned to an “unclassified” class).
  • the information output may be a request to append, or automatically appended comment or part thereof, in the case of a comment assigned to the first class, or a vote/rank or request for a vote/rank, or other scoring metric, in the case of a comment assigned to the second class.
  • the information output may be the number or proportion of the comments assigned to each class.
  • a reviewer may request to see all submission in a given one or more of the classes/subclasses, and this information may be output.
  • the output component 70 may include an interactive visualization component that generates a Graphical User Interface (GUI) 100 for display to a user on a client device, as illustrated, for example, in FIGS. 7 and 8 .
  • GUI Graphical User Interface
  • the GUI visualizes the identified idea core(s) 80 , allows the user to validate them, and the validated cores are then visualized as validated for future use (with a stored history of validation).
  • the GUI may allow the user to modify, remove, and/or add a new core(s) manually.
  • the user interacts with the GUI using a user input device of the client device 14 , 16 , 86 , such as a keyboard, keypad, touch screen, cursor control device, or combination thereof.
  • a user input device of the client device 14 , 16 , 86 such as a keyboard, keypad, touch screen, cursor control device, or combination thereof.
  • the automatically identified core(s) may be highlighted in the description, as shown in FIG. 7 .
  • the user can click on an “accept” 102 or a “decline” 104 actuable area of the screen, or employ other suitable selection mechanism.
  • the user has declined the highlighted entry, allowing the user to select a new idea core or to modify it.
  • FIG. 8 the user has opted to modify the identified core by including additional text of the idea description.
  • a highlighting pen may be automatically displayed, as shown at 106 , for the user to make the text selection.
  • the exemplary system and method are particularly useful in organizations interested in pursuing open innovation independently or collaboratively, where a large number of ideas and comments may be generated by employees, customers, or the like.
  • a prototype system was developed for detecting ideas in an Idea Management System for a company operating a number of coffee shops.
  • the linguistic patterns for the idea processing component 64 were encoded in XIP rules, and the domain-specific vocabulary was constructed manually, based on a development corpus.
  • the development corpus consisted of 681 idea postings by customers.
  • About 40 rules were manually generated using 50 words of domain-specific vocabulary (including the name of the company, its abbreviated forms, and the like). When these rules were on 21,000 postings the system detected 15,000 idea cores.
  • the performance was manually evaluated on 50 postings, and achieved 75% recall and 93% precision.
  • An idea classifier was developed for a company IMS that is used by employees. 281 comments submitted in connection with idea submissions were analyzed first. For this evaluation, the submitted ideas were ignored. From this initial review, it was found that the comment classes, as listed below, were expressed in the first sentence of the comment in 88% of the comments. Out of the remaining 12% of comments, in 8%, no comment type could be identified in the entire comment text and in 4%, the comment type was expressed later in the text. Based on this analysis, it was decided to analyze the first sentences only. While this approach means that the method misses 4% of the comment types, there is a gain in precision and relevance if only the first sentence is considered.
  • PROS-CONS 4%
  • Some of the comments classified as “unclassified” may, in fact be noise, such as “thankyou,” or “hello”, so the unclassified comments can be filtered out. Further patterns may decrease the percentage of unclassified comments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

A computer-implemented system and method provide for identifying the core of an idea. The method includes receiving an idea submission which includes a textual description of an idea. The textual description of the idea is natural language processed to identify dependencies (syntactic and/or semantic relations between text elements) in at least a part of the textual description. Provision is made for identifying directive illocutionary acts in the textual description, based on the identified dependencies. The method further includes providing for identifying an idea core of the idea submission, based on an identified directive illocutionary act, where present, and outputting information based on the identified idea core.

Description

    CROSS REFERENCE TO RELATED PATENTS AND APPLICATIONS
  • Cross-reference is made to U.S. application Ser. No. 13/______, filed contemporaneously herewith, entitled METHOD AND SYSTEM FOR CLASSIFYING REVIEWERS' COMMENTS AND RECOMMENDING RELATED ACTIONS IN IDEA-GENERATING SOCIAL MEDIA PLATFORMS, by Gregorio Convertino and Agnes Sandor, the disclosure of which is incorporated herein by reference in its entirety.
  • BACKGROUND
  • The exemplary embodiment relates to idea generation and finds particular application in connection with a system and method for identifying the core of an idea submission.
  • Idea-generating social media platforms include Idea Management Systems (IMS's), such as employee and customer suggestion systems, Q&A sites, argumentation systems, eGovernment sites and other similar computer-implemented systems where ideas are shared, discussed, and selected by a community of users. Examples of such platforms are those used by a community for the purpose of innovation. The innovation community may include the employees, partners, or customers of a company or the members of a civic community, city, region, or nation. The ideas submitted may be proposed solutions to an existing problem or proposals for a new item. For example, the innovation community proposes and selects new services or products for an organization with which it is associated.
  • Idea submissions in IMSs usually include two parts, a title and an idea description or body. The title intended to be a concise formulation of the core of the idea and the description is intended to include a fuller description of the idea that is summarized in the title. However, the idea submissions are often submitted by people who are not familiar with the process of idea submissions and thus the title is often not reflective of the core of the idea that is expressed in the description. When the titles are often not well formulated, the actual idea cannot be understood on the basis of the title alone. In some cases, the submitter may draft a title before formulating the description, and thus it may not be reflective of the idea which emerges in the course of writing the description. In other cases, the title may be too general or too specific to be helpful. The description may contain a sometimes lengthy description of the idea and it may take the recipient of such a submission considerable time to identify the core of the idea hidden within the description. Additionally, some idea submissions may include more than one core idea in the description, while the title may focus on one of them.
  • There remains a need for a system and method for identifying the core of an idea expressed in an idea submission.
  • INCORPORATION BY REFERENCE
  • The following references, the disclosures of which are incorporated by reference in their entireties, are mentioned:
  • Information sharing systems are disclosed, for example, in U.S. Pub. No. 20120117115, published May 10, 2012, entitled SYSTEM AND METHOD FOR SUPPORTING TARGETED SHARING AND EARLY CURATION OF INFORMATION, by Gregorio Convertino, et al.; U.S. Pub. No. 20120117484, published May 10, 2012, entitled SYSTEM AND METHOD FOR PROVIDING MIXED-INITIATIVE CURATION OF INFORMATION WITHIN A SHARED REPOSITORY, by Gregorio Convertino, et al.; U.S. Pub. No. 20120129145, published May 24, 2012, entitled SYSTEM FOR FOSTERING INNOVATION AMONG A GROUP OF USERS, by William Miller, et al.; U.S. Pub. No. 20110093539, published Apr. 21, 2011, entitled SYSTEM AND METHOD FOR INNOVATION AND IDEA MANAGEMENT, by Andre Laurin, et al.; U.S. Pub. No. 20050044135, published Feb. 24, 2005, entitled METHOD FOR MANAGING AND PROVIDING AN IDEA MANAGEMENT SYSTEM, by Norbert Klausnitzer; U.S. Pub. No. 20020107722, published Aug. 8, 2002, entitled IDEA MANAGEMENT, by Andre Laurin, et al.; U.S. Pub. No. 20030036947, published Feb. 20, 2003, entitled SYSTEMS AND METHODS FOR SUBMISSION, DEVELOPMENT AND EVALUATION OF IDEAS IN AN ORGANIZATION, by William E. Smith, III, et al.; U.S. Pat. No. 6,961,756, to Dilsaver; and U.S. application Ser. No. 13/300,467, filed Nov. 18, 2011, entitled SYSTEM AND METHOD FOR MANAGEMENT AND DELIBERATION OF IDEA GROUPS, by Gregorio Convertino, et al.
  • Opinion mining and opinion detection systems are disclosed, for example, in U.S. Pub. No. 20120245923, published on Sep. 27, 2012, entitled CORPUS-BASED SYSTEM AND METHOD FOR ACQUIRING POLAR ADJECTIVES, by Caroline Brun; U.S. Pub. No. 20130096909, published on Apr. 18, 2013, entitled SYSTEM AND METHOD FOR SUGGESTION MINING, by Caroline Brun et al.; U.S. application Ser. No. 13/400,263, filed on Feb. 20, 2012, entitled SYSTEM AND METHOD FOR PROVIDING RECOMMENDATIONS BASED ON INFORMATION EXTRACTED FROM REVIEWERS' COMMENTS, by Anna Stavrianou, et al; U.S. application Ser. No. 13/600,329, filed on Aug. 31, 2012, entitled LEARNING OPINION-RELATED PATTERNS FOR CONTEXTUAL AND DOMAIN-DEPENDENT OPINION DETECTION, by Caroline Brun; U.S. Pub. No. 20090265304, published Oct. 22, 2009, entitled METHOD AND SYSTEM FOR RETRIEVING STATEMENTS OF INFORMATION SOURCES AND ASSOCIATING A FACTUALITY ASSESSMENT TO THE STATEMENTS by Aït-Mokhtar, et al., and U.S. Pub. No. 20040158454, entitled SYSTEM AND METHOD FOR DYNAMICALLY DETERMINING THE ATTITUDE OF AN AUTHOR OF A NATURAL LANGUAGE DOCUMENT, by Livia Polanyi, et al.; Caroline Brun, “Detecting Opinions Using Deep Syntactic Analysis,” Proc. Recent Advances in Natural Language Processing (RANLP), Hissar, Bulgaria (2011); Moghaddam, et al., “Opinion Digger: An Unsupervised Opinion miner from Unstructured Product Reviews,” in Proc. 19th Conf. on Information and Knowledge Management (CIKM'10), 2010.
  • BRIEF DESCRIPTION
  • In accordance with one aspect of the exemplary embodiment, a method for identifying the core of an idea includes receiving an idea submission which includes a textual description of an idea, natural language processing the textual description of the idea to identify dependencies in at least a part of the textual description, and providing for identifying directive illocutionary acts in the textual description, based on the identified dependencies. The method further includes providing for identifying an idea core of the idea submission, based on an identified directive illocutionary act and outputting information based on the identified idea core. At least one of the natural language processing, identifying directive illocutionary acts, identifying the idea core, and outputting information may be performed with a computer processor.
  • In accordance with another aspect of the exemplary embodiment, a system for identifying the core of an idea includes a linguistic processing component which natural language processes an idea submission which includes a textual description of an idea to identify dependencies in the textual description. An idea processing component identifies directive illocutionary acts in the textual description, based on the identified dependencies and identifies an idea core of the idea submission, based on an identified directive illocutionary act. An output component outputs information based on the identified core. A processor implements the linguistic processing component, idea processing component, and output component.
  • In accordance with another aspect of the exemplary embodiment, a method for generating a system for idea processing includes storing a set of terms in non-transitory memory for each of a set of domain dependent concepts and a set of terms for each of a plurality of domain-independent concepts. A plurality of patterns is stored for identifying a core of an idea based on the occurrence of a specified syntactic relation in a sentence of an idea submission description, at least some of the patterns specifying that a term in one of the domain-dependent or domain-independent concepts be a syntactic relation with another term in the comment. The patterns are stored in non-transitory memory for application to sentences of an idea submission description by an idea processing component. At least one of the storing terms and storing patterns may be performed with a computer processor.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a functional block diagram of an environment in which an idea management system operates in accordance with one aspect of the exemplary embodiment;
  • FIG. 2 illustrates a template idea submission form and a completed submission with comments;
  • FIG. 3 is a flow chart illustrating a method for identifying the core of an idea submission in accordance with another aspect of the exemplary embodiment;
  • FIG. 4 is a flow chart illustrating a method for forming the system of FIG. 1;
  • FIG. 5 illustrates a screen shot of a graphical user interface which displays a pop up when the system determines that the comment is concerning additional information;
  • FIG. 6 illustrates a message to the author of the idea submission proposing a modification to the idea description;
  • FIG. 7 is a screenshot illustrating the automatic highlighting of an identified idea in an idea submission; and
  • FIG. 8 is a screenshot illustrating user modifications to the highlighting of an identified idea in an idea submission.
  • DETAILED DESCRIPTION
  • Aspects of the exemplary embodiment relate to an idea management system which analyses idea submissions, in particular, by identifying the core of the ideas submitted by members of an innovation community. In some cases, the innovation community proposes and selects new services or products for an organization with which it is associated. The community may include the organization's employees, partners, or customers. However, other idea-generating social media platforms that have similar idea creation and deliberation functionalities can also be enhanced by the methods and systems proposed herein.
  • FIG. 1 is a block diagram illustrating an environment 10 for supporting sharing information and core identification, in accordance with one embodiment. A data repository 12 stores shared content, such as submitted ideas (suggestions) and optionally also comments on the ideas. The ideas and respective comments are generated by submitters on associated user devices 14, 16, etc. An idea management system (IMS) 18 processes the shared content and outputs information based thereon. In particular, the idea management system 18 automatically identifies the core of the idea submission, based on textual content of the idea submission, other than the title. The idea management system may also provide an analysis of the comments on the ideas submitted, as described in above-mentioned copending application Ser. No. 13/______ entitled METHOD AND SYSTEM FOR CLASSIFYING REVIEWERS' COMMENTS AND RECOMMENDING RELATED ACTIONS IN IDEA-GENERATING SOCIAL MEDIA PLATFORMS.
  • In an exemplary embodiment, each user device 14, 16, etc. is a Web-enabled device that executes a Web browser and/or email program, which supports interfacing tools and information exchange with a Web server that is linked to the system 18. A wired or wireless digital data communications network 20, such as the Internet or a local intranet, provides an infrastructure for exchange of digital information. The network 20 provides interconnectivity between the user devices 14, 16, data repository 12, and system 18. Users can access and upload content, such as idea submissions 22 and comments 24 on the submitted ideas, and the like to the network 20, which are then stored in the data repository 12 and processed by the system 18.
  • User devices 14, 16 can include desktop, laptop, or tablet computers, handheld devices, such as mobile telephones and mobile Internet devices, and other computing devices capable of communicating via the network.
  • The users form an innovation community. Some of the users serve as submitters of ideas. One or more of the users may be tasked with providing comments on submissions of other users, or all users may serve as commenters. The ideas submitted may include suggestions for solving an existing problem, suggestions for modifications to an item, such as a product, service, or company procedure, suggestions for a new item, such as a product service, or company procedure, or the like. The comments 24 may relate to any aspect of the submission. FIG. 1 shows a submitter 26, who generates an idea submission 22 and a commenter 28, who provides comments 24 on the submission, i.e., the comments are provided by a different person 28 from the person 26 who generated the idea submission. It is to be appreciated that in an innovation community there may be a large number of submitters and a large number of commenters. In some cases submitters may serve as commenters on the ideas of others, and vice versa.
  • With reference also to FIG. 2, the idea submissions 22A, 22B may be submitted in response to a request for submissions on a given topic or may be submitted without any specific request. Submitters 26 may generate their submission by adding information to one or more fields of a template submission form 30. One of the fields may be a title field 32, in which the submitter provides a title 34 for the idea, if this has not been decided in advance. Another of the fields may be a suggestion field 36 in which the user enters a textual description 38 of the idea. In some embodiments, commenters 28 may add comments to the same form 30, e.g., the form may include one or more comments field(s) 40, 42 for one or more commenters to add their textual comments 44, 46. Alternatively, commenters may enter their textual comments on a separate comments form. The commenter may also provide a rating 48 of the idea (on a scale of, for example, 1 to 5 stars, where 5 represents a highly positive rating and 1 represents a highly negative rating).
  • Returning to FIG. 1, the idea management system 18 includes memory 50 which stores instructions 52 for performing the exemplary method and a processor 54 in communication with the memory for executing the instructions. The system 18 may be hosted by a suitable computing device or devices, such as the illustrated server computer 56. The system communicates with the network 20 via a network interface 58. Hardware components 50, 54, 58 of the system may communicate via a data control bus 60.
  • In one embodiment, the system 18 is hosted by a computing device within the organization that requests the idea submissions. In other embodiments, the processing of the submissions may be performed by an external web service and the results returned to the requesting organization.
  • The host computer 56 may be a PC, such as a desktop, a laptop, palmtop computer, portable digital assistant (PDA), server computer, cellular telephone, tablet computer, pager, combination thereof, or other computing device capable of executing instructions for performing the exemplary method. The memory 50 may represent any type of non-transitory computer readable medium such as random access memory (RAM), read only memory (ROM), magnetic disk or tape, optical disk, flash memory, or holographic memory. In one embodiment, the memory 50 comprises a combination of random access memory and read only memory. In some embodiments, the processor 54 and memory 50 may be combined in a single chip. The network interface 58 allows the computer to communicate with other devices via the computer network 20, and may comprise a modulator/demodulator (MODEM) a router, a cable, and and/or Ethernet port. Memory 50 stores processed data as well as instructions for performing the exemplary method described below.
  • The digital processor 54 can be variously embodied, such as by a single-core processor, a dual-core processor (or more generally by a multiple-core processor), a digital processor and cooperating math coprocessor, a digital controller, or the like. The digital processor 54, in addition to controlling the operation of the computer 56, executes instructions stored in memory 50 for performing the method outlined in FIG. 3.
  • As illustrated in FIG. 1, the instructions 52 may include some or all of the following: a natural language linguistic processing component 62, an idea processing component 64 for identifying a core of the idea, a comment classification component (classifier) 66, a comment interpretation component 68, and an information output component 68. The linguistic processing component 62 may include a natural language parser which processes at least one of the title 34, the free text description 38 of the suggestion and the free text comments 44, 46 to identify words and dependencies (syntactic and/or semantic relations) between words of the text. The idea processing component 64 identifies a probable core 80 of the idea from the suggestion 38 and optionally compares this with the title 34 of the idea. The comment classification component 66 classifies the comments based on the processed text and assigns one or more class labels 82 to the comments. The classes (reaction types) may be selected from a predefined set of classes which express the commenter's reaction to the idea. Comment classes may include some or all of the following: a first class for comments conveying additional ideas; a second class for evaluative comments (agree/disagree, positive/ negative, pros and cons), and a third class for comments that concern the idea status within the idea management system. Classes may have subclasses. For example, the evaluative comments class may have a ‘positive’ comments subclass and a ‘negative’ comments subclass, etc.
  • Depending on the comment class, the comment interpretation component 68 may suggest or implement one of two or more different actions. For example, evaluative comments may be turned into votes, and/or comments conveying additional ideas may be appended to the idea description. In one embodiment, the actions are recommended to a reviewer 84 on an associated user interface 86, e.g., a display device of a client computing device analogous to devices 14, 16. The comment interpretation component 68 may include a tool for refining the content, which is called on when the comment is assigned to the first comment class, a tool for assessing the value of the idea, which is called on when the comment is assigned to the second comment class, and a tool for managing the process, when the comment is assigned to the third comment class. Comments classed as unclassified may be ignored.
  • In one embodiment, the classification component 66 is rule based, i.e., applies a set of grammar rules to the processed text and depending on the output of the applied rules, determines a class. The rules may include patterns which when met by text of a comment, return a class and/or subclass for the textual content. In another embodiment, the classification component employs a probabilistic classifier model (or models) trained to output a class based on the processed text. Such a classifier model may have been trained on a set of training comments that are each manually labeled with a respective one of the set of class labels.
  • The information output component 58 outputs information 60 based on one or more of the identified core, classification of the comments and/or the output of the processing of the comments.
  • The term “software,” as used herein, is intended to encompass any collection or set of instructions executable by a computer or other digital system so as to configure the computer or other digital system to perform the task that is the intent of the software. The term “software” as used herein is intended to encompass such instructions stored in storage medium such as RAM, a hard disk, optical disk, or so forth, and is also intended to encompass so-called “firmware” that is software stored on a ROM or so forth. Such software may be organized in various ways, and may include software components organized as libraries, Internet-based programs stored on a remote server or so forth, source code, interpretive code, object code, directly executable code, and so forth. It is contemplated that the software may invoke system-level code or calls to other software residing on a server or other location to perform certain functions.
  • As will be appreciated, FIG. 1 is a high level functional block diagram of only a portion of the components which are incorporated into a computer system 18. Since the configuration and operation of programmable computers are well known, they will not be described further.
  • With reference to FIG. 3, a method of for identifying the core of an idea submission (and optionally classifying comments) is shown, which may be performed with the system of FIG. 1. The method begins at S100.
  • At S102, idea submissions and/or associated comments on the idea submissions are received by the system.
  • At S104, the submissions and/or comments are natural language processed with component 52.
  • S106, a core of the idea may be identified with the idea processing component 64.
  • At S108, the natural language processed comments may be classified with comment classification component 66.
  • At S110, the classified comments may be processed with interpretation component 68, and actions to be recommended generated, based on the assigned class.
  • At S112, information based on the identified core and/or classification/processing of the comments is output from the system by the output component.
  • The method ends at S114.
  • As will be appreciated, the steps of the method need not all proceed in the order illustrated and fewer, more, or different steps may be performed. Additionally, the identification of the core S106 and classification/interpretation of comments S108/S110 need not all be performed in the method. For example, in one method, S108 and S110 are omitted, the core is identified at S106 and the information output may be the identified core and/or a proposed title which is based on the identified core. In another method, S106 is omitted and the information output is based on the classifications of the comments and/or their interpretation.
  • The method illustrated in FIG. 3 may be implemented in a computer program product that may be executed on a computer. The computer program product may comprise a non-transitory computer-readable recording medium on which a control program is recorded (stored), such as a disk, hard drive, or the like. Common forms of non-transitory computer-readable media include, for example, floppy disks, flexible disks, hard disks, magnetic tape, or any other magnetic storage medium, CD-ROM, DVD, or any other optical medium, a RAM, a PROM, an EPROM, a FLASH-EPROM, or other memory chip or cartridge, or any other non-transitory medium from which a computer can read and use. The computer program product may be integral with the computer 56, (for example, an internal hard drive of RAM), or may be separate (for example, an external hard drive operatively connected with the computer 56), or may be separate and accessed via a digital data network such as a local area network (LAN) or the Internet (for example, as a redundant array of inexpensive of independent disks (RAID) or other network server storage that is indirectly accessed by the computer 56, via a digital network).
  • Alternatively, the method may be implemented in transitory media, such as a transmittable carrier wave in which the control program is embodied as a data signal using transmission media, such as acoustic or light waves, such as those generated during radio wave and infrared data communications, and the like.
  • The exemplary method may be implemented on one or more general purpose computers, special purpose computer(s), a programmed microprocessor or microcontroller and peripheral integrated circuit elements, an ASIC or other integrated circuit, a digital signal processor, a hardwired electronic or logic circuit such as a discrete element circuit, a programmable logic device such as a PLD, PLA, FPGA, Graphical card CPU (GPU), or PAL, or the like. In general, any device, capable of implementing a finite state machine that is in turn capable of implementing the flowchart shown in FIG. 3, can be used to implement the idea management method. As will be appreciated, while the steps of the method may all be computer implemented, in some embodiments one or more of the steps may be at least partially performed manually.
  • FIG. 4 illustrates a method for generating the system for identifying the core of an idea and/or for classifying comments. The method begins at S200.
  • At S202, a set of terms is stored in memory 50 for each of a set of domain dependent concepts and a set of terms is stored for each of a plurality of domain-independent concepts.
  • At S204, for each of a plurality of comment classes, one or more patterns is generated for assigning a textual comment on an idea submission to the class based on the occurrence of a specified syntactic relation in the comment. At least some of the patterns specify that a term in one of the domain-dependent or domain-independent concepts is a syntactic relation with another term in the comment for the pattern to fire on the comment.
  • At S206 the patterns are stored in memory 50 for application to textual comments by the comment classification component 66.
  • Alternatively, at S208, a classifier model is trained on a set of manually labeled sample comments (each label corresponding to one of the comment classes). Any suitable machine learning method can be used, such as support vector machines, linear regression, or the like. The classifier model is stored in memory 50 at S206, for applying by the comment classification component.
  • At S210, rules may be generated and stored in memory for interpretation of some or all of the comments, which are dependent on the class which is assigned to the comment.
  • Additionally or alternatively, at S212 patterns are generated and stored in memory 50 for identifying the core of an idea based on the idea submission description.
  • At S214 a linguistic processing component, output component, processor, and any other components of the system that are needed may be provided.
  • At S216, the system may be tested on a set of comments/idea submissions to determine if the respective patterns/vocabulary are providing a desired level of precision, and if not, additional and/or different patterns and/or concept terms may be developed or patterns modified and the system reevaluated.
  • The method ends at S218.
  • While part of the method of FIG. 4 may be performed manually, part may be computer implemented, as for the method of FIG. 3, such as the storing terms and storing patterns and/or classifier model learning. As will be appreciated, the method need not include those steps which are related to identifying the core for a system which identifies the core but does not classify comments, and vice versa in the case of a system for classifying comments which does not identify the idea core.
  • Further details of the system and methods will now be provided.
  • Receiving Submissions and Comments (S102)
  • In some embodiments, the idea submissions may be requested by someone in the organization, e.g., in the case where the submitters are employees. The request may be limited to a subset of the employees, such as the employees in a particular department. In other embodiments, the entire workforce may be asked to provide their suggestions. For example, a facilitator may communicate a request to employees for their ideas about a new product, process, or service that the company is considering introducing. The request may be limited to a particular product or service or may be open to any idea that the employees may want to propose.
  • When customers are the submitters, they may be invited to propose ideas for new products or services or for ideas on existing products and services. In some cases, the customers may be limited to those who have purchased a product or service on which they are being asked for suggestions. In other embodiments, they may be contacted through the company website, in stores where the company's goods/services are sold, or through other marketing channels. In the case of a Q&A website, provision is made so that one user may submit a description of a problem and other users may provide answers (submissions) and in some cases, comments on those answers.
  • The number of commenters who provide suggestions may be up to the number of other members of the community. In other embodiments, only a limited subset of the community is permitted to make comments.
  • Submissions and/or comments may be provided in one or more different formats, such as wiki pages, posts, emails, text messages, instant messages, tweets, voice messages, faxes, or the like. In the illustrated embodiment, a plurality of commenters may provide their comments on the same idea suggestion using electronic form 30.
  • Linguistic Processing (S104)
  • The linguistic processing component 62 processes the text of the submission and/or comments. The linguistic processing component 62 may be based on a general syntactic dependency parser with additional grammar rules added to identify the patterns useful herein. During parsing of the document, the parser annotates the text strings of the document with tags (labels) which correspond to grammar rules, such as lexical rules and syntactic and/or semantic dependency rules, such as SUBJ (a dependency between the subject of the sentence and the predicate verb) and OBJ (a dependency between the object of the sentence and the predicate verb). The lexical rules define features of terms such as words and multi-word expressions. Syntactic rules describe the grammatical relationships between the words, such as subject-verb, object-verb relationships. Semantic rules include rules for extracting semantic relations such as co-reference links. The application of the rules may proceed incrementally, with the option to return to an earlier rule when further information is acquired. The labels applied by the parser may be in the form of tags, e.g., XML tags, metadata, log files, or the like.
  • The following disclose a parser which is useful for syntactically analyzing an input text string in which the parser applies a plurality of rules which describe syntactic properties of the language of the input text string: U.S. Pat. No. 7,058,567, issued Jun. 6, 2006, entitled NATURAL LANGUAGE PARSER, by Aït-Mokhtar, et al.; Aït-Mokhtar, et al., “Robustness beyond Shallowness: Incremental Dependency Parsing,” Special Issue of NLE Journal (2002). Similar incremental parsers are described in Aït-Mokhtar “Incremental Finite-State Parsing,” in Proc. 5th Conf. on Applied Natural Language Processing (ANLP'97), pp. 72-79 (1997); Aït-Mokhtar, et al., “Subject and Object Dependency Extraction Using Finite-State Transducers,” in Proc. 35th Conf. of the Assoc. for Computational Linguistics (ACL'97) Workshop on Information Extraction and the Building of Lexical Semantic Resources for NLP Applications, pp. 71-77 (1997); and Caroline Hagège and Xavier Tannier, “XTM: A robust temporal processor,” CICLing Conference on Intelligent Text Processing and Computational Linguistics, Haifa, Israel, Feb. 17-23, 2008. The syntactic analysis may include the construction of a set of syntactic relations from an input text by application of a set of parser rules. Exemplary methods are developed from dependency grammars, as described, for example, in Mel'{hacek over (c)}uk I., “Dependency Syntax,” State University of New York, Albany (1988) and in Tesnière L., “Elements de Syntaxe Structurale” (1959) Klincksiek Eds. (Corrected edition, Paris 1969). An exemplary parser is the Xerox Incremental Parser (XIP).
  • In addition to the rules applied by the general syntactic dependency parser, the linguistic processing component (and/or idea processing component and/or comment classification component) may apply rules which identify and tag domain-dependent and domain-independent concepts and apply a set of patterns which specify semantic relations involving these concepts.
  • In one embodiment, a set of patterns adapted to core analysis or comment classification is employed for identifying syntactic and/or semantic relations (dependencies) between particular grammatical structures and/or morpho-syntactic forms and words representing particular domain-independent or domain-dependent concepts. Exemplary patterns are described in more detail below.
  • The domain-dependent concepts may be customized for each application and may be stored in a vocabulary 90, such as lexicon. In the lexicon, each concept may be associated with a set of terms which represent it. Each term can be a word or phrase. One domain-dependent concept may relate to the organization, such as a company, to which the suggestions are addressed. As an example, the name of the company may be used as the title of the concept and the terms representing it may include some or all of: variants of its name, its divisions, its products, its employees, and its activities. When terms in the domain-specific lexicon 90 are identified in the text being processed, they are labeled with the appropriate concept label (e.g., COMPANY or PRODUCT).
  • Domain-independent concepts may also be stored in the lexicon 90, or a separate lexicon, which relate to the task being performed by the IMS. For example, in the case of identifying the core idea in the suggestion 38, the domain-independent concepts may include a PERFORMATIVE concept, which includes terms often verbs, which are often found in combination with an action to be performed, and an IDEA concept which is associated with terms relating to an idea assertion. Examples of the PERFORMATIVE concept may include verbs such as suggest, propose, and recommend. Examples of the IDEA concept include terms such as idea, suggestion, proposition, proposal, and the like.
  • In the case of classification of comments, the domain-independent concepts may include some or all of the following concepts and terms:
      • IDEA: idea, suggestion, proposition, proposal, concept, thought, brainwave, notion, initiative, plan, inspiration, etc.
      • SIMILAR: another, related, same, similar, comparable, etc.
      • PROCESS: accept, assign, process, hold, administer, etc.
      • AGREE: agree, concur, right, yes, perfectly, agreement, etc.
      • DISAGREE: disagree, differ, oppose, wrong, no, disagreement
      • LACK: need, require, lack, want, should
      • SUGGESTION: suggest, advise, add, propose, submit, put forward, recommend, suggestion, etc.
      • GOOD: good, lovely, splendid, wonderful, cool, fantastic, great, decent, useful, excellent, clever, etc.
      • BAD: bad, poor, awful, terrible, flawed, rotten, unpleasant
  • Some of these concepts, such as the IDEA and SUGGESTION concepts, may also be used for identifying the idea core. The creation of the sets of concept terms can be performed manually or semi-automatically, for example using a glossary of terms used within the company for the ORGANIZATION concept. Manually selected terms can be supplemented with synonyms from a thesaurus. Lexical items (individual words) may also be stored.
  • The example patterns described below include those which identify a semantic relation, i.e., a specified relationship between two text elements in a sentence (such as SUBJ or OBJ) where at least one of the text elements is tagged with one of the concepts. Each text element can include one or more words. Other useful patterns identify syntactic relations in which specific verb forms are used.
  • Core Analysis (S106)
  • The identification of the core of an idea (idea spotting) involves automatically detecting the idea core(s) within idea descriptions. In some embodiments, only a part of the description is analyzed for identifying the core of the idea, such as the first one or two sentences. In other embodiments, the entire description is analyzed. In yet other embodiments, the sentences are analyzed in turn until one of the sentences is determined to include a core.
  • Idea spotting can be based on analyzing idea descriptions 38 in terms of speech act theory (first developed in John Langshaw Austin: How to Do Things With Words. Cambridge Mass. (1962). According to speech act theory, speakers perform illocutionary acts by utterances using special linguistic structures. Conveying ideas is performed by directive speech acts, which are defined as acts that cause the hearer to take a particular action. A “linguistic structure” refers generally to a word or phrase and the linguistic tags that have been applied by the parser, such as part of speech, gender, tense, etc.
  • Some linguistic structures are ambiguous with respect to the illocutionary act that they convey, i.e., the same structure can convey different illocutionary acts depending on the communicative situation. For example a question, such as “Could the new printer include a scanner?” may be an inquiry, a request or a suggestion. In the case of detecting the core(s) of an idea, such ambiguity is resolved by the predefined communicative situation of the IMS as a space for proposing ideas. As a consequence, once a particular linguistic structure is identified in the context of the participants as conveying a directive speech-act, it is considered that the identified linguistic structure conveys a suggestion to be acted on, i.e., a core of the idea of an idea submission. In a rule-based system, patterns are structured (modeled) to identify grammatical indicators of directive speech-acts in text, in particular, to identify when someone wants someone else to do something, such as in the form of a request, order, proposal, assertion, or question. In the exemplary system, therefore, detecting the core of an idea includes detecting sentences that use particular linguistic patterns for expressing directive illocutionary acts in the context of the participants of the illocutionary act, i.e., the idea owner and/or the organization. Some of the patterns require a dependency with a domain-specific word, while others identify a dependency in which “I” or “me” is the subject.
  • The following types of linguistic patterns for identifying a core of an idea are exemplary and are provided together with examples in which the patterns are underlined:
  • 1. Performative verb forms: in which a PERFORMATIVE verb (selected from the verbs associated with this concept) has a first person singular (“I”) as its subject I suggest/propose/recommend . . . For example,
      • I suggest that CoffeeCo uses recyclable cups . . .
  • 2. Idea assertion: a term from the concept IDEA is in a syntactic dependency with a first person singular pronoun (“my”). For example,
      • My next idea is a refreshing fruity drink which I call Applechino
  • 3. Imperative: in which a verb beginning a sentence or clause has a domain-dependent concept (e.g., COMPANY) as its object. For example:
      • Connect CoffeeCo with a social network website.
  • Here, the imperative form connect, is in an OBJ-type relationship with CoffeeCo, which is assumed to be in the domain-dependent concept, COMPANY.
  • 4. Conditional: in which any conditional pronoun expression (e.g., conditional verb in a dependency with a pronoun) is also in any syntactic relationship with a domain-dependent concept (e.g., COMPANY).
      • Also you could sell toys with the CoffeeCo logo on them and the toys could be shaped like coffee cups.
  • 5. Question: in which any main verb of a question in any syntactic relationship with a domain-dependent concept (e.g., COMPANY). For example:
      • Why has the West 9th Street CoffeeCo stopped selling donuts?
  • 6. Need assertion: in which the verb “need” is in any syntactic relationship with a domain-dependent concept (e.g., COMPANY). For example:
      • I have suggested before that more CoffeeCo Drive-Thru's are needed for people with children or pets that can't park and walk in.
  • The system may employ two, three four, five or all of these exemplary patterns. These linguistic structures can be encoded by rules in rule-based NLP systems such as in the form of additional rules on top of a parser as described above. In another embodiment, training examples can be annotated for training a machine learning system. In particular, idea descriptions, which may have undergone some initial linguistic processing, have their core idea(s) manually labeled. These samples are used to train a classifier model using a machine learning technique, such as support vector machines (SVM) or linear regression.
  • The context of the idea-owner can be indicated by any expression where the author of the idea refers to herself or himself (e.g., the pronoun “I”, “my”, “we”, “ours”, “our company”, etc.).
  • The context of the organization can be indicated by the organization-specific domain-dependent vocabulary 90. This vocabulary can be obtained by various methods such as calculating term frequency or using glossaries.
  • An idea core is detected if the words conveying the linguistic pattern and the domain vocabulary are in dependency relationships, like the underlined words in the sentences above. In some cases, more than one idea core may be detected in a single idea submission, in which case, one, such as the first, may be adopted as the idea core, or the user may be asked to select one. In other embodiments, candidate cores may be ranked, with the ranking being based on the order in which the cores were identified in the description and/or on the type of pattern which identified them. In one embodiment, there may be a predefined precedence established among the patterns. For example, the first linguistic pattern matched in the description which is of the performative type or insertion type (direct expressions) may be used as the (main) core, with other identified cores optionally being considered as supplemental core(s). If there is no core identified which is of a direct expression type, then a first idea core identified with one of the other patterns may be selected.
  • The idea core can be the entire sentence identified as containing the core, or may be less than a sentence, such as the clause of the sentence on which the pattern fired. For each sentence of the description, the idea processing component 64 may output a decision, such as a binary decision, on whether or not the sentence includes a core, based on whether any of the patterns (or classifier) fired on the sentence.
  • Comment Classification (S108)
  • In one embodiment, the exemplary system provides automatic linguistic analysis of idea descriptions 38 and free-text comments 44, 46. The automatic analysis supports both facilitators and content-providing users.
  • The comment classifier 54 automatically labels comments according to their type of reaction to the idea. This facilitates using the classified comments for various analytical purposes and for enhancing the effectiveness of the idea management system 18. The classified comments are each interpreted, when applicable, as the action that the comment represents, and its execution may be supported by the user interface 86 of the system 18. This approach is consistent with the language-action perspective (Winograd, T., “A language/action perspective on the design of cooperative work,” Human Computer Interaction, 3(1) 3-30(1987)).
  • Three exemplary comment classes can be used as follows:
  • 1. Reaction to the content of the idea: this class allows the facilitators of the IMS 18 to enhance the idea submission with additional propositions put forth in the comments, or to categorize the content of the idea submission as existing, e.g., it refers to an idea, product, or service that already exists in the organization (based on the comment).
  • 2. Expression of the commenter's attitude towards the idea: discovering positive or negative attitudes in comments may serve as a complement to an existing voting mechanism or may be used as a voting mechanism itself. For example, a rating may be computed based the comments, e.g., as positive neutral, or negative. In one embodiment, the computed rating is compared with an actual rating of the idea given by the commenter, e.g., the commenter may be requested/permitted to provide a rating 48 of the idea submission. If the computed and actual ratings agree, the actual rating may be labeled as reliable. If the computed rating and commenter's actual rating do not agree, the actual rating may be modified or flagged as unreliable. For example, in FIG. 2, one of the comments “Yes!, why not every week?” is assigned a ‘positive” rating by the system, but the commenter has only given the idea two stars out of five, which corresponds to fairly negative. Since the actual rating and the computed rating differ by more than a threshold amount, the rating 48 may be flagged as unreliable.
  • 3. Meta-reaction: reaction relative to the idea generation workflow: this category can help provide information about the commenter's reactions regarding the idea status within the IMS workflow.
  • The automatic categorization of the comments according to the three reaction types can be carried out by various methods. By way of example, a rule-based automatic classification will be described. In this method, each reaction type is defined in terms of a set of linguistic patterns or rules, each rule including linguistic structures and/or associated lexical items/expressions. The linguistic structures mainly convey the comment classes, whereas the lexical items mostly refer to the topic of the comment. The list is definitely not complete but in our experiment, where we tested our proof-of-concept system, the proposed list accounted for a large part of all the reactions.
  • Exemplary patterns for the three reaction types are listed below. Each reaction type can be further categorized into fine-grained classes (categories) that depend on particular systems. The fine-grained categories in the list apply to a given organization's IMS. The following notation is used in the descriptions:
  • The words in capital letters represent CONCEPTS that are instantiated by lists of domain-specific and general vocabulary words. Whenever two concepts are mentioned, the words that instantiate them should be in a syntactic dependency relation. The words in italics are lexical items. The content underlined is conveyed by linguistic structures (tense, mood, voice, etc.).
  • 1. Reaction to the content of the idea:
  • This class of comments identifies the commenter's view of the content of idea (e.g., whether it already exists or is similar to an existing idea, or whether it could be changed, e.g., amplified) and can include the following subclasses:
  • A. PRIOR ART: previous work done/related thing in progress/idea exists. Two types of prior art patterns can be implemented:
  • i. IDEA/PRODUCT: in which the concept IDEA, PRODUCT, or COMPANY (or other ORGANIZATION concept) is the subject of the verb “exist.” For example: a pattern can be of the form:
      • IDEA/COMPANY exists/has been done/used to exist/was done at some point in the past, e.g.: IDEA or COMPANY subject of the verb “exist”.
  • This pattern could identify a prior art reaction in the text string:
      • This solution (IDEA) exists within CoffeeCo's marketing department
  • ii. SIMILAR IDEA: an IDEA is similar to something existing. For example, a pattern can be of the form:
  • a term from the concept SIMILAR in any syntactic relationship with a term from the concept IDEA.
  • For example this pattern would identify a prior art reaction in the text string:
      • The proposal (IDEA) is comparable (SIMILAR) to what we tried in North America
  • B. ADDITIONAL INFORMATION: In this type of comment, the commenter expands the idea (suggestion, question, advice, additional thoughts, request) with his or her own idea, e.g., “Since this is something that we already have, I wonder if it is something that could be centralized.” Examples of patterns which can be used to detect this type of comment can be of one or more of the following types:
  • i. Comparison of IDEAS, in which IDEA is a subject complement of a comparative adjective (selected from a predefined set of comparative adjectives, such as better, improved, worse, nicer, etc.). For example this pattern would identify an additional information in the text string:
      • This is a better (comparative adjective) proposal (IDEA) than using inflatable cups
  • ii. Conditional, in which any conditional auxiliary verb is present in the sentence. For example:
      • We could (conditional auxiliary verb) include a coupon for a sample of the new coffee.
  • iii. An IDEA/COMPANY needs: in which IDEA or COMPANY is in any syntactic relationship with a term from the domain-independent concept LACK. It often suggests something missing from the idea submission, or company, or that the idea submission is not understood. For example:
      • CoffeeCo (COMPANY) needs (LACK) to make the cups smaller to do this.
  • iv. Question: the comment is in the form of a question. For example:
      • Why doesn't CoffeeCo try this in the North East?
  • v. imperative: the comment is in the imperative form. For example:
      • Try this!, Do this, or Go and see this webpage
  • vi. I SUGGEST—my SUGGESTION: in which the first person pronoun is in any syntactic relationship with a term from the domain-independent concept SUGGESTION. For example:
      • I suggest that we also add a logo to the cup.
  • 2. Expression of the commenter's judgment of the idea's value
  • Here, the commenter is not trying to change the content, but evaluate the comment. This class can include different types of evaluative comments.
  • Some expressions can be classified into positive and negative categories using a sentiment vocabulary and/or patterns. A sentiment vocabulary classes certain words as being positive or negative and may assign a value on a scale of positive/negative. For example, fabulous, may be classed more positive than nice. In some embodiments, a set of positive terms in the sentiment vocabulary is collected into a domain independent concept GOOD and optionally a set of negative terms is collected in a domain independent concept BAD. In other embodiments, the company may only be interested in judgments that are supportive of the idea (e.g., using the concepts GOOD, AGREE), and thus negative judgments may be ignored (or vice versa). Words of negation usually reverse the polarity, for example, not very good, is classed as negative. Example judgment subclasses may be as follows, although more or less refined subclasses are also considered, such as simply positive and negative.
  • A. AGREE (e.g., “True, I support this idea.”). Two patterns are suggested:
  • i. I agree/support: in which a verb in the domain-independent concept AGREE is in any syntactic relationship with a first person pronoun. For example.
  • I agree that we need to focus on this.
  • ii. AGREE adverb beginning the sentence. An adverb in the domain-independent concept AGREE is present at or near the beginning of the sentence. For example:
  • Perfectly described!
  • B. POSITIVE ATTITUDE Two patterns are suggested:
  • i. GOOD IDEA: A word in the domain independent concept GOOD precedes a word in the domain independent concept IDEA. For example:
      • Great idea, Kelly.
  • ii. the comment includes an expression from a lexicon of specific expressions that are recognized as connoting a positive attitude, e.g., Sounds good/makes sense. The pattern fires when the specific words themselves are used. For example:
      • This makes sense to me.
  • C. NEGATIVE ATTITUDE Two patterns are suggested:
  • i. BAD IDEA: A word in the domain independent concept BAD precedes a word in the domain independent concept IDEA. For example:
  • Bad idea, Bob
  • ii. Negation in the sentence: Patterns which use the word not, particularly in a pattern which would otherwise be an indicator of a positive attitude, such as:
  • This is not the kind of savings/revenue idea that we are looking for.
  • This is not a very good idea
  • D. PROS-CONS arguments for and against the idea. Two patterns are suggested:
  • i. Yes . . . but/ . . . , however/while . . . . For example:
  • Yes, we can use WebEx or Live Meeting, but it only displays our computer screen.
  • we can use WebEx or Live Meeting, however it only displays our computer screen.
  • ii POSITIVE ATTITUDE+but For example:
  • We could introduce this next year, but I don't expect it will be ready
  • 3. Meta-reaction: reaction relative to the idea generation workflow
  • The comment concerns the management of the process for implementing a product/process that is the focus of the idea (e.g., “Rating—Hold until 2010”). Depending on the stage at which the process is currently, the idea may need to wait to be implemented, could be implemented on a trial basis, or could be ready for implementing now. A pattern identifies when a term in the domain-specific concept PROCESS is in the sentence. The concept PROCESS is a vocabulary of company-specific processes for ideas.
  • The comment classification component 66 can be implemented by applying pattern-matching rules. Alternatively a collection of training examples can be used as annotated examples for training a machine-learning system.
  • Comment sentences which satisfy one of the patterns are labeled with the class corresponding to that pattern, and optionally, more fine-grained tags corresponding to the particular pattern(s) that fired.
  • While three comment classes are described by way of example, it is to be appreciated that fewer or more comment classes may be employed for identifying different types of comments. The number of different comment classes may depend, in part on the available interpretation methods. The system may employ two, three four, five, or more, or all of the exemplary patterns described above, such as at least one for each class.
  • The comment classification may have a specific goal within the community, which depends on the type of innovation community. For example, a community of employees typically deliberates on innovative projects that the company should invest in, while a customer community typically deliberates on the next line of products that they want. The set of fine-grained reaction types may be modified to be appropriate to the particular community goal. In the case of a community of employees seeking novel and promising ideas, one aspect of the comment classification is to identify prior art, if there is any, in the idea content. In the case of a customer community, patterns for identifying existing products/services may not be needed. However, two of the three main reaction types, “Reaction to the content of the idea” and “Expression of the commenter's judgment of the idea” are largely system independent. As for the “Meta-reaction” type, it may be present whenever formal idea-management steps are defined in the system.
  • The set of fine-grained classes (categories) may be customizable for the specific community, since the reaction types depend on the community goals. Thus, some comment classes/subclasses can change. That is, as a new corpus is given, the classes and list of reactions can be adapted based on the goal and content of this system.
  • It has been found that a set of about 40 or 50 pattern matching rules of the type described above are sufficient to generate a practical system for classifying comments, although a larger number or rules could allow a higher precision or retrieval to be achieved.
  • The classification of the comments, as reactions to ideas, into three classes, as described herein, has several benefits. First, in logical terms, the classification distinguishes three distinct aspects of the idea that can be targeted by the commenter's reaction: idea content, idea value, or idea state with respect to a given workflow. Second, in terms of tools that a social media platform can include, the classification corresponds to distinct types of technologies: for the first class, the IMS may include content generation and management tools (e.g., wikis, document editors, tagging tools), for the second class, the IMS may include tools for judging the value of ideas (e.g., tools for voting, deliberation, sentiment analysis), and for the third class, the IMS may include tools for managing processes (e.g., planning or workflow systems).
  • The classification of three basic comment types logically maps onto three basic actions types that an idea-generating social media platform can recommend. For example, detecting the reaction to the idea content leads to recommending actions aimed at content generation and management. Detecting the commenter's attitude towards the idea value leads to recommending actions aimed at voting and deliberating on what idea to select. Detecting reaction to the process leads to recommending actions aimed at managing the idea state or process with respect to a given workflow.
  • Comment Interpretation and Action Recommendation (S110)
  • Several of the comments found in an idea-generating social media platform are aimed at performing specific actions. The number of these actions is usually limited, given the specific and shared goal of the platform. For example, ADDITIONAL INFORMATION aims at refining the idea by adding new content or a clarification. PRIOR ART aims at appending a reference to prior work or ideas to the content of the current idea. Accordingly, after classifying each comment, the interpretation component 68 may recommend, to the user, e.g., commenter 28 or reviewer 84, the inception or full execution of a corresponding action.
  • For example FIG. 5 illustrates a screenshot of an exemplary user interface 92, which may be displayed on the commenter's display device. By application of the patterns, the comment classifier has classified the comment as a case of an ADDITIONAL INFORMATION (ADDID). As the commenter submits the comment to the idea “Automation of Creating . . . ”, the comment is classified by the classification component as an ADDID. The interpretation component 68 then asks the commenter via a dialog box 94 if the user would like to execute the action of appending the content of the comment to the description of that idea. If the user clicks the button “Yes, append”, then the system may let the user choose the point of the description where the comment should be appended. The illustrated dialog box is an embodiment of the action recommendation with which the user interacts by accepting or rejecting it. After the user responds, the system sends a request 96 (FIG. 6) to the author of the idea (and/or moderator) to either accept or reject the edit or addition proposed. Various ways for requesting confirmations of the proposed change can be implemented, e.g., through a version manager such as in the github.com code management system.
  • A similar procedure can be implemented for a comment classified as PRIOR ART: if related ideas are mentioned, the system can suggest that the two ideas are related. This is another embodiment of action recommendation, which the user can accept or reject.
  • For POSITIVE ATTITUDE or AGREE comments the commenter may be given the option of giving a positive vote (e.g., a thumbs up or a positive value on a rating sale) to the idea and for NEGATIVE ATTITUDE, a negative vote (e.g., a thumbs down or negative value on a rating sale). In other embodiments, the vote may automatically be generated. The vote is added to the votes previously cast by other commenters and may be displayed in a voting region the interface, as illustrated at 98. The voting information may be forwarded to a person/department associated with evaluating the comments.
  • In the case of process related comments, each of a plurality of the PROCESS terms may be associated with a respective specific action, for example, comments including the PROCESS term “implement now” may be forwarded to a department tasked with the process of implementing the product or service.
  • Information Output (S112)
  • A. Comment Classification
  • In the case of comment classification, the information output may be an assigned comment class for each comment (where none of the three example classes is assigned, the comment may be assigned to an “unclassified” class). Alternatively or additionally the information output may be a request to append, or automatically appended comment or part thereof, in the case of a comment assigned to the first class, or a vote/rank or request for a vote/rank, or other scoring metric, in the case of a comment assigned to the second class.
  • In other embodiments, the information output may be the number or proportion of the comments assigned to each class. In other embodiments, a reviewer may request to see all submission in a given one or more of the classes/subclasses, and this information may be output.
  • B. Idea Spotting
  • In the case when the idea core is identified at S106, this may be output and/or proposed at S112 as a replacement title for the submission. The output component 70 may include an interactive visualization component that generates a Graphical User Interface (GUI) 100 for display to a user on a client device, as illustrated, for example, in FIGS. 7 and 8. The GUI visualizes the identified idea core(s) 80, allows the user to validate them, and the validated cores are then visualized as validated for future use (with a stored history of validation). The GUI may allow the user to modify, remove, and/or add a new core(s) manually. For example, the user interacts with the GUI using a user input device of the client device 14, 16, 86, such as a keyboard, keypad, touch screen, cursor control device, or combination thereof. Where more than one idea core is identified, a single one may be output, or a set of idea cores may be displayed in a ranked order on the GUI, with the opportunity for the user to select one. In one embodiment, the automatically identified core(s) may be highlighted in the description, as shown in FIG. 7. The user can click on an “accept” 102 or a “decline” 104 actuable area of the screen, or employ other suitable selection mechanism. In the illustrated example, the user has declined the highlighted entry, allowing the user to select a new idea core or to modify it. As shown in FIG. 8, the user has opted to modify the identified core by including additional text of the idea description. A highlighting pen may be automatically displayed, as shown at 106, for the user to make the text selection.
  • The exemplary system and method are particularly useful in organizations interested in pursuing open innovation independently or collaboratively, where a large number of ideas and comments may be generated by employees, customers, or the like.
  • Without intending to limit the scope of the exemplary embodiment, the following Examples illustrate the applicability of the methods disclosed herein for idea spotting and comment classification.
  • EXAMPLES Example 1 Idea Core Spotting
  • A prototype system was developed for detecting ideas in an Idea Management System for a company operating a number of coffee shops. The linguistic patterns for the idea processing component 64 were encoded in XIP rules, and the domain-specific vocabulary was constructed manually, based on a development corpus. The development corpus consisted of 681 idea postings by customers. About 40 rules were manually generated using 50 words of domain-specific vocabulary (including the name of the company, its abbreviated forms, and the like). When these rules were on 21,000 postings the system detected 15,000 idea cores. The performance was manually evaluated on 50 postings, and achieved 75% recall and 93% precision.
  • Example 2 Comment Classification
  • An idea classifier was developed for a company IMS that is used by employees. 281 comments submitted in connection with idea submissions were analyzed first. For this evaluation, the submitted ideas were ignored. From this initial review, it was found that the comment classes, as listed below, were expressed in the first sentence of the comment in 88% of the comments. Out of the remaining 12% of comments, in 8%, no comment type could be identified in the entire comment text and in 4%, the comment type was expressed later in the text. Based on this analysis, it was decided to analyze the first sentences only. While this approach means that the method misses 4% of the comment types, there is a gain in precision and relevance if only the first sentence is considered.
  • Based on 285 comments, a domain vocabulary of approximately 140 words and about 50 pattern-matching rules were constructed. The resulting grammar was run on 3000 comments, out of which 2000 (70%) were classified.
  • 48 of the comments that were classified by the system were evaluated. The comments (first sentence only) and respective classification labels assigned by the system were presented independently to two human coders to assess inter-annotator agreement. They fully agreed in 81% of the cases, partially agreed in 12.5% of the cases (one evaluator agreed with the system classification, the other did not), and both disagreed in 6.3% of the cases. The Kappa value (removing the effect of chance) was 0.81. In those cases were there was agreement between the two human annotators, there was agreement with the automatic classification in 87% of the cases, and partial agreement and disagreement in 6.3%. The Kappa value was 0.87.
  • Finally, the linguistic processor 62 and classifier 66 were applied to a dataset of 1078 ideas in the organization, which received 2984 comments by 598 authors (73% of the ideas had at least one comment). The comment classification results were as follows:
  • ADDITIONAL INFORMATION: 19%
  • PRIOR ART: 9%
  • POSITIVE ATTITUDE: 17%
  • AGREE: 5%
  • NEGATIVE ATTITUDE: 2%
  • PROS-CONS: 4%
  • PROCESS: 12%
  • UNCLASSIFIED 26%
  • Therefore, a total of 28% of the comments conveyed a reaction to the content, 28% expressed a judgment about the value of the idea, 12% were devoted to managing the process, while the rest of the comments remained unclassified.
  • Some of the comments classified as “unclassified” may, in fact be noise, such as “thankyou,” or “hello”, so the unclassified comments can be filtered out. Further patterns may decrease the percentage of unclassified comments.
  • It will be appreciated that variants of the above-disclosed and other features and functions, or alternatives thereof, may be combined into many other different systems or applications. Various presently unforeseen or unanticipated alternatives, modifications, variations or improvements therein may be subsequently made by those skilled in the art which are also intended to be encompassed by the following claims.

Claims (22)

What is claimed is:
1. A method for identifying the core of an idea, comprising:
receiving an idea submission which includes a textual description of an idea;
natural language processing the textual description of the idea to identify dependencies in at least a part of the textual description;
providing for identifying directive illocutionary acts in the textual description, based on the identified dependencies;
providing for identifying an idea core of the idea submission, based on an identified directive illocutionary act;
outputting information based on the identified idea core,
wherein at least one of the natural language processing, identifying directive illocutionary acts, identifying an idea core, and outputting information is performed with a computer processor.
2. The method of claim 1, wherein the identifying directive illocutionary acts comprises identifying directive illocutionary acts in the context of participants of the illocutionary act selected from a submitter of the idea and an organization for which the idea submission is generated.
3. The method of claim 2, wherein the context of the submitter of the idea is indicated by an expression in which the submitter of the idea uses a personal pronoun that refers to the submitter or the organization for which the idea submission is generated.
4. The method of claim 1, wherein the identifying directive illocutionary acts comprises applying syntactic patterns each configured for identifying a directive illocutionary act.
5. The method of claim 4, wherein the applied syntactic patterns are each configured for identifying at least one of a request, order, assertion, suggestion, and a question.
6. The method of claim 4, wherein the patterns are selected from the group consisting of:
a first pattern which identifies when a verb selected from a set of verbs associated with a PERFORMATIVE concept has a first person singular as its subject;
a second pattern which identifies when a term selected from a set of terms associated with an IDEA concept is in a syntactic relationship with a first person singular pronoun;
a third pattern which identifies when a verb beginning a sentence or clause has, as its object, a term selected from a set of terms associated with a domain-dependent concept that is related to an organization for which the idea submission is generated;
a fourth pattern which identifies when a conditional verb is in a dependency with a pronoun and is also in a syntactic relationship with a term selected from a set of terms associated with a domain-dependent concept that is related to the organization for which the idea submission is generated;
a fifth pattern which identifies when a main verb of a question is in a syntactic relationship with a term selected from a set of terms associated with a domain-dependent concept that is related to the organization for which the idea submission is generated; and
a sixth pattern which identifies when a form of the verb “need” is in a syntactic relationship with a term selected from a set of terms associated with a domain-dependent concept that is related to the organization for which the idea submission is generated.
7. The method of claim 6, wherein at least three of the patterns are employed.
8. The method of claim 4, wherein when more than one core is identified in the idea description, the method further comprises ranking the identified cores based on at least one of an order of the cores in the description and a pattern which was used to identified the core.
9. The method of claim 1, wherein the providing for identifying directive illocutionary acts in the textual description comprises classifying sentences of the description with a classifier which has been trained using manually labeled description sentences to identify a sentence that includes core, based on the identified dependencies.
10. The method of claim 1, wherein the receiving an idea submission comprises receiving a completed template form in which the textual description of the idea is entered in a designated description field of the form.
11. The method of claim 1, wherein the template form further includes a designated field for a title to be entered, which is not used in identifying the core.
12. The method of claim 1, wherein the outputting information based on the identified idea core comprises proposing the identified core as a title for the idea submission.
13. The method of claim 1, wherein the outputting information comprises generating a graphical user interface for display to a user which visualizes the identified idea core and provides for the user to validate the idea core, or to remove, modify, or replace the idea core.
14. The method of claim 1, wherein the idea submission relates to a company's product or service and the idea submission is submitted by a customer or employee of the company.
15. The method of claim 1, wherein the receiving the idea submission comprises receiving a collection of idea submissions from a plurality of different submitters.
16. A computer program product comprising a non-transitory recording medium storing instructions, which when executed on a computer causes the computer to perform the method of claim 1.
17. A system comprising memory which stores instructions for performing the method of claim 1 and a processor in communication with the memory for executing the instructions.
18. A system for identifying the core of an idea, comprising:
a linguistic processing component which natural language processes an idea submission which includes a textual description of an idea to identify dependencies in the textual description;
an idea processing component which identifies directive illocutionary acts in the textual description, based on the identified dependencies and identifies an idea core of the idea submission, based on an identified directive illocutionary act;
an output component which outputs information based on the identified core; and
a processor which implements the linguistic processing component, idea processing component, and output component.
19. The system of claim 18, further comprising memory which stores, for each of a set of domain-dependent concepts, a respective set of terms, and for each of a set of domain-independent concepts, a respective set of terms, the idea processing component applying patterns for classifying the comments including patterns which each specify at least one of the domain-dependent and domain-independent concepts.
20. The system of claim 18, wherein the output component comprises a visualization component which visualizes the identified idea core and provides for the user to validate the idea core, or to remove, modify, or replace the idea core.
21. An idea management system comprising the core identification system of claim 18.
22. A method for generating a system for idea processing, comprising:
storing a set of terms in non-transitory memory for each of a set of domain dependent concepts and a set of terms for each of a plurality of domain-independent concepts;
storing a plurality of patterns for identifying a core of an idea based on the occurrence of a specified syntactic relation in a sentence of an idea submission description, at least some of the patterns specifying that a term in one of the domain-dependent or domain-independent concepts be a syntactic relation with another term in the comment;
storing the patterns in non-transitory memory for application to sentences of an idea submission description by an idea processing component; and
wherein at least one of the storing terms and storing patterns is performed with a computer processor.
US13/911,508 2013-06-06 2013-06-06 Method and system for idea spotting in idea-generating social media platforms Abandoned US20140365206A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US13/911,534 US9400779B2 (en) 2013-06-06 2013-06-06 Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms
US13/911,508 US20140365206A1 (en) 2013-06-06 2013-06-06 Method and system for idea spotting in idea-generating social media platforms

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US13/911,508 US20140365206A1 (en) 2013-06-06 2013-06-06 Method and system for idea spotting in idea-generating social media platforms

Publications (1)

Publication Number Publication Date
US20140365206A1 true US20140365206A1 (en) 2014-12-11

Family

ID=52006204

Family Applications (2)

Application Number Title Priority Date Filing Date
US13/911,508 Abandoned US20140365206A1 (en) 2013-06-06 2013-06-06 Method and system for idea spotting in idea-generating social media platforms
US13/911,534 Expired - Fee Related US9400779B2 (en) 2013-06-06 2013-06-06 Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/911,534 Expired - Fee Related US9400779B2 (en) 2013-06-06 2013-06-06 Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms

Country Status (1)

Country Link
US (2) US20140365206A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140365207A1 (en) * 2013-06-06 2014-12-11 Xerox Corporation Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms
US20150339583A1 (en) * 2014-05-20 2015-11-26 Aol Inc. Machine learning and validation of account names, addresses, and/or identifiers
US20160308816A1 (en) * 2015-04-20 2016-10-20 International Business Machines Corporation Reducing a negative social interaction
US20170109848A1 (en) * 2015-10-20 2017-04-20 International Business Machines Corporation Value Scorer in an Automated Disclosure Assessment System
US20200081980A1 (en) * 2018-09-07 2020-03-12 The Florida International University Board Of Trustees Features for classification of stories
WO2021029886A1 (en) * 2019-08-14 2021-02-18 Unify Patente Gmbh & Co. Kg Real-time communication and collaboration system and method of monitoring objectives
CN113408291A (en) * 2021-07-09 2021-09-17 平安国际智慧城市科技股份有限公司 Training method, device and equipment for Chinese entity recognition model and storage medium

Families Citing this family (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
DE112014000709B4 (en) 2013-02-07 2021-12-30 Apple Inc. METHOD AND DEVICE FOR OPERATING A VOICE TRIGGER FOR A DIGITAL ASSISTANT
US9256341B2 (en) 2013-03-20 2016-02-09 Microsoft Technology Licensing, Llc Tracking changes in collaborative authoring environment
WO2014210334A1 (en) 2013-06-28 2014-12-31 President And Fellows Of Harvard College Machine learning enchanced by human measurements
US10824787B2 (en) * 2013-12-21 2020-11-03 Microsoft Technology Licensing, Llc Authoring through crowdsourcing based suggestions
US11514399B2 (en) * 2013-12-21 2022-11-29 Microsoft Technology Licensing, Llc Authoring through suggestion
US9965776B2 (en) * 2013-12-30 2018-05-08 Verizon and Redbox Digital Entertainment Services, LLC Digital content recommendations based on user comments
US9467744B2 (en) * 2013-12-30 2016-10-11 Verizon and Redbox Digital Entertainment Services, LLC Comment-based media classification
US9747567B2 (en) * 2014-01-13 2017-08-29 Conduent Business Services, Llc Selecting organizational transportation proposals based on commuter pledges and cost-benefit computation
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US10068185B2 (en) * 2014-12-07 2018-09-04 Microsoft Technology Licensing, Llc Error-driven feature ideation in machine learning
US9645995B2 (en) 2015-03-24 2017-05-09 Conduent Business Services, Llc Language identification on social media
US20160300023A1 (en) * 2015-04-10 2016-10-13 Aetna Inc. Provider rating system
US10460227B2 (en) 2015-05-15 2019-10-29 Apple Inc. Virtual assistant in a communication session
US10095779B2 (en) * 2015-06-08 2018-10-09 International Business Machines Corporation Structured representation and classification of noisy and unstructured tickets in service delivery
US10025773B2 (en) 2015-07-24 2018-07-17 International Business Machines Corporation System and method for natural language processing using synthetic text
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US20170116557A1 (en) * 2015-10-21 2017-04-27 Tata Consultancy Services Limited System and method for performing root cause analysis on unstructured data
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10409878B2 (en) * 2016-01-29 2019-09-10 Kiran Girish Sonnad Algorithm for an automated, community based, periodic selection process of creative works
US10318622B2 (en) 2016-03-30 2019-06-11 International Business Machines Corporation Weighted annotation evaluation
US10102200B2 (en) * 2016-08-25 2018-10-16 International Business Machines Corporation Predicate parses using semantic knowledge
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
US10089297B2 (en) * 2016-12-15 2018-10-02 Microsoft Technology Licensing, Llc Word order suggestion processing
US10628738B2 (en) 2017-01-31 2020-04-21 Conduent Business Services, Llc Stance classification of multi-perspective consumer health information
DK201770427A1 (en) 2017-05-12 2018-12-20 Apple Inc. Low-latency intelligent automated assistant
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
CN107609960A (en) * 2017-10-18 2018-01-19 口碑(上海)信息技术有限公司 Rationale for the recommendation generation method and device
CN108153733B (en) * 2017-12-26 2021-07-09 北京星选科技有限公司 Comment quality classification method and device
US10713441B2 (en) * 2018-03-23 2020-07-14 Servicenow, Inc. Hybrid learning system for natural language intent extraction from a dialog utterance
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
KR102460293B1 (en) * 2018-05-18 2022-10-31 이베이 인크. Handling of transaction feedback
CN110738046B (en) * 2018-07-03 2023-06-06 百度在线网络技术(北京)有限公司 Viewpoint extraction method and apparatus
US11164065B2 (en) * 2018-08-24 2021-11-02 Bright Marbles, Inc. Ideation virtual assistant tools
CN109408808B (en) * 2018-09-12 2023-08-22 中国传媒大学 Evaluation method and evaluation system for literature works
US11163958B2 (en) * 2018-09-25 2021-11-02 International Business Machines Corporation Detecting and highlighting insightful comments in a thread of content
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US10885019B2 (en) * 2018-10-17 2021-01-05 International Business Machines Corporation Inter-reviewer conflict resolution
US11138367B2 (en) 2019-02-11 2021-10-05 International Business Machines Corporation Dynamic interaction behavior commentary
US20220245548A1 (en) * 2019-02-23 2022-08-04 Maher G. MEZHER Innovation process
US20200380389A1 (en) * 2019-05-31 2020-12-03 Apple Inc. Sentiment and intent analysis for customizing suggestions using user-specific information
US11227599B2 (en) 2019-06-01 2022-01-18 Apple Inc. Methods and user interfaces for voice-based control of electronic devices
US11487945B2 (en) 2019-07-02 2022-11-01 Servicenow, Inc. Predictive similarity scoring subsystem in a natural language understanding (NLU) framework
US11481554B2 (en) 2019-11-08 2022-10-25 Oracle International Corporation Systems and methods for training and evaluating machine learning models using generalized vocabulary tokens for document processing
CN110929528B (en) * 2019-11-21 2023-09-05 腾讯科技(深圳)有限公司 Method, device, server and storage medium for analyzing emotion of sentence
US11494559B2 (en) * 2019-11-27 2022-11-08 Oracle International Corporation Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
US11507747B2 (en) * 2019-11-27 2022-11-22 Oracle International Corporation Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
US11061543B1 (en) 2020-05-11 2021-07-13 Apple Inc. Providing relevant data items based on context
US11490204B2 (en) 2020-07-20 2022-11-01 Apple Inc. Multi-device audio adjustment coordination
US11438683B2 (en) 2020-07-21 2022-09-06 Apple Inc. User identification using headphones
CN111949757A (en) * 2020-07-29 2020-11-17 深圳市高德信通信股份有限公司 IDC user information arrangement management system
US11556223B2 (en) * 2020-08-27 2023-01-17 Ebay Inc. Automatic feedback system using visual interactions
US20220107973A1 (en) * 2020-10-07 2022-04-07 DropCite Inc. Collaborative annotation and artificial intelligence for discussion, evaluation, and recommendation of research papers
US11789944B2 (en) 2021-04-12 2023-10-17 Microsoft Technology Licensing, Llc User-specific computer interaction recall
CN113127604B (en) * 2021-05-10 2022-10-04 齐鲁工业大学 Comment text-based fine-grained item recommendation method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020095355A1 (en) * 2001-01-18 2002-07-18 Doreen A. Labit Computer-implemented international trade system
US20110093539A1 (en) * 2009-07-30 2011-04-21 Brainbank, Inc. System And Method For Innovation And Idea Management
US20110202334A1 (en) * 2001-03-16 2011-08-18 Meaningful Machines, LLC Knowledge System Method and Apparatus

Family Cites Families (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7533034B2 (en) 1999-07-20 2009-05-12 Brainbank, Inc. Idea management
US6961756B1 (en) 2000-08-16 2005-11-01 Charles Schwab & Co., Inc. Innovation management network
US20030036947A1 (en) 2001-08-20 2003-02-20 Ncr Corporation Systems and methods for submission, development and evaluation of ideas in an organization
US7058567B2 (en) 2001-10-10 2006-06-06 Xerox Corporation Natural language parser
US7369985B2 (en) 2003-02-11 2008-05-06 Fuji Xerox Co., Ltd. System and method for dynamically determining the attitude of an author of a natural language document
US7536310B2 (en) 2003-07-31 2009-05-19 Siemens Aktiengesellschaft Method for managing and providing an idea management system
US7809551B2 (en) * 2005-07-01 2010-10-05 Xerox Corporation Concept matching system
US7703071B2 (en) * 2006-04-13 2010-04-20 International Business Machines Corporation Method for modeling business transformation
US9015194B2 (en) * 2007-07-02 2015-04-21 Verint Systems Inc. Root cause analysis using interactive data categorization
US20090119095A1 (en) * 2007-11-05 2009-05-07 Enhanced Medical Decisions. Inc. Machine Learning Systems and Methods for Improved Natural Language Processing
US8086557B2 (en) 2008-04-22 2011-12-27 Xerox Corporation Method and system for retrieving statements of information sources and associating a factuality assessment to the statements
US8301563B2 (en) * 2008-05-15 2012-10-30 Wells Fargo Bank, N.A. Emerging trends lifecycle management
US8180629B2 (en) * 2008-07-10 2012-05-15 Trigent Softward Ltd. Automatic pattern generation in natural language processing
US8370128B2 (en) * 2008-09-30 2013-02-05 Xerox Corporation Semantically-driven extraction of relations between named entities
US20100153318A1 (en) * 2008-11-19 2010-06-17 Massachusetts Institute Of Technology Methods and systems for automatically summarizing semantic properties from documents with freeform textual annotations
US8572084B2 (en) * 2009-07-28 2013-10-29 Fti Consulting, Inc. System and method for displaying relationships between electronically stored information to provide classification suggestions via nearest neighbor
US8707250B2 (en) * 2009-12-22 2014-04-22 Board Of Regents, The University Of Texas System Automation support for domain modeling
US8745091B2 (en) * 2010-05-18 2014-06-03 Integro, Inc. Electronic document classification
US8656286B2 (en) 2010-11-05 2014-02-18 Palo Alto Research Center Incorporated System and method for providing mixed-initiative curation of information within a shared repository
US8380743B2 (en) 2010-11-05 2013-02-19 Palo Alto Research Center Incorporated System and method for supporting targeted sharing and early curation of information
US20120129145A1 (en) 2010-11-23 2012-05-24 William Cox Miller System for fostering innovation among a group of users
US8532981B2 (en) 2011-03-21 2013-09-10 Xerox Corporation Corpus-based system and method for acquiring polar adjectives
US9176957B2 (en) * 2011-06-10 2015-11-03 Linkedin Corporation Selective fact checking method and system
US8935664B2 (en) * 2011-10-05 2015-01-13 International Business Machines Corporation Method and apparatus to determine rules implementation decision
US8738363B2 (en) * 2011-10-13 2014-05-27 Xerox Corporation System and method for suggestion mining
US20130132284A1 (en) 2011-11-18 2013-05-23 Palo Alto Research Center Incorporated System And Method For Management And Deliberation Of Idea Groups
US20130173254A1 (en) * 2011-12-31 2013-07-04 Farrokh Alemi Sentiment Analyzer
US20130198750A1 (en) * 2012-01-30 2013-08-01 Xerox Corporation Wizard-based system for business process specification
US20130218914A1 (en) * 2012-02-20 2013-08-22 Xerox Corporation System and method for providing recommendations based on information extracted from reviewers' comments
US9275341B2 (en) * 2012-02-29 2016-03-01 New Sapience, Inc. Method and system for machine comprehension
US9135602B2 (en) * 2012-07-25 2015-09-15 E-Plan, Inc. Management of building plan documents utilizing comments and a correction list
US9396179B2 (en) * 2012-08-30 2016-07-19 Xerox Corporation Methods and systems for acquiring user related information using natural language processing techniques
US20140067370A1 (en) * 2012-08-31 2014-03-06 Xerox Corporation Learning opinion-related patterns for contextual and domain-dependent opinion detection
US20140108307A1 (en) * 2012-10-12 2014-04-17 Wipro Limited Methods and systems for providing personalized and context-aware suggestions
US20140278985A1 (en) * 2013-03-13 2014-09-18 DataPop, Inc. Systems and methods for the enhancement of semantic models utilizing unstructured data
US9477755B2 (en) * 2013-04-09 2016-10-25 International Business Machines Corporation Question-related identification of relevant social communities
US20140330832A1 (en) * 2013-05-05 2014-11-06 My Ideal World, Inc. Universal Idea Capture and Value Creation Apparatuses, Methods and Systems
EP3005283A4 (en) * 2013-05-31 2017-02-15 Intel Corporation Online social persona management
US20140365206A1 (en) * 2013-06-06 2014-12-11 Xerox Corporation Method and system for idea spotting in idea-generating social media platforms
US20150010141A1 (en) * 2013-07-03 2015-01-08 Gurbirinder Singh SANDHU System and method for generating natural language comments in a telecommunication system
US9069736B2 (en) * 2013-07-09 2015-06-30 Xerox Corporation Error prediction with partial feedback

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020095355A1 (en) * 2001-01-18 2002-07-18 Doreen A. Labit Computer-implemented international trade system
US20110202334A1 (en) * 2001-03-16 2011-08-18 Meaningful Machines, LLC Knowledge System Method and Apparatus
US20110093539A1 (en) * 2009-07-30 2011-04-21 Brainbank, Inc. System And Method For Innovation And Idea Management

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Jurafsky et al "Speech and Language Processing", Pearson/Prentic Hall 2008, ISBN-13: 978-0-13-187321-6, *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9400779B2 (en) * 2013-06-06 2016-07-26 Xerox Corporation Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms
US20140365207A1 (en) * 2013-06-06 2014-12-11 Xerox Corporation Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms
US10789537B2 (en) 2014-05-20 2020-09-29 Oath Inc. Machine learning and validation of account names, addresses, and/or identifiers
US20150339583A1 (en) * 2014-05-20 2015-11-26 Aol Inc. Machine learning and validation of account names, addresses, and/or identifiers
US9928465B2 (en) * 2014-05-20 2018-03-27 Oath Inc. Machine learning and validation of account names, addresses, and/or identifiers
US11704583B2 (en) 2014-05-20 2023-07-18 Yahoo Assets Llc Machine learning and validation of account names, addresses, and/or identifiers
US20160308816A1 (en) * 2015-04-20 2016-10-20 International Business Machines Corporation Reducing a negative social interaction
US10187346B2 (en) * 2015-04-20 2019-01-22 International Business Machines Corporation Analysis of social interaction sentiment
US10757062B2 (en) 2015-04-20 2020-08-25 International Business Machines Corporation Analysis of social interaction sentiment
US20170109848A1 (en) * 2015-10-20 2017-04-20 International Business Machines Corporation Value Scorer in an Automated Disclosure Assessment System
US10832360B2 (en) * 2015-10-20 2020-11-10 International Business Machines Corporation Value scorer in an automated disclosure assessment system
US10909324B2 (en) * 2018-09-07 2021-02-02 The Florida International University Board Of Trustees Features for classification of stories
US20200081980A1 (en) * 2018-09-07 2020-03-12 The Florida International University Board Of Trustees Features for classification of stories
WO2021029886A1 (en) * 2019-08-14 2021-02-18 Unify Patente Gmbh & Co. Kg Real-time communication and collaboration system and method of monitoring objectives
US20220277733A1 (en) * 2019-08-14 2022-09-01 Unify Patente Gmbh & Co. Kg Real-time communication and collaboration system and method of monitoring objectives to be achieved by a plurality of users collaborating on a real-time communication and collaboration platform
CN113408291A (en) * 2021-07-09 2021-09-17 平安国际智慧城市科技股份有限公司 Training method, device and equipment for Chinese entity recognition model and storage medium

Also Published As

Publication number Publication date
US20140365207A1 (en) 2014-12-11
US9400779B2 (en) 2016-07-26

Similar Documents

Publication Publication Date Title
US9400779B2 (en) Method and system for classifying reviewers' comments and recommending related actions in idea-generating social media platforms
Dashtipour et al. A hybrid Persian sentiment analysis framework: Integrating dependency grammar based rules and deep neural networks
Alaei et al. Sentiment analysis in tourism: capitalizing on big data
Villarroel Ordenes et al. Unveiling what is written in the stars: Analyzing explicit, implicit, and discourse patterns of sentiment in social media
Banks et al. A review of best practice recommendations for text analysis in R (and a user-friendly app)
US9633007B1 (en) Loose term-centric representation for term classification in aspect-based sentiment analysis
Aggarwal Opinion mining and sentiment analysis
US8650023B2 (en) Customer review authoring assistant
Fernandes et al. Bridging the gap: A survey on integrating (human) feedback for natural language generation
US20130218914A1 (en) System and method for providing recommendations based on information extracted from reviewers' comments
US20140067370A1 (en) Learning opinion-related patterns for contextual and domain-dependent opinion detection
Karanikolas et al. Large language models versus natural language understanding and generation
Luangrath et al. Paralanguage classifier (PARA): An algorithm for automatic coding of paralinguistic nonverbal parts of speech in text
Gharibi et al. Influential factors in lexical richness of young heritage speakers’ family language: Iranians in New Zealand
Gao et al. CoAIcoder: Examining the effectiveness of AI-assisted human-to-human collaboration in qualitative analysis
Hussein Analyzing scientific papers based on sentiment analysis
Kim et al. Accurate and prompt answering framework based on customer reviews and question-answer pairs
Sarzynska-Wawer et al. Truth or lie: Exploring the language of deception
Neviarouskaya et al. Intelligent interface for textual attitude analysis
Liao et al. Status, identity, and language: A study of issue discussions in GitHub
Hailu Opinion mining from Amharic blog
Ferrari et al. QuOD: an NLP tool to improve the quality of business process descriptions
Kauttonen et al. Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
Tessore et al. Distant Supervised Construction and Evaluation of a Novel Dataset of Emotion-Tagged Social Media Comments in Spanish
Akerkar et al. Natural language processing

Legal Events

Date Code Title Description
AS Assignment

Owner name: XEROX CORPORATION, CONNECTICUT

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CONVERTINO, GREGORIO;SANDOR, AGNES;REEL/FRAME:030560/0112

Effective date: 20130604

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION