CA2715777A1 - Method and system to generate mapping among a question and content with relevant answer - Google Patents

Method and system to generate mapping among a question and content with relevant answer Download PDF

Info

Publication number
CA2715777A1
CA2715777A1 CA2715777A CA2715777A CA2715777A1 CA 2715777 A1 CA2715777 A1 CA 2715777A1 CA 2715777 A CA2715777 A CA 2715777A CA 2715777 A CA2715777 A CA 2715777A CA 2715777 A1 CA2715777 A1 CA 2715777A1
Authority
CA
Canada
Prior art keywords
content
question
standard question
mapping
standard
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA2715777A
Other languages
French (fr)
Other versions
CA2715777C (en
Inventor
Mirza Muhammad Ajmal Beg
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CA2715777A priority Critical patent/CA2715777C/en
Publication of CA2715777A1 publication Critical patent/CA2715777A1/en
Application granted granted Critical
Publication of CA2715777C publication Critical patent/CA2715777C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri

Abstract

A method and system for generating mapping among a standard question and a content containing an answer is provided. The method comprising the steps of: receiving content from an author of the content, analyzing the received content, producing suggestions for standard questions and transmitting the suggested standard questions to the author of the content producer for approval, determining a number of approved standard questions for which the said received content provides answer for generating mapping between the said at least one approved standard question and the said content if the number of approved standard questions is more than one, or initiating an approval process for generating a new standard question related to the content if the number of approved questions is zero.

Description

METHOD AND SYSTEM TO GENERATE MAPPING AMONG A QUESTION AND
CONTENT WITH RELEVANT ANSWER

TECHNICAL FIELD
This invention relates to a method and system to generate mapping among a question and content with relevant answer, and a machine readable medium containing instructions to perform such method.

BACKGROUND
Search system typically matches a user's question to contents available on the network that may contain answer to the user's question. A search system contain a collection of standard questions and a mapping from standard questions to contents with relevant answers. Such search system allows user select a standard question by suggesting standard question(s) based on words forming the submitted question. Mapping from standard questions to contents with relevant answers is produced by converting a standard question into relevant keywords, using these relevant keywords to search relevant contents available on the network, ranking the found relevant contents on the network and selecting the top ranked contents as contents with answers.
Containing the same relevant keywords in content on the network and in the submitted question, does not mean that the content on the network always satisfies the need of a user. For instance, a user selects a standard question "How to make a birthday cake" to a search system. The search system returns a reference to the content with title "How to make a birthday cake" as the most relevant page. However, the content that this reference refers to, is a detailed introduction of 10 books about baking cakes. After going through significant number of contents, the user discovers a content which details recipes of delicious birthday cakes by a not well-known content producer. The content producer wanted to share his cake recipes for free for user connected to the network. This content did not contain the phrase "How to make a birthday cake" and word "recipes" in the page title or in the content itself.
The search system did not rank this content as the top, although it contained the most relevant answer to the user's question. Thus, there is a need for a system and method which generate a mapping between a standard question and content containing the answers to standard question in an effective way.

SUMMARY OF THE INVENTION
A method and system for generating mapping among a standard question and content includes, receiving content, analyzing the received content, determining at least one standard question for which the said received content provides answer for, transferring the said at least one standard question to a client for approval, receiving approval from the said client that the said received content answers the said at least one standard question, generating mapping between the said at least one standard question and a reference to the said content.
Further, a method and system for approving a proposal of a new standard question related to a content, includes; receiving content, analyzing the received content, determining that the said received content does not provide answer for any of the already defined standard questions, generating a proposal for a new standard question and transferring it to a client, receiving acceptance data from the client, receiving any new proposal of the standard question from the client, storing the generated proposal for a new standard question and any new proposal of the new standard question received from the client for approval.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings together with the description, serve to explain the principles of the invention.
FIG. 1 is a diagram conceptually illustrating an exemplary search system which prepares a document with answers/search results responsive to a question/search query.
FIG. 2 is a exemplary flow chart illustrating operation of an exemplary search system which prepares a document with references to answers to user submitted questions using mapping.
FIG. 3 is a exemplary flow chart illustrating operation of an exemplary mapping system 500 which maps references to contents containing answers and a question according to present invention.
FIG. 4 illustrates a block diagram of an exemplary computer system on which the computer instructions can be executed to implement the methods of the present invention.
FIG. 5 illustrates an exemplary client-server environments which shows mapping server according to present invention.
FIG. 6 illustrates an exemplary data model for standard questions stored in mapping database/OS level files according to the present invention.
FIG. 7 illustrates an exemplary mapping management module according to the present invention.
FIG. 8 illustrates an exemplary code which is included in the header of the content by using metadata tags.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
FIG. 1 is a diagram conceptually illustrating an exemplary search system 100 which
2 prepares a document with answers/search results responsive to a question/search query. The search system 100 uses database/OS level files 101 for storing and retrieving information. The search system 100 consists of multiple modules 102 103 104 105 106. Question query parsing module 102 parses the question/search query received from the client and transforms it into a search query which can be executed to retrieve information from database/OS level files 101. Query execution module 103 executes the search query and manages the result of the executed search query.
Indexing module 104 indexes contents available on the network. Ranking module 105 ranks the indexed contents. The ranking module 105 may use different algorithms to rank the contents.
Presentation module 106 prepares a document containing the search result and the advertisement FIG. 2 is a flow chart illustrating operation of an exemplary search system which prepares a document with references to contents containing answers responsive to a question. The operation starts when the search system receives a question/search query from the client (act 201). The search system parses the question/search query into keyword(s)/phrase(s) (act 202).
The search system searches the database/OS level files 101 for already defined standard questions (act 203). In case, an already defined standard question exists (act 204), the search system suggests the standard question for selection to the client (act 205). In case, the suggested standard question is selected at the client, the search system finds references to contents containing standard answers using an index based mapping from standard questions to contents with relevant answers mapping (act 206 act 207). In case, the end user does not select the suggested standard question, the search system finds relevant references to the contents using relevant keywords and already defined indexes (act 206 act 208).
The search system ranks the references to the contents (act 209). The search system prepares a document containing the ranked references to the contents (act 210) and sends the prepared document to the client (act 211).
FIG. 3 is a exemplary flow chart illustrating operation of an exemplary mapping system 500 which maps references to contents containing answers and a question according to present invention. FIG. 5 illustrates an exemplary mapping system 500 in details. The mapping system 500 receives a reference such as a URL for content or it receives content itself from the client (act 301).
The mapping system 500 analyzes the content (act 302) and produces list of keyword(s)/phrase(s) relevant to the content (act 303). Based on the list of keyword(s)/phrase(s), the mapping system 500 produces standard question suggestion(s) (act 304). The mapping system 500 sends standard question suggestion(s) to the client (act 305). In case, the mapping system 500 receives an approval for standard question suggestion(s), the mapping system 500 generates mapping between standard question suggestion(s) and the reference to the content (act 306 and act 307).
This generated mapping is transferred to the search system 100 506 and stored in the database/OS level files of the
3 search system 101 (act 307). The mapping system 500 also generate code that can be added to the content and is used to identify the standard answer the content provides (act 308). In case, the mapping system 500 generates proposal for a new standard question which is not already a standard question and sends it to the client for approval (act 306 and act 309). In case, the mapping system 500 receives an approval for proposed new question, the mapping system 500 stores the proposal for new question for approval process (act 310 and act 311). In case, the mapping system 500 does not receive an approval for proposed new question, the mapping system 500 receives a proposal for new question from the client (act 312). The proposal for new question from the client is stored in the database/OS level files of the mapping system 500 for approval process (act 311). After approval process is complete, the newly approved question is available as question suggestion at act 304.
FIG. 4 shows a block diagram of an example computer system 400 on which the computer instructions can be executed to implement the methods of the present invention. Computing devices such as laptop, desktop, server, cluster, virtual machine, mainframe, wireless telephone, wired telephone, a personal digital assistance are examples of such computer system 400. Processor 401 can be microprocessor, digital signal processor, a conventional processor, micro-controller, virtual machine or any logic that executes computer instructions. Memory 402 can be any kind of memory device such as read only memory (ROM), random access memory (RAM) and flash memory.
Storage device 403 can be any medium which can be used as persistence storage.
For example, hard drive, tape drive, optical disk drive and disk arrays. Input device 404 is used to input external data and can be any kind of device such as mouse, trackball, light pen, biometric mechanism including voice recognition. Output device 405 can be any kind of device used for data output. For example, cathode-ray-tube (CRT) monitor, plasma display, crystal display, projector, printer and speaker.
Communication interface 406 can be a interface to any kind of network such as internet, intranet, local area network, wide area network, a telephone network such as Public Switched Telephone Network, or combination of different kinds of networks. Bus 407 is used to communicate information and commands between different sub systems 401 402 403 404 405 406. Bus 407 may be implemented using any kind of data transfer technology such as USB, serial connection, parallel connection and circuit board connections. In some embodiments, the storage device 403 may connect to the example computer system 400 using communication interface 406.
FIG. 5 illustrates an exemplary client-server environment which maps a standard question to contents containing answers according to present invention. The standard questions are managed by mapping system 500 which stores standard questions in the mapping database/OS
level files 501.
The mapping system 500 contains mapping management module 502 and mapping query
4 processing module 503 to process information stored in the mapping database/OS
level files 501 .
The mapping management module 502 manages the data in the mapping database/OS
level files 501. The mapping query processing module 503 processes the queries. The mapping management module 502 and the mapping query processing module 503 implements the flow chart in FIG. 3.
The mapping management client 504 connects to the mapping system 500 through network 505.
The mapping management client 504 manages the mapping database/OS level files 501 and is mainly used by the mapping system 500 system administrators. Client/server of content provider 507, client/server of content creator 508, client/server of other users 509 and search systems 506 connect with the mapping system 500 through network 505 and exchanges information.
FIG. 6 illustrates an exemplary data model 600 for standard questions stored in mapping database/OS level files 501 according to the present invention. Question IDs 601 is a collection of unique question IDs and their metadata. Question full expressions 602 is a collection of different questions looking for the same information. At least one question full expression in the Question full expressions 602 is associated with a question ID in Question IDs 601. For instance: expressions "What is autism?", and "Tell me something about autism!" and "Do you know anything about autism?" are three question full expressions associated with the same question ID in Question IDs 601. A question full expression stored in the Question full expressions 602 is a collection of question expression parts stored in Question expression parts 603. For instance; the a question full expression "Tell me something about autism!" is a collection of two question expression parts: "Tell me something about" and "autism". The question expression parts importance 604 stores the importance of question expression parts stored in Question expression parts 603. For example:
question expression part "autism" has higher importance than question expression part "Tell me something about". Relevant domains 605 contains the name of the domain of a question id stored in Question IDs 601 or a question full expression stored in Question full expressions 602. For example: the question full expression "Tell me something about autism!"
belongs to relevant domain "medical". A relevant domain stored in Relevant domain 605 can have hierarchical structure. For example: relevant domain 605 "medical: mental health" refers to a parent-child relationship between relevant domain "medical" and relevant domain "mental health". Sets of keywords/phrases 606 are keywords/phrases that are associated with a question ID stored in Question IDs 601 or a question full expression stored in Question full expression 602. For example: "definition, autism" is a set of keywords that is associated with a question full expression "Tell me something about autism!". Keyword/phrase importance 607 stores the importance information for a set of keywords/phrases stored in Set of keywords/phrases 606. For example: for a set of keywords "definition, autism" both keywords "definition" and "autism"
have high
5 importance. Matching rules 608 contains rules that can be used to match and rank standard questions in the mapping database/OS level files 501 with a question received through the client.
The user of the mapping system 500 searches the data model 600 for standard questions using a graphical user interface.
FIG. 7 illustrates an exemplary mapping management module 502 700 according to the present invention. Access control logic 701 defines and control the access to data in the mapping database/OS level files 501. Change approval logic 702 approves the review and approval process of proposal for new questions. Modification logic 703 controls the modification to data in the mapping database/OS level files 501. Life cycle management logic 704 controls the life cycle of the standard questions in the mapping database/OS level files 501. Trend analysis logic 705 analyzes the usage of the already defined standard questions and newly created standard questions in the mapping database/OS level files 501. Payment logic 706 manages the cost of using the mapping system 500. Interface management logic 707 manages the interface with the external clients 504 506 507 508.
FIG. 8 illustrates an exemplary code 800 which is included in the header of the content by using metadata tag. The mapping system 500 may include this exemplary code 800 in the content.
Metadata SEARCH ENGINE NAME -1 801 indicates the search systems for which the metadata is intended for. content="* -C" means that the metadata is intended for all search system except the search system C. Metadata SEARCH ENGINE QUESTION ID_1 802 indicates this contents provide answers for questions ABCD123456 and EFGH789012. The answer to standard question ABCD 123456 is located at anchor ANCHOR2, while answer to standard question EFGH789012 is located at ANCHOR2. "::" is used as a separate. _1 at the end of meta name "SEARCH ENGINE NAME 1" 801 and "SEARCH ENGINE QUESTION ID 1" 802 indicates that these metadata belongs to group 1 of metadata. The question IDs 601 such as ABCD 123456 and EFGH789012 are generated at act 308, while the content creator/provider has supplemented the the anchor information to the meta tag. In this example, meta tag is used to add standard question information to add to the content to allow a search system determine what is the question this content provides answer for. When multiple contents provide the answer to the same questions, the search system uses the ranking module to rank the contents. However, in other embodiments, standard question information may be associated to the content in different format. Such association may be located at the same local location as the content or located at another network location.
It is to be understood that while the detailed description describes the present invention, the foregoing description is for illustrative purpose and does not limit the scope of the present invention which is defined by the scope of the appended claims. Other embodiments, arrangements and
6 equivalents will be evident to those skilled in the art. Such other embodiments, arrangements and equivalents are within the scope of the present invention as defined by the appended claims.
7

Claims (15)

What Is Claimed Is:
1. A method for generating mapping among a standard question and a content, comprising:
a. receiving content b. analyzing the received content c. determining at least one standard question for which the said received content provides answer for d. transferring the said at least one standard question to a client for approval e. receiving approval from the said client that the said received content answers the said at least one standard question f. generating mapping between the said at least one standard question and a reference to the said content
2. A method according to claim 1, characterized in that the said generated mapping between the said at least one standard question and a reference to the said content is transferred to at least one search system.
3. A method according to claim 2, characterized in that the said transferred mapping is used in preparing a document with search result by the said at least one search engine.
4. A method according to claim 1, characterized in that the said generated mapping between the said at least one standard question and the said content is stored on a network location not belonging to a search system.
5. A method according to claim 2, characterized in that the said stored generated mapping on a network location not belonging to a search system, is accessed and analyzed by a search system.
6. A machine-readable medium storing sequence of instructions which can be executed by a computing device and perform the method of claim 1.
7. A system for generating mapping among a standard question and a content, comprising:
a. first means for receiving content b. second means for analyzing the received content c. third means for determining at least one standard question for which the said received content provides answer for d. fourth means for transferring the said at least one standard question to a client for approval e. fifth means for receiving approval from the said client that the said received content answers the said at least one standard question f. sixth means for generating mapping between the said at least one standard question and a reference to the said content
8. A system according to claim 1, wherein in the said generated mapping between the said at least one standard question and a reference to the said content is transferred to at least one search system.
9. A system according to claim 2, wherein the said transferred mapping is used in preparing a document with search result by the said at least one search engine.
10. A system according to claim 1, wherein the said generated mapping between the said at least one standard question and the said content is stored on a network location not belonging to a search system.
11. A system according to claim 2, wherein the said stored generated mapping on a network location not belonging to a search system, is accessed and analyzed by a search system.
12. A method for approving a proposal of a new standard question related to a content, comprising:
a. receiving content b. analyzing the received content c. determining that the said received content does not provide answer for any of the already defined standard questions d. generating a proposal for a new standard question and transferring it to a client e. receiving acceptance data from the client f. receiving any new proposal of the standard question from the client g. storing the generated proposal for a new standard question and any new proposal of the new standard question received from the client for approval.

13. A method according to claim 12, characterized in that the said proposed new standard question is after approval added to the standard questions which are used to generate mapping between a content and standard question.
13. A machine-readable medium storing sequence of instructions which can be executed by a computing device and perform the method of claim 12
14. A system for approving a proposal of a new standard question related to a content, comprising:
a. first means for receiving content b. second means for analyzing the received content c. third means for determining that the said received content does not provide answer for any of the already defined standard questions d. fourth means for generating a proposal for a new standard question and transferring it to a client e. fifth means for receiving acceptance data from the client f. sixth means for receiving any new proposal of the standard question from the client g. seventh storing the generated proposal for a new standard question and any new proposal of the new standard question received from the client for approval
15. A system according to claim 14, wherein the said proposed new standard question is after approval added to the standard questions which are used to generate mapping between a content and standard question.
CA2715777A 2010-09-27 2010-09-27 Method and system to generate mapping among a question and content with relevant answer Active CA2715777C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA2715777A CA2715777C (en) 2010-09-27 2010-09-27 Method and system to generate mapping among a question and content with relevant answer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CA2715777A CA2715777C (en) 2010-09-27 2010-09-27 Method and system to generate mapping among a question and content with relevant answer

Publications (2)

Publication Number Publication Date
CA2715777A1 true CA2715777A1 (en) 2012-03-27
CA2715777C CA2715777C (en) 2023-08-08

Family

ID=45893820

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2715777A Active CA2715777C (en) 2010-09-27 2010-09-27 Method and system to generate mapping among a question and content with relevant answer

Country Status (1)

Country Link
CA (1) CA2715777C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10861022B2 (en) 2019-03-25 2020-12-08 Fmr Llc Computer systems and methods to discover questions and answers from conversations

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10861022B2 (en) 2019-03-25 2020-12-08 Fmr Llc Computer systems and methods to discover questions and answers from conversations

Also Published As

Publication number Publication date
CA2715777C (en) 2023-08-08

Similar Documents

Publication Publication Date Title
US20230205828A1 (en) Related entities
US9053115B1 (en) Query image search
JP5281405B2 (en) Selecting high-quality reviews for display
US8180754B1 (en) Semantic neural network for aggregating query searches
US9684713B2 (en) Methods and systems for retrieval of experts based on user customizable search and ranking parameters
JP6014725B2 (en) Retrieval and information providing method and system for single / multi-sentence natural language queries
US20070043761A1 (en) Semantic discovery engine
US20120059838A1 (en) Providing entity-specific content in response to a search query
RU2718435C2 (en) Computer-executable method and system for searching in inverted index having plurality of wordpositions lists
AU2009257386A1 (en) Searching using patterns of usage
TW200842624A (en) Federated search implemented across multiple search engines
US20120016863A1 (en) Enriching metadata of categorized documents for search
US9916384B2 (en) Related entities
Dong et al. A survey in semantic search technologies
EP2192503A1 (en) Optimised tag based searching
CN109952571B (en) Context-based image search results
Wolfram The symbiotic relationship between information retrieval and informetrics
US11481454B2 (en) Search engine results for low-frequency queries
US9773035B1 (en) System and method for an annotation search index
CA2715777C (en) Method and system to generate mapping among a question and content with relevant answer
US20230237103A1 (en) Self-improving system for searching cross-lingual and multi-media data
CN111488506B (en) Method, device, equipment and storage medium for processing resource information
van Zwol et al. Multimedia information retrieval: " new challenges in audio visual search"
Natarajan Query Expansion For Handling Exploratory And Ambiguous Keyword Queries
WO2001065412A2 (en) Automatically determining a response to an inquiry using structured information

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908

EEER Examination request

Effective date: 20150908