CN112395399A - Specific personality dialogue robot training method based on artificial intelligence - Google Patents

Specific personality dialogue robot training method based on artificial intelligence Download PDF

Info

Publication number
CN112395399A
CN112395399A CN202011270983.2A CN202011270983A CN112395399A CN 112395399 A CN112395399 A CN 112395399A CN 202011270983 A CN202011270983 A CN 202011270983A CN 112395399 A CN112395399 A CN 112395399A
Authority
CN
China
Prior art keywords
personality
virtual robot
training method
language
specific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011270983.2A
Other languages
Chinese (zh)
Inventor
杨帅
卫静雯
田而慷
殷莉
王双文
蒋泓杰
周嘉茗
王玥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan University
Original Assignee
Sichuan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan University filed Critical Sichuan University
Priority to CN202011270983.2A priority Critical patent/CN112395399A/en
Publication of CN112395399A publication Critical patent/CN112395399A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to the field of artificial intelligence, and relates to a training robot system and a training robot method with specific personality and conversation functions. The virtual robot corresponds to 5 factors in the five-big personality factor model: extroversion, concordance, conscientity, nervousness, patency. Based on the psychological sample example, the language sample of each personality mental patient is processed by intelligent natural language, and a language prediction model with a specific personality is established by using a deep learning technology. And then extracting a candidate answer group from a retrieval document set obtained by retrieval according to the user question by adopting an IF-IDF (intermediate frequency-inverse discrete frequency) calculation method, selecting standard answers from the candidate answer group by adopting a correlation similarity calculation method, and feeding the standard answers back to the user, thereby providing a direct and interactive virtual robot with a specific personality for the teaching and simulated training of psychology.

Description

Specific personality dialogue robot training method based on artificial intelligence
Technical Field
The invention belongs to the field of artificial intelligence, and relates to a training robot system and a training robot method with specific personality and conversation functions.
Background
Psychiatry, an important subject in clinical medicine, relates to wide scope, strong specialization and has its own particularity. The diagnosis of psychotic disorders follows mainly the thinking approach of "symptom-syndrome-diagnosis", relying on the identification and comprehensive judgment of psychotic symptoms, without corresponding signs or laboratory evidence, and therefore requiring that the psychiatrist must have a solid theoretical basis and clinical skills, as well as a rich practical experience. Among them, the cultivation of mental skills is important. The mental examination is to sample and evaluate information of mental disorder individuals by a psychiatrist through language communication and observation, and is divided into free conversation and inquiry, wherein the communication capacity is the key. However, domestic common medical students and some psychiatrists rarely pay attention to training of communication skills, and meanwhile, the aspects of privacy, social influence, personal safety and the like of patients are considered, so that some difficulties exist in practice teaching, and the opportunity of direct communication with the patients is low. In addition, the intelligent social robot in China starts late, although some intelligent social robots have voice and face recognition capabilities and simple emotional modalities, most of the current conversation functions are limited to a certain topic field, and the intelligent social robot does not have anthropomorphic personality under a specific scene, and has little effect on clinical teaching and training.
The "big five" personality factor model, namely the basic structure of the personality, is composed of the five big factors, and is called as "big five". In the last 10 years, the study of five-factor models has made dramatic progress, with stability verified in a large number of studies from both old specials questionnaires and other person evaluations, various samples of lexical studies and questionnaire measurements, as well as different cultural backgrounds and different analytical methods, which have been considered by many psychologists as the best paradigm of personality architecture. According to McCrae & Costa et al (1985), the five major factors that make up the personality are respectively, extroversion, concordance, conscientiveness, nervousness, patency. While extroversion represents enthusiasm, confidence, vitality, and also has the characteristics of happiness and sociality, these manifestations of the insiders are not prominent, but not equal to self-centering and lack of energy. The concordance indicates that he is friendly, lovely and lovely, people with high scores are willing to assist people, reliable and rich in the same mood, and pay attention to cooperation without emphasizing competition. Conscientiousness represents restriction and stringency, and is related to achievements motivation and organizational planning, also known as "achievements will" or "working" latitude. Those who feel worries frequently and whose emotions are easy to fluctuate get high scores in measurement of the nervousness, and the low scores of people mostly show calm self-adaptation and are not easy to have extreme and bad emotional reactions. Studies have shown that the "five-factor" model is valuable for diagnosing clinical disorders and treating psychological diseases, as well as being useful for predicting and identifying healthy behaviors and problems.
Disclosure of Invention
The invention aims to provide a more accurate and intelligent interactive virtual robot for psychological discipline teaching. The technical scheme of the invention is as follows:
an intelligent question-answering system with a specific personality, comprising:
the database establishing module is used for establishing a psychological domain database, and a large number of mental patient language example samples are stored in the psychological domain database;
the problem acquisition module is used for acquiring user problems;
the question processing module is used for carrying out natural language processing on the user question to obtain query expression about the user question;
the candidate answer extraction module is used for calculating question-answer pair similarity by adopting an IF-IDF calculation method based on the retrieval document set;
and the standard answer obtaining module extracts the most relevant answer according to the candidate answer, and the answer feedback module feeds the standard answer back to the user.
The question processing module comprises a word segmentation sub-module, a word segmentation sub-module and a question processing sub-module, wherein the word segmentation sub-module is used for segmenting a user question and obtaining a group of terms of the user question;
each term in the term set is subjected to named entity recognition, semantic disambiguation and synonym expansion to obtain a term set;
and judging that the user question is complete according to the intention analysis result, if so, combining the terms in the processing term set into a query expression, and if not, requesting the user to supplement the user question, and supplementing the user question after the user question is supplemented.
The standard answer obtaining module optionally comprises a candidate answer correlation calculating submodule for calculating the occurrence frequency of each noun phrase in the candidate answers in the medical field database, obtaining the correlation of each noun phrase in the candidate answer set, and calculating the occurrence frequency of the noun phrase in the query expression in the medical field database by using the query expression correlation calculating submodule to obtain the correlation expressed in the query;
calculating the similarity of the candidate answer relevance and the relevance represented in the query by using a calculation sub-module; candidate answers greater than the second set of similarity values are selected and combined into a standard answer to the user's question.
According to the specific embodiment of the invention, the invention discloses the following technical effects:
the invention discloses a virtual robot training method and a virtual robot training system, wherein based on a psychological sample example, an IF-IDF calculation method is adopted to extract a candidate answer group from a retrieval document set obtained by retrieval according to user problems, a correlation similarity calculation method is adopted to select standard answers from the candidate answer group, and the standard answers are fed back to a user, so that a direct and interactive virtual robot with a specific personality is provided for the teaching of psychology.
Drawings
FIG. 1: a component module of an intelligent question-answering system with a specific personality.
FIG. 2: a training method and a process for a virtual robot are provided.
Detailed Description
S1: a receiving module receives a problem sent by a client through an HTTP request, wherein the problem is described by characters;
s2: the preprocessing module removes special symbols, punctuation marks and stop words in the problem to obtain a processed problem q;
s3: the intention module classifies the question q through a question intention classification model based on BERT to obtain the category Cq of the input question;
s4: the retrieval module firstly refers to a domain dictionary to divide an input question q into w1 and w2 … wi … wn, wherein wi is the ith word which is divided, and a query formula of (w 1U w 2U … U … wn) n Cq is adopted to retrieve a corresponding question and a corresponding answer from a knowledge base (the knowledge base is used for storing a problem set and a corresponding answer set in the domain, and the problem set and the corresponding answer set in the domain are stored in the knowledge base in an index establishing mode), so that a candidate question set and a corresponding answer set are obtained;
s5: the sequencing module sequences the candidate question set Q and the corresponding answer set A by adopting a TF-IDF statistical method to obtain a set Score corresponding to Q and A, wherein a Score threshold Scoremin is set, if Scorein is greater than Scoremin, corresponding Qi and Ai are stored, and the rest question sets and the corresponding answer sets are RQ and RA; s6: the matching algorithm module takes out the problems in the remaining problem set RQ, establishes a matching pair (RQI, q) with the input problem q, inputs the matching pair into a problem similarity matching model based on BERT to obtain a matching probability pi, obtains the probability maximum value pmax of the problems in the RQ through statistics, sets a probability threshold pmin, determines that the RQI is the problem required by the symbol if pmax > pmin, otherwise, does not have the problems meeting the requirements in the remaining problem set; s7: and the Answer configuration module outputs RAi corresponding to RQI or a Special Answer which does not meet the requirement to the client.
S31: and collecting related problem sets Q and corresponding classification sets C in the field to obtain one-to-one corresponding Qi and Ci.

Claims (4)

1. A virtual robot training method and a system are characterized by comprising the following steps: generating a virtual robot that simulates a particular personality of mental patient, the virtual robot corresponding to 5 factors in a "five-large" personality factor model: extroversion, concordance, conscientity, nervousness, patency.
2. The virtual robot training method and system according to claim 1, further comprising: based on the example, the intelligent natural language processing is carried out on the language sample of each personality mental patient, a language prediction model with a specific personality is built by using a deep learning technology, and an intelligent dialogue system with the specific personality is built according to training.
3. The virtual robot training method and system according to claim 1, wherein quantitative clustering is performed on the personality traits of each psychiatric patient's sample.
4. The virtual robot training method and system according to claim 2, wherein: the probability of each answer is selected for each mental patient for each personality forecast, and the dialog of the maximum probability is output.
CN202011270983.2A 2020-11-13 2020-11-13 Specific personality dialogue robot training method based on artificial intelligence Pending CN112395399A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011270983.2A CN112395399A (en) 2020-11-13 2020-11-13 Specific personality dialogue robot training method based on artificial intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011270983.2A CN112395399A (en) 2020-11-13 2020-11-13 Specific personality dialogue robot training method based on artificial intelligence

Publications (1)

Publication Number Publication Date
CN112395399A true CN112395399A (en) 2021-02-23

Family

ID=74600308

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011270983.2A Pending CN112395399A (en) 2020-11-13 2020-11-13 Specific personality dialogue robot training method based on artificial intelligence

Country Status (1)

Country Link
CN (1) CN112395399A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108804698A (en) * 2018-03-30 2018-11-13 深圳狗尾草智能科技有限公司 Man-machine interaction method, system, medium based on personage IP and equipment
CN109416701A (en) * 2016-04-26 2019-03-01 泰康机器人公司 The robot of a variety of interactive personalities
CN110019698A (en) * 2017-09-04 2019-07-16 珠海健康云科技有限公司 A kind of intelligent Service method and system of medicine question and answer
CN110188177A (en) * 2019-05-28 2019-08-30 北京搜狗科技发展有限公司 Talk with generation method and device
CN111339280A (en) * 2020-03-23 2020-06-26 上海奔影网络科技有限公司 Question and answer sentence processing method, device, equipment and storage medium
CN111723898A (en) * 2020-05-25 2020-09-29 成都时空穿梭智能科技有限公司 Intelligent robot for simulating human

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109416701A (en) * 2016-04-26 2019-03-01 泰康机器人公司 The robot of a variety of interactive personalities
CN110019698A (en) * 2017-09-04 2019-07-16 珠海健康云科技有限公司 A kind of intelligent Service method and system of medicine question and answer
CN108804698A (en) * 2018-03-30 2018-11-13 深圳狗尾草智能科技有限公司 Man-machine interaction method, system, medium based on personage IP and equipment
CN110188177A (en) * 2019-05-28 2019-08-30 北京搜狗科技发展有限公司 Talk with generation method and device
CN111339280A (en) * 2020-03-23 2020-06-26 上海奔影网络科技有限公司 Question and answer sentence processing method, device, equipment and storage medium
CN111723898A (en) * 2020-05-25 2020-09-29 成都时空穿梭智能科技有限公司 Intelligent robot for simulating human

Similar Documents

Publication Publication Date Title
CN109670179B (en) Medical record text named entity identification method based on iterative expansion convolutional neural network
CN112863630A (en) Personalized accurate medical question-answering system based on data and knowledge
CN108491486B (en) Method, device, terminal equipment and storage medium for simulating patient inquiry dialogue
Barchard et al. Computer scoring of the levels of emotional awareness scale
CN107403068A (en) Merge the intelligence auxiliary way of inquisition and system of clinical thinking
CN112667799B (en) Medical question-answering system construction method based on language model and entity matching
CN113569023A (en) Chinese medicine question-answering system and method based on knowledge graph
CN113724882A (en) Method, apparatus, device and medium for constructing user portrait based on inquiry session
CN113571184A (en) Dialogue interaction design method and system for mental health assessment
CN113590783A (en) Traditional Chinese medicine health-preserving intelligent question-answering system based on NLP natural language processing
CN117216209A (en) Ultrasonic examination report reading system based on large language model
Alazraki et al. An empathetic AI coach for self-attachment therapy
Yiran Evaluation of students’ IELTS writing ability based on machine learning and neural network algorithm
WO2022174161A1 (en) Systems and methods for psychotherapy using artificial intelligence
Cuffy et al. Measuring the quality of patient–physician communication
CN112380231A (en) Training robot system and method with depressive disorder characteristics
CN113254609A (en) Question-answering model integration method based on negative sample diversity
Wang et al. Research on named entity recognition of doctor-patient question answering community based on bilstm-crf model
CN117454217A (en) Deep ensemble learning-based depression emotion recognition method, device and system
CN116959754A (en) Feature extraction method of structured interview recording transcribed text based on intention slots
CN116168844A (en) Medical data processing system based on big data analysis
CN112395399A (en) Specific personality dialogue robot training method based on artificial intelligence
Ma et al. Medical answer selection based on two attention mechanisms with birnn
CN112380329A (en) Training robot system and method under fine positive symptom background
CN112380330A (en) Training robot system and method under background of fine yin syndrome

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210223