WO2023017690A1 - Information processing device, information processing method, and information processing program - Google Patents

Information processing device, information processing method, and information processing program Download PDF

Info

Publication number
WO2023017690A1
WO2023017690A1 PCT/JP2022/025966 JP2022025966W WO2023017690A1 WO 2023017690 A1 WO2023017690 A1 WO 2023017690A1 JP 2022025966 W JP2022025966 W JP 2022025966W WO 2023017690 A1 WO2023017690 A1 WO 2023017690A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
unit
information
sentence
information processing
Prior art date
Application number
PCT/JP2022/025966
Other languages
French (fr)
Japanese (ja)
Inventor
勇也 西村
Original Assignee
株式会社エッセンス
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社エッセンス filed Critical 株式会社エッセンス
Publication of WO2023017690A1 publication Critical patent/WO2023017690A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems

Definitions

  • the present invention relates to an information processing device, an information processing method, and an information processing program.
  • the document search device described in Patent Document 1 recommends patent documents depending on whether or not the search result based on the search keyword has been viewed.
  • literature includes not only patent literature but also various other literature.
  • the user may wish to read a document that suits his or her taste, and the document retrieval device described in Patent Document 1 cannot set the search keyword according to the user's preference.
  • recommendations according to the user's preferences cannot be made.
  • the present invention provides an information processing device, an information processing method, and an information processing program capable of grouping users to be used for recommending sentences.
  • An information processing apparatus includes a text, a specific position where a user attaches an index to a predetermined unit when the text is divided into predetermined units, and a user classification estimated based on the text and the specific position.
  • a first acquisition unit for acquiring user information about a specific position marked by the first user when the first user reads a sentence; an estimation unit for estimating a classification corresponding to a specific position of the first user based on the acquired user information and the corresponding information stored in the storage unit; and a plurality of first users for each classification estimated by the estimation unit.
  • a group setting unit that performs grouping.
  • the first user when the first user reads a text, user information about a specific position marked by the first user is acquired, and based on the user information and the corresponding information, the specific position of the first user is displayed. Since the corresponding classification is estimated and a plurality of first users are grouped for each classification, it is possible to group users to be used for recommending sentences.
  • FIG. 1 is a diagram for explaining an information processing device according to an embodiment
  • FIG. 1 is a block diagram for explaining an information processing device according to an embodiment
  • FIG. FIG. 4 is a diagram showing the relationship between visible variable vectors and hidden variable vectors
  • 4 is a first flowchart for explaining an information processing method according to one embodiment
  • 7 is a second flowchart for explaining the information processing method according to one embodiment
  • 9 is a third flowchart for explaining the information processing method according to one embodiment
  • FIG. 1 is a diagram for explaining an information processing device 1 according to one embodiment.
  • the information processing device 1 may be configured as a group forming device for grouping users to be used for recommending texts, or may be configured as a text recommendation device for recommending texts to users in a group.
  • the information processing device 1 may be, for example, a computer such as a server, laptop, desktop, tablet, and smart phone.
  • the information processing apparatus 1 classifies users into one of a plurality of groups, and recommends sentences read by users in one group to other users in the group.
  • the information processing device 1 first classifies the users into groups based on the correspondence information.
  • the correspondence information is estimated based on, for example, the text 201, the specific position where the user attaches an index to the predetermined unit when the text 201 is divided into predetermined units, and the text 201 and the specific position. It may also be information that associates a user classification (for example, a classification for grouping users).
  • the corresponding information is a sentence 201 and a sentence 201 divided into clauses (predetermined units) for each punctuation mark.
  • It may be information that associates (specific position) with a group classification when users having the same (substantially the same or similar) tendency are grouped based on the positions of the text 201 and the bookmark 202 .
  • the correspondence information may be information about groups A and B classified according to the sentence 201 and the position of the bookmark 202 .
  • Correspondence information may be stored in the storage unit 21 .
  • the User information about the position (specific position) of the bookmark 102 is acquired.
  • the information processing device 1 classifies the first users into groups, for example, based on the correspondence information and the user information. That is, based on the correspondence information and the user information, the information processing apparatus 1 groups a plurality of first users who have the same (substantially the same or similar) tendency with respect to the portion of the text 101 that interests them. A plurality of first users are classified into the same group. As a more specific example, as illustrated in FIG.
  • the information processing apparatus 1 acquires user information bookmarked 102 by first users C, D, and E after reading sentences 101 .
  • the text 101 and the specific position of the bookmark 102 as the user information of the first users C, D, and E correspond to the text 101 and the bookmark 102 in any of the groups A and B as the correspondence information. It is determined whether the specific position of corresponds. That is, for example, the information processing apparatus 1 determines whether the specific positions of the text 101 and the bookmark 102 of the user information are the same (substantially the same or similar) as the specific positions of the text 201 and the bookmark 202 of the corresponding information. Classify C, D and E into either group A or B. In the case illustrated in FIG.
  • the information processing device 1 estimates that the first users C and D belong to the same group, and the first user E belongs to another group. In the case illustrated in FIG. 1, the information processing apparatus 1 detects, for example, a text read by the first user C who belongs to the same group, and if there is a text not read by the first user D, the text read by the first user C is recommended to the first user D.
  • FIG. 2 is a block diagram for explaining the information processing device 1 according to one embodiment.
  • the information processing device 1 includes, for example, a storage unit 21, a communication unit 22, a display unit 23, a control unit 10, and the like.
  • the control unit 10 includes, for example, a first acquisition unit 11, an estimation unit 12, a group setting unit 13, a second acquisition unit 14, a specification unit 15, a recommendation unit 16, and the like.
  • the control unit 10 may be configured by, for example, an arithmetic processing unit of the information processing device 1 or the like. For example, by appropriately reading and executing various programs stored in the storage unit 21 or the like, the control unit 10 controls each unit (for example, the first acquisition unit 11, the estimation unit 12, the group setting unit 13, the second acquisition unit 14). , the identifying unit 15, the recommending unit 16, etc.).
  • the storage unit 21 may store various information and programs, for example. Examples of the storage unit 21 may be a memory, a solid state drive, a hard disk drive, and the like. The storage unit 21 may be, for example, inside the information processing device 1 or outside the information processing device 1 .
  • the storage unit 21 associates a text with a specific position where the user marks a predetermined unit when the text is divided into predetermined units, and a user classification estimated based on the text and the specific position.
  • Store the attached correspondence information As described above, the corresponding information is, for example, a sentence, and when the sentence is divided into clauses (predetermined units) for each punctuation mark and bookmarks (indicators) are added to sentences (clauses) that the user is interested in. and a group classification when users having the same (substantially the same or similar) tendency are grouped based on the position of the text and the bookmark. .
  • the text may be, for example, an interview article created when the researcher is interviewed, or may be an article other than the interview article.
  • the specific position may be, for example, the position of a clause when a sentence (interview article) is divided into clauses as a predetermined unit.
  • the control unit 10 and the like may divide sentences by punctuation when dividing sentences into clauses as predetermined units.
  • the control unit 10 and the like may divide sentences into clauses based on a learning model in which punctuation is learned in advance and sentences.
  • the control unit 10 and the like may divide sentences into clauses by various processing methods, not limited to the specific example described above.
  • the control unit 10 or the like divides the text into specific units, and is not limited to the example of dividing the sentences into specific units as described above. Each page, etc.) may be used to separate sentences.
  • the corresponding information may be further associated with one or more recommended sentences recommended for each user classification.
  • the recommended sentence may be, for example, a sentence recommended to the first user in the group classified by the specifying unit 15, which will be described later.
  • the recommendation text may be, for example, an interview article created when the researcher is interviewed.
  • the communication unit 22 is capable of transmitting and receiving various information with, for example, a device outside the information processing device 1 (external device).
  • the external device may be, for example, an external server (not shown), the user terminal 100 (see FIG. 1), or the like.
  • the user terminal 100 may be, for example, a terminal or the like used by a user (for example, a first user or the like).
  • User terminal 100 may be, for example, a desktop, a laptop, a tablet, a smart phone, and the like.
  • the display unit 23 can display, for example, various characters, symbols and images.
  • the first acquisition unit 11 acquires user information about a specific position marked by the first user when the first user reads a sentence.
  • a specific position marked by the first user when the first user reads a sentence.
  • the storage area may be various storage areas such as the storage unit 21, an external server (not shown), and a terminal storage unit (not shown) of the user terminal 100, for example.
  • the display may be, for example, the display unit 23, a terminal display unit (not shown) of the user terminal 100, or the like.
  • the input unit may be, for example, a keyboard (eg, hard keys, soft keys, etc.), a mouse, and the like.
  • the first acquisition unit 11 acquires information about the specific position to which the index is attached from the user terminal 100 via the communication unit 22 .
  • the first acquisition unit 11 acquires information about the specific position to which the index is attached.
  • the text may be, for example, an interview article created when interviewing a researcher, as described above.
  • the specific position may be, for example, the position of a clause when a sentence (interview article) is segmented into clauses as a predetermined unit, as described above. Therefore, when the first user reads the interview article, the first acquisition unit 11 obtains user information about the specific position of one or more indicators attached to the specific clause of the interview article as a reference for the first user. may be obtained.
  • the estimation unit 12 estimates the classification according to the specific position of the first user. If the specific position of the text recorded in the user information is the same (or substantially the same) or similar to the specific position of the text recorded in the corresponding information, the estimating unit 12 detects the specific position of the text recorded in the user information. is relatively close to the specific position of the sentence recorded in the corresponding information, the first user is presumed to belong to the category recorded in the corresponding information.
  • the text recorded in the user information and the text recorded in the correspondence information may be the same, for example.
  • the estimation unit 12 may perform the following processing as an example.
  • FIG. 3 is a diagram showing the relationship between visible variable vectors and hidden variable vectors.
  • the estimating unit 12 uses the contrastive divergence method or the like to perform mutual sampling of the visible variable vector and the hidden variable vector as illustrated in FIG. 3, thereby classifying the first user. can be estimated.
  • the estimating unit 12 constructs a model (relationship illustrated in FIG. 3) for estimating hidden human characteristics "h” based on correspondence information using visible data "V" based on user information.
  • the visible data V is data indicating a specific position (clause) with an index (bookmark) attached to a sentence based on user information.
  • Vectorizing the visible data V results in the following visible variable vector.
  • N indicates the number of clauses in the entire sentence. In the visible variable vector, there are N for each first user.
  • Hidden personal characteristics h indicate the assumed personal characteristics of each user based on the corresponding information. That is, the hidden person characteristic h is a person characteristic hypothesis when it is assumed that a person exists. Vectorizing the hidden person characteristic h results in the following hidden variable vector.
  • T indicates the transposition of the vector
  • b, c, and w indicate the parameters.
  • the group setting unit 13 groups a plurality of first users for each classification estimated by the estimation unit 12 .
  • the estimation unit 12 estimates the classification of a plurality of first users
  • the group setting unit 13 groups the plurality of first users for each classification.
  • the control unit 10 when the control unit 10 acquires user information corresponding to each of the first users C, D, and E reading the text and adding the index, the user information and , and the correspondence information, the classification of the first users C, D, and E is estimated.
  • the control unit 10 presumes that the first users C and D belong to the same classification and the first user E belongs to another classification. , and the first user E is grouped.
  • the second acquisition unit 14 acquires sentence information related to sentences read by the plurality of first users based on the user information of each of the plurality of first users grouped by the group setting unit 13 .
  • the second acquisition unit 14 obtains the first user C's , D acquire text information about the text read.
  • the text information may be, for example, information that can identify the text read by the first users C and D.
  • textual information may be information that can identify various texts, including text content, author, title, researcher interviewed, publisher and publication date, and the like.
  • the second acquisition unit 14 based on the user information when each of the first users E in the same group puts an index (for example, a bookmark) in a sentence, Sentence information related to the sentence read by the first user E is acquired.
  • the specifying unit 15 is a second user and a third user among the plurality of first users grouped by the group setting unit 13, and the text information of the second user acquired by the second acquiring unit 14; Based on the third user's sentence information, the second user identifies sentences that the third user has not read among the sentences read by the second user.
  • the identification unit 15 determines, based on the text information of the first users C and D in the same group, that although the first user C is reading the text, the first user D Identify sentences you haven't read. That is, the identifying unit 15 identifies sentences that the first user D (third user) has not read among the sentences read by the first user C (second user).
  • the identifying unit 15 selects a fourth user among the plurality of first users grouped by the group setting unit 13 and selects a fourth user among the recommended sentences associated in the correspondence information stored in the storage unit 21 . 2 Based on the text information of the fourth user acquired by the acquisition unit 14, recommended texts that the fourth user has not read may be specified.
  • the specifying unit 15 selects the first users C, D ( A recommended sentence that the fourth user) has not read is specified.
  • the identification unit 15 selects the first user E (fourth user) among the recommended sentences recorded in the correspondence information based on the sentence information of the first user E in the same group. Identify recommended texts that have not read.
  • the recommendation unit 16 recommends the text identified by the identification unit 15 to the third user.
  • the recommendation unit 16 determines that, among the sentences F, G, and H read by the first user C (second user) by the identification unit 15, the first user D (third user)
  • sentence H is recommended to first user D (third user) (here, symbols F, G, and H are not shown in FIG. 1).
  • the recommendation unit 16 may control the communication unit 22 to transmit the text information of the text H to the user terminal 100 used by the first user D (third user).
  • the recommendation unit 16 may control the display unit 23 to display the text information of the text H.
  • the recommendation unit 16 may recommend the recommended text identified by the identification unit 15 to the fourth user.
  • the recommendation unit 16 causes the specification unit 15 to specify the recommended text K as the recommended text that the first user D (fourth user) has not read among the recommended texts (I, J, K). If identified, recommended sentence K is recommended to first user D (fourth user) (where symbols I, J, and K are not shown in FIG. 1).
  • the recommendation unit 16 may control the communication unit 22 to transmit the recommended sentence information of the recommended sentence K to the user terminal 100 used by the first user D (fourth user).
  • the recommendation unit 16 may control the display unit 23 to display the text information of the recommended text K.
  • the recommendation unit 16 similarly recommends a recommendation sentence for the first user C (fourth user). Similarly, in the case of FIG. 1, as an example, the recommendation unit 16 selects a recommended text from among the recommended texts (I, J, K) that the first user E (fourth user) has not read, by the specifying unit 15. When J is identified, recommended sentence J is recommended to first user E (fourth user). As an example, the recommendation unit 16 may control the communication unit 22 to transmit the recommended sentence information of the recommended sentence J to the user terminal 100 used by the first user E (fourth user). As another example, the recommendation unit 16 may control the display unit 23 to display the recommended sentence information of the recommended sentence J.
  • the recommended text information may be associated with the recommended text of the corresponding information, and includes various recommendations including recommended text content, author, title, interviewed researcher, publisher, publication date, and the like. Information that can specify a sentence may be used.
  • the control unit 10 learns parameters b, c, and w based on output results of visible data V as user information and hidden person characteristics h as corresponding information.
  • the control unit 10 may perform learning based on the gradient method for the parameters b, c, and w. In this case, the control unit 10 may stop learning when the range of change is small and falls below a predetermined standard. The predetermined criterion may be determined based on learning data and actual learning conditions.
  • the estimation unit 12 may learn the parameters b, c, and w at predetermined timings.
  • the control unit 10 uses the past visible data V of the first user to output the person characteristic h (classification of the first user) for the first user (sampling). Furthermore, the control unit 10 outputs (sampling) the visible data V again from the person characteristics h (classification of the first user), and predicts the reaction of the first user to the unread text. In this case, the control unit 10 excludes read sentences from the recommendation and recommends only unread sentences.
  • control unit 10 recommends, to the first user, sentences that are expected to respond favorably to the first user among the unread sentences.
  • FIG. 4 is a first flowchart for explaining an information processing method according to one embodiment.
  • the first acquisition unit 11 acquires user information.
  • the user information may be information about a specific position to which the first user marks (bookmarks) when the first user reads the text.
  • the specific position may be, for example, the position of the indexed clause in the sentence, ie information that characterizes the indexed clause.
  • the first acquisition unit 11 obtains information about specific positions of one or more indices attached to a specific phrase of the interview article as a reference for the first user. User information may be obtained.
  • step ST102 the estimation unit 12 estimates the classification corresponding to the specific position of the first user based on the user information acquired in step ST101 and the correspondence information stored in the storage unit 21.
  • step ST103 the group setting unit 13 groups a plurality of first users for each classification estimated in step ST102. That is, for example, when the classification of a plurality of first users is estimated in step ST102, the group setting unit 13 groups the plurality of first users for each classification.
  • FIG. 5 is a second flowchart for explaining the information processing method according to one embodiment.
  • step ST201 the second acquisition unit 14 collects sentence information related to sentences read by the plurality of first users based on the user information of each of the plurality of first users grouped in step ST103 shown in FIG. to get
  • step ST202 the identifying unit 15 identifies, for example, the second user C and the third user D among the plurality of first users C and D in the same group grouped in step ST103, Based on the sentence information of the second user C and the sentence information of the third user D acquired by the acquisition unit 14, the sentences read by the second user C but not read by the third user D are specified.
  • the recommendation unit 16 recommends the text identified in step ST202 to the third user D. That is, the recommendation unit 16 controls the output unit so that the third user D can browse the text information related to the text specified in step ST202.
  • the recommendation unit 16 may control, for example, at least one of the communication unit 22 and the display unit 23, in other words, at least one selected from the group of the communication unit 22 and the display unit 23, as the output unit. Note that the recommendation unit 16 may control the storage unit 21 as an embodiment of the output unit so as to store sentence information related to the sentence specified in step ST202.
  • FIG. 6 is a third flowchart for explaining the information processing method according to one embodiment.
  • step ST301 the second acquisition unit 14 collects sentence information related to sentences read by the plurality of first users based on the user information of each of the plurality of first users grouped in step ST103 shown in FIG. to get
  • the identification unit 15 is, for example, the fourth user E among one or more first users E in the same group grouped in step ST103, and is stored in the storage unit 21.
  • recommended sentences that the fourth user E has not read are specified based on the sentence information of the fourth user E acquired in step ST301.
  • the recommendation unit 16 recommends the recommended text identified in step ST302 to the fourth user E. That is, the recommendation unit 16 controls the output unit so that the fourth user E can browse the recommended sentence information related to the recommended sentence specified in step ST302.
  • the recommendation unit 16 may control, for example, at least one of the communication unit 22 and the display unit 23, in other words, at least one selected from the group of the communication unit 22 and the display unit 23, as the output unit. Note that the recommendation unit 16 may control the storage unit 21 as an embodiment of the output unit so as to store recommended text information related to the recommended text specified in step ST302.
  • Each unit of the information processing apparatus 1 described above may be implemented as a function of an arithmetic processing unit of a computer or the like. That is, the first acquiring unit 11, the estimating unit 12, the group setting unit 13, the second acquiring unit 14, the identifying unit 15, and the recommending unit 16 (control unit 10) of the information processing device 1 are configured by an arithmetic processing unit of a computer or the like. It may be implemented as a first acquisition function, an estimation function, a group setting function, a second acquisition function, a specific function, and a recommendation function (control function).
  • the information processing program can cause the computer to implement each function described above.
  • the information processing program may be recorded in a non-temporary computer-readable recording medium such as a memory, solid state drive, hard disk drive, or optical disc.
  • each part of the information processing device 1 may be realized by an arithmetic processing device of a computer or the like.
  • the arithmetic processing unit or the like is configured by an integrated circuit or the like, for example. Therefore, each part of the information processing device 1 may be implemented as a circuit that constitutes an arithmetic processing device or the like. That is, the first acquisition unit 11, the estimation unit 12, the group setting unit 13, the second acquisition unit 14, the identification unit 15, and the recommendation unit 16 (control unit 10) of the information processing device 1 constitute an arithmetic processing unit of a computer.
  • the storage unit 21, the communication unit 22, and the display unit 23 (output unit) of the information processing device 1 may be implemented as, for example, a storage function including functions of an arithmetic processing unit, a communication function, and a display function (output function). good.
  • the storage unit 21, the communication unit 22, and the display unit 23 (output unit) of the information processing device 1 are realized as a storage circuit, a communication circuit, and a display circuit (output circuit) by being configured by an integrated circuit or the like, for example.
  • the storage unit 21, the communication unit 22, and the display unit 23 (output unit) of the information processing device 1 are configured as a storage device, a communication device, and a display device (output device), for example, by being configured by a plurality of devices.
  • the information processing apparatus 1 can combine any one of the plurality of units described above or an arbitrary plurality thereof.
  • information can be interchanged with “data” and the term “data” can be interchanged with “information.”
  • An information processing apparatus includes a text, a specific position where a user attaches an index to a predetermined unit when the text is divided into predetermined units, and a user classification estimated based on the text and the specific position.
  • a first acquisition unit for acquiring user information about a specific position marked by the first user when the first user reads a sentence; an estimation unit for estimating a classification corresponding to a specific position of the first user based on the acquired user information and the corresponding information stored in the storage unit; and a plurality of first users for each classification estimated by the estimation unit.
  • a group setting unit that performs grouping. Accordingly, the information processing apparatus can group users for use in recommending sentences. Since the information processing apparatus attaches an index to the sentences that the user has divided into predetermined units, it is possible to reflect the preferences of each user and to classify the users more appropriately.
  • An information processing apparatus acquires text information related to text read by a plurality of first users based on user information of each of the plurality of first users grouped by a group setting unit. and a second user and a third user among a plurality of first users grouped by the group setting unit, wherein the text information of the second user and the text information of the third user acquired by the second acquisition unit Based on the sentence information, the second user has a specification unit that specifies sentences that the third user has not read among the sentences that the second user has read, and a recommendation unit that recommends the sentences specified by the specification unit to the third user. You can do it.
  • a plurality of users in the same group are considered to have similarities in terms of their interest in the text and the points of reference for the text. Therefore, the information processing apparatus can recommend sentences that suit the user's taste to users in the same group.
  • the correspondence information stored in the storage unit is further associated with recommended sentences recommended for each user classification
  • the specifying unit includes a plurality of sentences grouped by the group setting unit.
  • a fourth user among the first users of the fourth user based on the text information of the fourth user acquired by the second acquisition unit among the recommended texts associated in the correspondence information stored in the storage unit
  • the fourth user may specify the recommended sentences that the fourth user has not read, and the recommendation unit may recommend the recommended sentences specified by the specifying unit to the fourth user.
  • the information processing apparatus can recommend recommended sentences associated with each user classification, that is, each group, to the users classified into the corresponding groups.
  • the sentence associated with the correspondence information stored in the storage unit is an interview article created when interviewing a researcher
  • the specific position associated with the correspondence information is a sentence as a sentence. It is the position of the clause when the interview article is divided into clauses as a predetermined unit, and the first acquisition unit specifies the interview article as a reference for the first user when the first user reads the interview article. It is also possible to obtain user information on the specific position of one or more indicators attached to the clause. Thereby, the information processing device can classify the user according to the interview article that the user is interested in.
  • a sentence, a specific position where a user attaches an index to a predetermined unit when the sentence is divided into predetermined units, and a classification of the user estimated based on the sentence and the specific position a first obtaining step of obtaining user information about a specific position marked by the first user when the first user reads a sentence; an estimation step of estimating a classification according to the specific position of the first user based on the user information acquired by the acquisition step and the correspondence information stored in the storage unit; and a group setting step for grouping one user. Accordingly, the information processing method can achieve the same effect as the information processing apparatus of one aspect described above.
  • An information processing program of one aspect provides a computer with a sentence, a specific position marked by a user on a predetermined unit when the sentence is divided into predetermined units, and an estimation based on the sentence and the specific position.
  • a storage function for storing correspondence information associated with user classifications;
  • a first acquisition function for acquiring user information relating to a specific position marked by the first user when the first user reads a sentence;
  • an estimation function for estimating a classification corresponding to a specific position of the first user based on the user information acquired by the acquisition function and the correspondence information stored in the storage function; and a group setting function for grouping one user. Accordingly, the information processing program can achieve the same effect as the information processing apparatus of one aspect described above.
  • control unit 11 first acquisition unit 12 estimation unit 13 group setting unit 14 second acquisition unit 15 identification unit 16 recommendation unit 21 storage unit 22 communication unit 23 display unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides an information processing device, an information processing method, and an information processing program that can group users to be used to recommend a document. The information processing device comprises: a storage unit that stores a document, specific positions of indicators attached by a user to prescribed units obtained by dividing the document, and association information associating the document with a user classification estimated on the basis of the specific positions; a first acquisition unit that acquires user information pertaining to the specific positions of the indicators added by a first user when the first user read the document; an estimation unit that estimates, on the basis of the user information acquired by the first acquisition unit and the association information stored in the storage unit, a classification according to the specific positions designated by the first user; and a group setting unit that groups a plurality of first users for each classification estimated by the estimation unit.

Description

情報処理装置、情報処理方法及び情報処理プログラムInformation processing device, information processing method and information processing program
 本発明は、情報処理装置、情報処理方法及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing method, and an information processing program.
 従来から、文献を検索する装置(文献検索装置)がある。文献検索装置には、特許文献1に記載のように、特許文献を検索することに特化したものがある。
 特許文献1に記載される文献検索装置は、検索キーワードに一致する特許文献を検索し、その検索の結果が閲覧された場合に閲覧と文献とを対応付けて検索履歴として記憶する。さらに文献検索装置は、他のユーザの検索履歴と、現在のユーザの検索履歴とに基づいて類似度を算出し、算出される類似度が最も高い他のユーザが閲覧した特許文献のうち、現在のユーザが閲覧していない特許文献を推薦する。
2. Description of the Related Art Conventionally, there is a device (document retrieval device) for retrieving documents. As described in Japanese Patent Laid-Open No. 2002-200012, there is a document search device specialized for searching patent documents.
The document search device described in Patent Literature 1 searches for patent documents that match a search keyword, and when the search result is browsed, associates the browse with the document and stores it as a search history. Furthermore, the document search device calculates the similarity based on the search history of other users and the search history of the current user, and among the patent documents viewed by other users with the highest calculated similarity, the current recommend patent documents that have not been viewed by users of
特開2014-186607号公報JP 2014-186607 A
 特許文献1に記載される文献検索装置は、上述したように、検索キーワードに基づく検索の結果を閲覧したか否か等に応じて特許文献を推薦する。しかしながら、一般的に文献には特許文献に限らず、種々の文献が存在する。この場合、ユーザは、自身の好みに応じた文献を読むことを希望することがあり、特許文献1に記載される文献検索装置では検索キーワードをユーザの好みに応じて設定することができず、ユーザの好みに応じた推薦することができない可能性があった。 As described above, the document search device described in Patent Document 1 recommends patent documents depending on whether or not the search result based on the search keyword has been viewed. However, in general, literature includes not only patent literature but also various other literature. In this case, the user may wish to read a document that suits his or her taste, and the document retrieval device described in Patent Document 1 cannot set the search keyword according to the user's preference. There is a possibility that recommendations according to the user's preferences cannot be made.
 本発明は、文章の推薦に利用するユーザのグループ化を行うことが可能な情報処理装置、情報処理方法及び情報処理プログラムを提供する。 The present invention provides an information processing device, an information processing method, and an information processing program capable of grouping users to be used for recommending sentences.
 一態様の情報処理装置は、文章と、その文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、文章と特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶部と、第1ユーザが文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得部と、第1取得部によって取得するユーザ情報と、記憶部に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定部と、推定部によって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定部と、を備える。 An information processing apparatus according to one aspect includes a text, a specific position where a user attaches an index to a predetermined unit when the text is divided into predetermined units, and a user classification estimated based on the text and the specific position. a first acquisition unit for acquiring user information about a specific position marked by the first user when the first user reads a sentence; an estimation unit for estimating a classification corresponding to a specific position of the first user based on the acquired user information and the corresponding information stored in the storage unit; and a plurality of first users for each classification estimated by the estimation unit. and a group setting unit that performs grouping.
 一態様によれば、第1ユーザが文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得し、そのユーザ情報と対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定し、分類毎に複数の第1ユーザのグループ化を行うので、文章の推薦に利用するユーザのグループ化を行うことができる。 According to one aspect, when the first user reads a text, user information about a specific position marked by the first user is acquired, and based on the user information and the corresponding information, the specific position of the first user is displayed. Since the corresponding classification is estimated and a plurality of first users are grouped for each classification, it is possible to group users to be used for recommending sentences.
一実施形態に係る情報処理装置について説明するための図である。1 is a diagram for explaining an information processing device according to an embodiment; FIG. 一実施形態に係る情報処理装置について説明するためのブロック図である。1 is a block diagram for explaining an information processing device according to an embodiment; FIG. 可視変数ベクトルと隠れ変数ベクトルとの関係を示す図である。FIG. 4 is a diagram showing the relationship between visible variable vectors and hidden variable vectors; 一実施形態に係る情報処理方法について説明するための第1のフローチャートである。4 is a first flowchart for explaining an information processing method according to one embodiment; 一実施形態に係る情報処理方法について説明するための第2のフローチャートである。7 is a second flowchart for explaining the information processing method according to one embodiment; 一実施形態に係る情報処理方法について説明するための第3のフローチャートである。9 is a third flowchart for explaining the information processing method according to one embodiment;
 以下、一実施形態について説明する。 An embodiment will be described below.
[情報処理装置の概要]
 まず、一実施形態に係る情報処理装置1の概要について説明する。
 図1は、一実施形態に係る情報処理装置1について説明するための図である。
[Overview of information processing device]
First, an outline of an information processing device 1 according to an embodiment will be described.
FIG. 1 is a diagram for explaining an information processing device 1 according to one embodiment.
 情報処理装置1は、例えば、文章の推薦に利用するユーザのグループ化を行うグループ形成装置として構成されてもよく、グループ内のユーザに文章を推薦する文章推薦装置として構成されてもよい。
 情報処理装置1は、例えば、サーバ、ラップトップ、デスクトップ、タブレット及びスマートフォン等のコンピュータであってもよい。
For example, the information processing device 1 may be configured as a group forming device for grouping users to be used for recommending texts, or may be configured as a text recommendation device for recommending texts to users in a group.
The information processing device 1 may be, for example, a computer such as a server, laptop, desktop, tablet, and smart phone.
 情報処理装置1は、例えば、ユーザを複数のグループのいずれかに分類し、1つのグループ内のユーザが読んだ文章を、そのグループ内の他のユーザに推薦する。この場合、まず情報処理装置1は、対応情報に基づいてユーザをグループに分類する。
 ここで、対応情報は、例えば、文章201と、その文章201を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、文章201と特定位置とに基づいて推定されるユーザの分類(例えば、ユーザをグループ分けするための分類)とを対応付けた情報であってもよい。一例として、対応情報は、文章201と、その文章201を句読点毎に文節(所定単位)に分割して、ユーザが興味を持った文章201の箇所にブックマーク202(指標)を付す際のその箇所(特定位置)と、文章201とブックマーク202の位置とに基づいて同じ(略同じ又は類似)傾向を有するユーザをグループ化した際のグループの分類とを対応付けた情報であってもよい。より具体的な一例として図1に例示するように、対応情報は、文章201と、ブックマーク202の位置とに応じて分類されるグループA,Bについての情報であってもよい。対応情報は、記憶部21に記憶されていてもよい。
The information processing apparatus 1, for example, classifies users into one of a plurality of groups, and recommends sentences read by users in one group to other users in the group. In this case, the information processing device 1 first classifies the users into groups based on the correspondence information.
Here, the correspondence information is estimated based on, for example, the text 201, the specific position where the user attaches an index to the predetermined unit when the text 201 is divided into predetermined units, and the text 201 and the specific position. It may also be information that associates a user classification (for example, a classification for grouping users). As an example, the corresponding information is a sentence 201 and a sentence 201 divided into clauses (predetermined units) for each punctuation mark. It may be information that associates (specific position) with a group classification when users having the same (substantially the same or similar) tendency are grouped based on the positions of the text 201 and the bookmark 202 . As a more specific example, as illustrated in FIG. 1, the correspondence information may be information about groups A and B classified according to the sentence 201 and the position of the bookmark 202 . Correspondence information may be stored in the storage unit 21 .
 情報処理装置1は、例えば、新たなユーザとしての第1ユーザが文章101を読み、第1ユーザが興味を持ったその文章101の文節(所定単位)にブックマーク102(指標)を付す場合、そのブックマーク102の位置(特定位置)に関するユーザ情報を取得する。
 情報処理装置1は、例えば、対応情報及びユーザ情報に基づいて、第1ユーザをグループに分類する。すなわち、情報処理装置1は、対応情報及びユーザ情報に基づいて、文章101の興味を持った箇所について同じ(略同じ又は類似の)傾向を有する複数の第1ユーザをグループ化する、換言するとその複数の第1ユーザを同一のグループに分類する。
 より具体的な一例として図1に例示するように、情報処理装置1は、第1ユーザC,D,Eが文章101を読んでブックマーク102付したユーザ情報を取得する。情報処理装置1は、例えば、第1ユーザC,D,Eのユーザ情報としての文章101及びブックマーク102の特定位置とが、対応情報としてのグループA,Bのいずれのうちの文章101とブックマーク102の特定位置とが対応するか判定する。すなわち、情報処理装置1は、例えば、ユーザ情報の文章101及びブックマーク102の特定位置が対応情報の文章201及びブックマーク202の特定位置と同じ(略同じ又は類似する)かに応じて、第1ユーザC,D,EをグループA,Bのいずれかに分類する。図1に例示する場合では、情報処理装置1は、第1ユーザC,Dを同一グループとして推定し、第1ユーザEを他のグループとして推定する。
 図1に例示する場合、情報処理装置1は、例えば、同一グループとなる第1ユーザCが読んだ文章であり、第1ユーザDが読んでない文章があれば、第1ユーザCが読んだ文章を第1ユーザDに推薦する。
For example, when a first user as a new user reads a text 101 and attaches a bookmark 102 (index) to a clause (predetermined unit) of the text 101 that the first user is interested in, the User information about the position (specific position) of the bookmark 102 is acquired.
The information processing device 1 classifies the first users into groups, for example, based on the correspondence information and the user information. That is, based on the correspondence information and the user information, the information processing apparatus 1 groups a plurality of first users who have the same (substantially the same or similar) tendency with respect to the portion of the text 101 that interests them. A plurality of first users are classified into the same group.
As a more specific example, as illustrated in FIG. 1, the information processing apparatus 1 acquires user information bookmarked 102 by first users C, D, and E after reading sentences 101 . In the information processing apparatus 1, for example, the text 101 and the specific position of the bookmark 102 as the user information of the first users C, D, and E correspond to the text 101 and the bookmark 102 in any of the groups A and B as the correspondence information. It is determined whether the specific position of corresponds. That is, for example, the information processing apparatus 1 determines whether the specific positions of the text 101 and the bookmark 102 of the user information are the same (substantially the same or similar) as the specific positions of the text 201 and the bookmark 202 of the corresponding information. Classify C, D and E into either group A or B. In the case illustrated in FIG. 1, the information processing device 1 estimates that the first users C and D belong to the same group, and the first user E belongs to another group.
In the case illustrated in FIG. 1, the information processing apparatus 1 detects, for example, a text read by the first user C who belongs to the same group, and if there is a text not read by the first user D, the text read by the first user C is recommended to the first user D.
[情報処理装置1の詳細]
 次に、一実施形態に係る情報処理装置1の詳細について説明する。
 図2は、一実施形態に係る情報処理装置1について説明するためのブロック図である。
[Details of information processing device 1]
Next, details of the information processing apparatus 1 according to one embodiment will be described.
FIG. 2 is a block diagram for explaining the information processing device 1 according to one embodiment.
 情報処理装置1は、例えば、記憶部21、通信部22、表示部23及び制御部10等を備える。制御部10は、例えば、第1取得部11、推定部12、グループ設定部13、第2取得部14、特定部15及び推薦部16等を備える。制御部10は、例えば、情報処理装置1の演算処理装置等によって構成されてもよい。制御部10は、例えば、記憶部21等に記憶される各種プログラムを適宜読み出して実行することにより、各部(例えば、第1取得部11、推定部12、グループ設定部13、第2取得部14、特定部15及び推薦部16等)の機能を実現してもよい。 The information processing device 1 includes, for example, a storage unit 21, a communication unit 22, a display unit 23, a control unit 10, and the like. The control unit 10 includes, for example, a first acquisition unit 11, an estimation unit 12, a group setting unit 13, a second acquisition unit 14, a specification unit 15, a recommendation unit 16, and the like. The control unit 10 may be configured by, for example, an arithmetic processing unit of the information processing device 1 or the like. For example, by appropriately reading and executing various programs stored in the storage unit 21 or the like, the control unit 10 controls each unit (for example, the first acquisition unit 11, the estimation unit 12, the group setting unit 13, the second acquisition unit 14). , the identifying unit 15, the recommending unit 16, etc.).
 記憶部21は、例えば、種々の情報及びプログラムを記憶してもよい。記憶部21の一例は、メモリ、ソリッドステートドライブ及びハードディスクドライブ等であってもよい。記憶部21は、例えば、情報処理装置1の内部にあってもよく、情報処理装置1の外部にあってもよい。 The storage unit 21 may store various information and programs, for example. Examples of the storage unit 21 may be a memory, a solid state drive, a hard disk drive, and the like. The storage unit 21 may be, for example, inside the information processing device 1 or outside the information processing device 1 .
 記憶部21は、文章と、その文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、文章と特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する。対応情報は、上述したように、例えば、文章と、その文章を句読点毎に文節(所定単位)に分割して、ユーザが興味を持った文章(文節)の箇所にブックマーク(指標)を付す際のその箇所(特定位置)と、文章とブックマークの位置とに基づいて同じ(略同じ又は類似)の傾向を有するユーザをグループ化した際のグループの分類とを対応付けた情報であってもよい。 The storage unit 21 associates a text with a specific position where the user marks a predetermined unit when the text is divided into predetermined units, and a user classification estimated based on the text and the specific position. Store the attached correspondence information. As described above, the corresponding information is, for example, a sentence, and when the sentence is divided into clauses (predetermined units) for each punctuation mark and bookmarks (indicators) are added to sentences (clauses) that the user is interested in. and a group classification when users having the same (substantially the same or similar) tendency are grouped based on the position of the text and the bookmark. .
 ここで、文章は、例えば、研究者をインタビューした際に作成されるインタビュー記事等であってもよく、インタビュー記事以外の他の記事等であってもよい。特定位置は、例えば、文章(インタビュー記事)を所定単位としての文節毎に区切った際の文節の位置であってもよい。
 また一例として、制御部10等は、所定単位として文章を文節毎に区切る場合、句読点で文章を区切ってもよい。具体的な一例として、制御部10等は、句読点を予め学習した学習モデルと、文章とに基づいて、文章を文節毎に区切ってもよい。制御部10等は、上述した具体的な一例に限らず、種々の処理方法によって文章を文節毎に区切ってもよい。また、制御部10等は、文章を特定単位毎に区切ることとして、上述したように文節毎に区切る例に限らず、適宜設定される単位毎(例えば、1文毎、段落毎、章毎及びページ毎等)で文章を区切ってもよい。
Here, the text may be, for example, an interview article created when the researcher is interviewed, or may be an article other than the interview article. The specific position may be, for example, the position of a clause when a sentence (interview article) is divided into clauses as a predetermined unit.
Further, as an example, the control unit 10 and the like may divide sentences by punctuation when dividing sentences into clauses as predetermined units. As a specific example, the control unit 10 and the like may divide sentences into clauses based on a learning model in which punctuation is learned in advance and sentences. The control unit 10 and the like may divide sentences into clauses by various processing methods, not limited to the specific example described above. In addition, the control unit 10 or the like divides the text into specific units, and is not limited to the example of dividing the sentences into specific units as described above. Each page, etc.) may be used to separate sentences.
 対応情報には、ユーザの分類毎に推薦する1又は複数の推薦文章がさらに対応付けられていてもよい。推薦文章は、例えば、後述する特定部15によって分類されるグループ内の第1ユーザに対して推薦する文章であってもよい。推薦文章は、例えば、研究者をインタビューした際に作成されるインタビュー記事等であってもよい。 The corresponding information may be further associated with one or more recommended sentences recommended for each user classification. The recommended sentence may be, for example, a sentence recommended to the first user in the group classified by the specifying unit 15, which will be described later. The recommendation text may be, for example, an interview article created when the researcher is interviewed.
 通信部22は、例えば、情報処理装置1の外部にある装置(外部装置)等との間で種々の情報の送受信が可能である。外部装置は、例えば、外部サーバ(図示せず)及びユーザ端末100(図1参照)等であってもよい。ユーザ端末100は、例えば、ユーザ(例えば、第1ユーザ等)が使用する端末等であってもよい。ユーザ端末100は、例えば、デスクトップ、ラップトップ、タブレット及びスマートフォン等であってもよい。 The communication unit 22 is capable of transmitting and receiving various information with, for example, a device outside the information processing device 1 (external device). The external device may be, for example, an external server (not shown), the user terminal 100 (see FIG. 1), or the like. The user terminal 100 may be, for example, a terminal or the like used by a user (for example, a first user or the like). User terminal 100 may be, for example, a desktop, a laptop, a tablet, a smart phone, and the like.
 表示部23は、例えば、種々の文字、記号及び画像等を表示することが可能である。 The display unit 23 can display, for example, various characters, symbols and images.
 第1取得部11は、第1ユーザが文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する。
 一例として、記憶領域に記憶される文章をディスプレイに表示させる場合、第1ユーザは、その文章を読んで興味を持った箇所等が有ると、入力部(図示せず)等を操作して、文章のうち興味を持った箇所(特定位置)に指標(例えば、ブックマーク等)を付す。ここで、記憶領域は、例えば、記憶部21、外部サーバ(図示せず)及びユーザ端末100の端末記憶部(図示せず)等を始めとする種々の記憶領域であってもよい。ディスプレイは、例えば、表示部23及びユーザ端末100の端末表示部(図示せず)等であってもよい。入力部は、例えば、キーボード(例えば、ハードキー及びソフトキー等)及びマウス等であってもよい。
 第1取得部11は、例えば、ユーザ端末100において文章に指標が付された場合、その指標が付された特定位置に関する情報を、通信部22を介してユーザ端末100から取得する。又は、第1取得部11は、例えば、情報処理装置1において所定単位に指標が付された場合、その指標が付された特定位置に関する情報を取得する。
The first acquisition unit 11 acquires user information about a specific position marked by the first user when the first user reads a sentence.
As an example, when displaying text stored in a storage area on a display, if the first user finds a part of the text that interests him or the like, he operates an input unit (not shown) or the like, An index (for example, a bookmark or the like) is attached to an interesting part (specific position) in the text. Here, the storage area may be various storage areas such as the storage unit 21, an external server (not shown), and a terminal storage unit (not shown) of the user terminal 100, for example. The display may be, for example, the display unit 23, a terminal display unit (not shown) of the user terminal 100, or the like. The input unit may be, for example, a keyboard (eg, hard keys, soft keys, etc.), a mouse, and the like.
For example, when an index is attached to a sentence in the user terminal 100 , the first acquisition unit 11 acquires information about the specific position to which the index is attached from the user terminal 100 via the communication unit 22 . Alternatively, for example, when an index is attached to a predetermined unit in the information processing device 1, the first acquisition unit 11 acquires information about the specific position to which the index is attached.
 ここで文章は、例えば、上述したように、研究者をインタビューした際に作成されるインタビュー記事等であってもよい。また、特定位置は、例えば、上述したように、文章(インタビュー記事)を所定単位としての文節毎に区切った際の文節の位置であってもよい。したがって、第1取得部11は、第1ユーザがインタビュー記事を読んだ際に、第1ユーザが参考となるとしてインタビュー記事の特定の文節に付した1又は複数の指標の特定位置に関するユーザ情報を取得してもよい。 Here, the text may be, for example, an interview article created when interviewing a researcher, as described above. Also, the specific position may be, for example, the position of a clause when a sentence (interview article) is segmented into clauses as a predetermined unit, as described above. Therefore, when the first user reads the interview article, the first acquisition unit 11 obtains user information about the specific position of one or more indicators attached to the specific clause of the interview article as a reference for the first user. may be obtained.
 推定部12は、第1取得部11によって取得するユーザ情報と、記憶部21に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する。推定部12は、ユーザ情報に記録される文章の特定位置が対応情報に記録される文章の特定位置と同一(又は、略同一)又は類似する場合、すなわちユーザ情報に記録される文章の特定位置が対応情報に記録される文章の特定位置に相対的に近い場合、第1ユーザをその対応情報に記録される分類に属すると推定する。ここで、ユーザ情報に記録される文章と、対応情報に記録される文章とは、例えば、同一であってもよい。 Based on the user information acquired by the first acquisition unit 11 and the correspondence information stored in the storage unit 21, the estimation unit 12 estimates the classification according to the specific position of the first user. If the specific position of the text recorded in the user information is the same (or substantially the same) or similar to the specific position of the text recorded in the corresponding information, the estimating unit 12 detects the specific position of the text recorded in the user information. is relatively close to the specific position of the sentence recorded in the corresponding information, the first user is presumed to belong to the category recorded in the corresponding information. Here, the text recorded in the user information and the text recorded in the correspondence information may be the same, for example.
 推定部12は、第1ユーザの特定位置に応じた分類を推定する場合、一例として以下のような処理を行ってもよい。
 図3は、可視変数ベクトルと隠れ変数ベクトルとの関係を示す図である。
When estimating the classification corresponding to the specific position of the first user, the estimation unit 12 may perform the following processing as an example.
FIG. 3 is a diagram showing the relationship between visible variable vectors and hidden variable vectors.
 推定部12は、一例として、コントラスティブ・ダイバージェンス法等を利用して、図3に例示するように、可視変数ベクトルと隠れ変数ベクトルとを相互にサンプリングを行うことにより、第1ユーザの分類を推定してもよい。 As an example, the estimating unit 12 uses the contrastive divergence method or the like to perform mutual sampling of the visible variable vector and the hidden variable vector as illustrated in FIG. 3, thereby classifying the first user. can be estimated.
 すなわち一例として、推定部12は、ユーザ情報に基づく「V」という可視データを用いて、対応情報に基づく「h」という隠れた人物特性を推定するモデル(図3に例示する関係)を構築してもよい。
 可視データVは、ユーザ情報に基づく文章に指標(ブックマーク)が付された特定位置(文節)を示すデータである。可視データVをベクトル化すると、以下の可視変数ベクトルとなる。ここで、Nは、全文章における文節の数を示す。可視変数ベクトルにおいて、Nは第1ユーザ毎に存在する。
That is, as an example, the estimating unit 12 constructs a model (relationship illustrated in FIG. 3) for estimating hidden human characteristics "h" based on correspondence information using visible data "V" based on user information. may
The visible data V is data indicating a specific position (clause) with an index (bookmark) attached to a sentence based on user information. Vectorizing the visible data V results in the following visible variable vector. Here, N indicates the number of clauses in the entire sentence. In the visible variable vector, there are N for each first user.
Figure JPOXMLDOC01-appb-M000001
Figure JPOXMLDOC01-appb-M000001
 隠れた人物特性hは、対応情報に基づく仮定されるユーザ毎の人物特性を示す。すなわち、隠れた人物特性hは、人物が存在すると仮定した場合の人物特性仮説である。隠れた人物特性hをベクトル化すると、以下の隠れ変数ベクトルとなる。  Hidden personal characteristics h indicate the assumed personal characteristics of each user based on the corresponding information. That is, the hidden person characteristic h is a person characteristic hypothesis when it is assumed that a person exists. Vectorizing the hidden person characteristic h results in the following hidden variable vector.
Figure JPOXMLDOC01-appb-M000002
Figure JPOXMLDOC01-appb-M000002
 可視データVと隠れた人物特性hとの間には、以下の確率分布が備わり、以下の式のように支配されているものとする。ここで、Tはベクトルの転置、b,c,wはパラメータを示す。  The following probability distribution is provided between the visible data V and the hidden human characteristics h, and is governed by the following formula. Here, T indicates the transposition of the vector, and b, c, and w indicate the parameters.
Figure JPOXMLDOC01-appb-M000003
Figure JPOXMLDOC01-appb-M000003
 グループ設定部13は、推定部12によって推定される分類毎に複数の第1ユーザのグループ化を行う。グループ設定部13は、推定部12によって複数の第1ユーザの分類が推定される場合、分類毎に複数の第1ユーザをグループ化する。 The group setting unit 13 groups a plurality of first users for each classification estimated by the estimation unit 12 . When the estimation unit 12 estimates the classification of a plurality of first users, the group setting unit 13 groups the plurality of first users for each classification.
 ここで図1に例示するように、制御部10は、第1ユーザC,D,Eそれぞれが文章を読んで指標が付されることに対応するユーザ情報を取得した場合、それぞれのユーザ情報と、対応情報とに基づいて、第1ユーザC,D,Eの分類を推定する。ここで図1に例示する場合、制御部10は、第1ユーザC,Dが同一の分類と推定され、第1ユーザEが他の分類と推定されるため、第1ユーザC,Dをグループ化し、第1ユーザEをグループ化する。 Here, as exemplified in FIG. 1, when the control unit 10 acquires user information corresponding to each of the first users C, D, and E reading the text and adding the index, the user information and , and the correspondence information, the classification of the first users C, D, and E is estimated. Here, in the case of FIG. 1, the control unit 10 presumes that the first users C and D belong to the same classification and the first user E belongs to another classification. , and the first user E is grouped.
 第2取得部14は、グループ設定部13によってグループ化が行われた複数の第1ユーザのそれぞれのユーザ情報に基づいて、複数の第1ユーザが読んだ文章に関する文章情報を取得する。
 図1に例示する場合、第2取得部14は、同一グループ内の第1ユーザC,Dそれぞれが文章で指標(例えば、ブックマーク等)を付した際のユーザ情報に基づいて、第1ユーザC,Dが読んだ文章に関する文章情報を取得する。文章情報は、例えば、第1ユーザC,Dが読んだ文章を特定することができる情報であってもよい。一例として、文章情報は、文章内容、著者、タイトル、インタビューを受けた研究者、発行所及び発行日等を始めとする種々の文章を特定することができる情報であってもよい。
 また上記と同様に、図1に例示する場合、第2取得部14は、同一グループ内の第1ユーザEそれぞれが文章で指標(例えば、ブックマーク等)を付した際のユーザ情報に基づいて、第1ユーザEが読んだ文章に関する文章情報を取得する。
The second acquisition unit 14 acquires sentence information related to sentences read by the plurality of first users based on the user information of each of the plurality of first users grouped by the group setting unit 13 .
In the case of FIG. 1, the second acquisition unit 14 obtains the first user C's , D acquire text information about the text read. The text information may be, for example, information that can identify the text read by the first users C and D. By way of example, textual information may be information that can identify various texts, including text content, author, title, researcher interviewed, publisher and publication date, and the like.
In the same manner as described above, in the case illustrated in FIG. 1, the second acquisition unit 14, based on the user information when each of the first users E in the same group puts an index (for example, a bookmark) in a sentence, Sentence information related to the sentence read by the first user E is acquired.
 特定部15は、グループ設定部13によってグループ化が行われた複数の第1ユーザのうちの第2ユーザ及び第3ユーザであって、第2取得部14によって取得する第2ユーザの文章情報と第3ユーザの文章情報に基づいて、第2ユーザは読んだ文章のうち第3ユーザは読んでいない文章を特定する。
 図1に例示する場合、一例として、特定部15は、同一グループ内の第1ユーザC,Dの文章情報に基づいて、第1ユーザCは読んでいる文章ではあるが、第1ユーザDは読んでいない文章を特定する。すなわち、特定部15は、第1ユーザC(第2ユーザ)は読んだ文章のうち、第1ユーザD(第3ユーザ)は読んでいない文章を特定する。
The specifying unit 15 is a second user and a third user among the plurality of first users grouped by the group setting unit 13, and the text information of the second user acquired by the second acquiring unit 14; Based on the third user's sentence information, the second user identifies sentences that the third user has not read among the sentences read by the second user.
In the case of FIG. 1, as an example, the identification unit 15 determines, based on the text information of the first users C and D in the same group, that although the first user C is reading the text, the first user D Identify sentences you haven't read. That is, the identifying unit 15 identifies sentences that the first user D (third user) has not read among the sentences read by the first user C (second user).
 特定部15は、グループ設定部13によってグループ化が行われた複数の第1ユーザのうちの第4ユーザであって、記憶部21に記憶される対応情報において対応付けられる推薦文章のうち、第2取得部14によって取得する第4ユーザの文章情報に基づいて第4ユーザが読んでいない推薦文章を特定してもよい。
 図1に例示する場合、一例として、特定部15は、同一グループ内の第1ユーザC,Dの文章情報に基づいて、対応情報に記録される推薦文章のうち、第1ユーザC,D(第4ユーザ)が読んでいない推薦文章を特定する。
 同様に、図1に例示する場合、特定部15は、同一グループ内の第1ユーザEの文章情報に基づいて、対応情報に記録される推薦文章のうち、第1ユーザE(第4ユーザ)が読んでいない推薦文章を特定する。
The identifying unit 15 selects a fourth user among the plurality of first users grouped by the group setting unit 13 and selects a fourth user among the recommended sentences associated in the correspondence information stored in the storage unit 21 . 2 Based on the text information of the fourth user acquired by the acquisition unit 14, recommended texts that the fourth user has not read may be specified.
In the case illustrated in FIG. 1, as an example, the specifying unit 15 selects the first users C, D ( A recommended sentence that the fourth user) has not read is specified.
Similarly, in the case of FIG. 1, the identification unit 15 selects the first user E (fourth user) among the recommended sentences recorded in the correspondence information based on the sentence information of the first user E in the same group. Identify recommended texts that have not read.
 推薦部16は、特定部15によって特定される文章を第3ユーザに推薦する。
 図1に例示する場合、一例として、推薦部16は、特定部15によって第1ユーザC(第2ユーザ)は読んだ文章F,G,Hのうち、第1ユーザD(第3ユーザ)は読んでいない文章として文章Hが特定される場合、文章Hを第1ユーザD(第3ユーザ)に推薦する(ここで、符号F,G,Hは図1には図示せず)。ここで一例として、推薦部16は、文章Hの文章情報を第1ユーザD(第3ユーザ)が使用するユーザ端末100に送信するよう通信部22を制御してもよい。また一例として、推薦部16は、文章Hの文章情報を表示するよう表示部23を制御してもよい。
The recommendation unit 16 recommends the text identified by the identification unit 15 to the third user.
In the case illustrated in FIG. 1, as an example, the recommendation unit 16 determines that, among the sentences F, G, and H read by the first user C (second user) by the identification unit 15, the first user D (third user) When sentence H is identified as a sentence that has not been read, sentence H is recommended to first user D (third user) (here, symbols F, G, and H are not shown in FIG. 1). Here, as an example, the recommendation unit 16 may control the communication unit 22 to transmit the text information of the text H to the user terminal 100 used by the first user D (third user). As another example, the recommendation unit 16 may control the display unit 23 to display the text information of the text H.
 推薦部16は、特定部15によって特定される推薦文章を第4ユーザに推薦してもよい。
 図1に例示する場合、一例として、推薦部16は、特定部15によって推薦文章(I,J,K)のうち第1ユーザD(第4ユーザ)が読んでいない推薦文章として推薦文章Kが特定される場合、推薦文章Kを第1ユーザD(第4ユーザ)に推薦する(ここで、符号I,J,Kは図1には図示せず)。ここで一例として、推薦部16は、推薦文章Kの推薦文章情報を第1ユーザD(第4ユーザ)が使用するユーザ端末100に送信するよう通信部22を制御してもよい。また一例として、推薦部16は、推薦文章Kの文章情報を表示するよう表示部23を制御してもよい。推薦部16は、例えば、第1ユーザC(第4ユーザ)についても同様に推薦文章を推薦する。
 同様に図1に例示する場合、一例として、推薦部16は、特定部15によって推薦文章(I,J,K)のうち第1ユーザE(第4ユーザ)が読んでいない推薦文章として推薦文章Jが特定される場合、推薦文章Jを第1ユーザE(第4ユーザ)に推薦する。ここで一例として、推薦部16は、推薦文章Jの推薦文章情報を第1ユーザE(第4ユーザ)が使用するユーザ端末100に送信するよう通信部22を制御してもよい。また一例として、推薦部16は、推薦文章Jの推薦文章情報を表示するよう表示部23を制御してもよい。
 一例として、推薦文章情報は、対応情報の推薦文章に対応付けられていてもよく、推薦文章内容、著者、タイトル、インタビューを受けた研究者、発行所及び発行日等を始めとする種々の推薦文章を特定することができる情報であってもよい。
The recommendation unit 16 may recommend the recommended text identified by the identification unit 15 to the fourth user.
In the case illustrated in FIG. 1, as an example, the recommendation unit 16 causes the specification unit 15 to specify the recommended text K as the recommended text that the first user D (fourth user) has not read among the recommended texts (I, J, K). If identified, recommended sentence K is recommended to first user D (fourth user) (where symbols I, J, and K are not shown in FIG. 1). As an example, the recommendation unit 16 may control the communication unit 22 to transmit the recommended sentence information of the recommended sentence K to the user terminal 100 used by the first user D (fourth user). As another example, the recommendation unit 16 may control the display unit 23 to display the text information of the recommended text K. FIG. The recommendation unit 16, for example, similarly recommends a recommendation sentence for the first user C (fourth user).
Similarly, in the case of FIG. 1, as an example, the recommendation unit 16 selects a recommended text from among the recommended texts (I, J, K) that the first user E (fourth user) has not read, by the specifying unit 15. When J is identified, recommended sentence J is recommended to first user E (fourth user). As an example, the recommendation unit 16 may control the communication unit 22 to transmit the recommended sentence information of the recommended sentence J to the user terminal 100 used by the first user E (fourth user). As another example, the recommendation unit 16 may control the display unit 23 to display the recommended sentence information of the recommended sentence J. FIG.
As an example, the recommended text information may be associated with the recommended text of the corresponding information, and includes various recommendations including recommended text content, author, title, interviewed researcher, publisher, publication date, and the like. Information that can specify a sentence may be used.
[情報処理方法]
 次に、一実施形態に係る情報処理方法について説明する。
[Information processing method]
Next, an information processing method according to one embodiment will be described.
 まず、情報処理方法の概要について説明する。
 ステップST1として、制御部10は、ユーザ情報としての可視データVと、対応情報としての隠れた人物特性hとの出力結果に基づき、パラメータb,c,wを学習する。一例として、制御部10は、パラメータb,c,wの勾配法に基づく学習を行ってもよい。この場合、制御部10は、変化の幅が小さく、所定の基準を下回ると学習を停止してもよい。所定の基準は、学習するデータと実際の学習の様子とに基づいて決定されてもよい。推定部12は、パラメータb,c,wを所定のタイミング毎に学習してもよい。
First, an overview of the information processing method will be described.
As step ST1, the control unit 10 learns parameters b, c, and w based on output results of visible data V as user information and hidden person characteristics h as corresponding information. As an example, the control unit 10 may perform learning based on the gradient method for the parameters b, c, and w. In this case, the control unit 10 may stop learning when the range of change is small and falls below a predetermined standard. The predetermined criterion may be determined based on learning data and actual learning conditions. The estimation unit 12 may learn the parameters b, c, and w at predetermined timings.
 ステップST2として、制御部10は、第1ユーザについて、その第1ユーザの過去の可視データVを利用して人物特性h(第1ユーザの分類)を出力する(サンプリング)。さらに、制御部10は、その人物特性h(第1ユーザの分類)から再び可視データVを出力して(サンプリング)、未読の文章への第1ユーザの反応を予測する。この場合、制御部10は、既読の文章は推薦から除外し、未読の文章のみを推薦する。 As step ST2, the control unit 10 uses the past visible data V of the first user to output the person characteristic h (classification of the first user) for the first user (sampling). Furthermore, the control unit 10 outputs (sampling) the visible data V again from the person characteristics h (classification of the first user), and predicts the reaction of the first user to the unread text. In this case, the control unit 10 excludes read sentences from the recommendation and recommends only unread sentences.
 ステップST3として、制御部10は、未読の文章うち、第1ユーザの反応が良いと予測される文章をその第1ユーザに推薦する。 At step ST3, the control unit 10 recommends, to the first user, sentences that are expected to respond favorably to the first user among the unread sentences.
 次に、図4~6を用いて、一実施形態に係る情報処理方法について具体的に説明する。 Next, the information processing method according to one embodiment will be specifically described with reference to FIGS.
 まず、一実施形態に係る情報処理方法として、第1ユーザの分類を推定する処理について説明する。
 図4は、一実施形態に係る情報処理方法について説明するための第1のフローチャートである。
First, processing for estimating the classification of the first user will be described as an information processing method according to an embodiment.
FIG. 4 is a first flowchart for explaining an information processing method according to one embodiment.
 ステップST101において、第1取得部11は、ユーザ情報を取得する。ユーザ情報は、第1ユーザが文章を読んだ際にその第1ユーザが指標(ブックマーク)を付す特定位置に関する情報であってもよい。特定位置は、例えば、文章において指標が付される文節の位置、すなわち指標が付される文節を特性する情報であってもよい。
 この場合、第1取得部11は、例えば、第1ユーザがインタビュー記事を読んだ際に、第1ユーザが参考となるとしてインタビュー記事の特定の文節に付した1又は複数の指標の特定位置に関するユーザ情報を取得してもよい。
In step ST101, the first acquisition unit 11 acquires user information. The user information may be information about a specific position to which the first user marks (bookmarks) when the first user reads the text. The specific position may be, for example, the position of the indexed clause in the sentence, ie information that characterizes the indexed clause.
In this case, for example, when the first user reads the interview article, the first acquisition unit 11 obtains information about specific positions of one or more indices attached to a specific phrase of the interview article as a reference for the first user. User information may be obtained.
 ステップST102において、推定部12は、ステップST101で取得するユーザ情報と、記憶部21に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する。 In step ST102, the estimation unit 12 estimates the classification corresponding to the specific position of the first user based on the user information acquired in step ST101 and the correspondence information stored in the storage unit 21.
 ステップST103において、グループ設定部13は、ステップST102で推定される分類毎に複数の第1ユーザのグループ化を行う。すなわち、グループ設定部13は、例えば、ステップST102で複数の第1ユーザの分類が推定される場合、分類毎に複数の第1ユーザをグループ化する。 In step ST103, the group setting unit 13 groups a plurality of first users for each classification estimated in step ST102. That is, for example, when the classification of a plurality of first users is estimated in step ST102, the group setting unit 13 groups the plurality of first users for each classification.
 次に、一実施形態に係る情報処理方法として、第1ユーザ(第3ユーザ)に文章を推薦する処理について説明する。
 図5は、一実施形態に係る情報処理方法について説明するための第2のフローチャートである。
Next, processing for recommending text to the first user (third user) will be described as an information processing method according to an embodiment.
FIG. 5 is a second flowchart for explaining the information processing method according to one embodiment.
 ステップST201において、第2取得部14は、図4に示すステップST103でグループ化が行われた複数の第1ユーザのそれぞれのユーザ情報に基づいて、複数の第1ユーザが読んだ文章に関する文章情報を取得する。 In step ST201, the second acquisition unit 14 collects sentence information related to sentences read by the plurality of first users based on the user information of each of the plurality of first users grouped in step ST103 shown in FIG. to get
 ステップST202において、特定部15は、例えば、ステップST103でグループ化が行われた同一グループ内の複数の第1ユーザC,Dのうちの第2ユーザC及び第3ユーザDであって、第2取得部14によって取得する第2ユーザCの文章情報と第3ユーザDの文章情報に基づいて、第2ユーザCは読んだ文章のうち第3ユーザDは読んでいない文章を特定する。 In step ST202, the identifying unit 15 identifies, for example, the second user C and the third user D among the plurality of first users C and D in the same group grouped in step ST103, Based on the sentence information of the second user C and the sentence information of the third user D acquired by the acquisition unit 14, the sentences read by the second user C but not read by the third user D are specified.
 ステップST203において、推薦部16は、ステップST202で特定される文章を第3ユーザDに推薦する。すなわち、推薦部16は、ステップST202で特定される文章に関する文章情報を第3ユーザDが閲覧できるように出力部を制御する。推薦部16は、出力部として、例えば、通信部22及び表示部23のうち少なくとも一方、換言すると通信部22及び表示部23のグループから選択される少なくとも1つを制御してもよい。なお、推薦部16は、ステップST202で特定される文章に関する文章情報を記憶するよう、出力部の一実施形態としての記憶部21を制御してもよい。 In step ST203, the recommendation unit 16 recommends the text identified in step ST202 to the third user D. That is, the recommendation unit 16 controls the output unit so that the third user D can browse the text information related to the text specified in step ST202. The recommendation unit 16 may control, for example, at least one of the communication unit 22 and the display unit 23, in other words, at least one selected from the group of the communication unit 22 and the display unit 23, as the output unit. Note that the recommendation unit 16 may control the storage unit 21 as an embodiment of the output unit so as to store sentence information related to the sentence specified in step ST202.
 次に、一実施形態に係る情報処理方法として、第1ユーザ(第4ユーザ)に推薦文章を推薦する処理について説明する。
 図6は、一実施形態に係る情報処理方法について説明するための第3のフローチャートである。
Next, as an information processing method according to an embodiment, processing for recommending a recommended sentence to the first user (fourth user) will be described.
FIG. 6 is a third flowchart for explaining the information processing method according to one embodiment.
 ステップST301において、第2取得部14は、図4に示すステップST103でグループ化が行われた複数の第1ユーザのそれぞれのユーザ情報に基づいて、複数の第1ユーザが読んだ文章に関する文章情報を取得する。 In step ST301, the second acquisition unit 14 collects sentence information related to sentences read by the plurality of first users based on the user information of each of the plurality of first users grouped in step ST103 shown in FIG. to get
 ステップST302において、特定部15は、例えば、ステップST103でグループ化が行われた同一グループ内の1又は複数の第1ユーザEのうちの第4ユーザEであって、記憶部21に記憶される対応情報において対応付けられる推薦文章のうち、ステップST301で取得する第4ユーザEの文章情報に基づいて第4ユーザEが読んでいない推薦文章を特定する。 In step ST302, the identification unit 15 is, for example, the fourth user E among one or more first users E in the same group grouped in step ST103, and is stored in the storage unit 21. Of the recommended sentences associated in the correspondence information, recommended sentences that the fourth user E has not read are specified based on the sentence information of the fourth user E acquired in step ST301.
 ステップST303において、推薦部16は、ステップST302で特定される推薦文章を第4ユーザEに推薦する。すなわち、推薦部16は、ステップST302で特定される推薦文章に関する推薦文章情報を第4ユーザEが閲覧できるように出力部を制御する。推薦部16は、出力部として、例えば、通信部22及び表示部23のうち少なくとも一方、換言すると通信部22及び表示部23のグループから選択される少なくとも1つを制御してもよい。なお、推薦部16は、ステップST302で特定される推薦文章に関する推薦文章情報を記憶するよう、出力部の一実施形態としての記憶部21を制御してもよい。 In step ST303, the recommendation unit 16 recommends the recommended text identified in step ST302 to the fourth user E. That is, the recommendation unit 16 controls the output unit so that the fourth user E can browse the recommended sentence information related to the recommended sentence specified in step ST302. The recommendation unit 16 may control, for example, at least one of the communication unit 22 and the display unit 23, in other words, at least one selected from the group of the communication unit 22 and the display unit 23, as the output unit. Note that the recommendation unit 16 may control the storage unit 21 as an embodiment of the output unit so as to store recommended text information related to the recommended text specified in step ST302.
 上述した情報処理装置1の各部は、コンピュータの演算処理装置等の機能として実現されてもよい。すなわち、情報処理装置1の第1取得部11、推定部12、グループ設定部13、第2取得部14、特定部15及び推薦部16(制御部10)は、コンピュータの演算処理装置等による第1取得機能、推定機能、グループ設定機能、第2取得機能、特定機能及び推薦機能(制御機能)としてそれぞれ実現されてもよい。
 情報処理プログラムは、上述した各機能をコンピュータに実現させることができる。情報処理プログラムは、例えば、メモリ、ソリッドステートドライブ、ハードディスクドライブ又は光ディスク等の、コンピュータで読み取り可能な非一時的な記録媒体に記録されていてもよい。
 また、上述したように、情報処理装置1の各部は、コンピュータの演算処理装置等で実現されてもよい。その演算処理装置等は、例えば、集積回路等によって構成される。このため、情報処理装置1の各部は、演算処理装置等を構成する回路として実現されてもよい。すなわち、情報処理装置1の第1取得部11、推定部12、グループ設定部13、第2取得部14、特定部15及び推薦部16(制御部10)は、コンピュータの演算処理装置等を構成する第1取得回路、推定回路、グループ設定回路、第2取得回路、特定回路及び推薦回路(制御回路)として実現されてもよい。
 また、情報処理装置1の記憶部21、通信部22及び表示部23(出力部)は、例えば、演算処理装置等の機能を含む記憶機能、通信機能及び表示機能(出力機能)として実現されもよい。また、情報処理装置1の記憶部21、通信部22及び表示部23(出力部)は、例えば、集積回路等によって構成されることにより記憶回路、通信回路及び表示回路(出力回路)として実現されてもよい。また、情報処理装置1の記憶部21、通信部22及び表示部23(出力部)は、例えば、複数のデバイスによって構成されることにより記憶装置、通信装置及び表示装置(出力装置)として構成されてもよい。
Each unit of the information processing apparatus 1 described above may be implemented as a function of an arithmetic processing unit of a computer or the like. That is, the first acquiring unit 11, the estimating unit 12, the group setting unit 13, the second acquiring unit 14, the identifying unit 15, and the recommending unit 16 (control unit 10) of the information processing device 1 are configured by an arithmetic processing unit of a computer or the like. It may be implemented as a first acquisition function, an estimation function, a group setting function, a second acquisition function, a specific function, and a recommendation function (control function).
The information processing program can cause the computer to implement each function described above. The information processing program may be recorded in a non-temporary computer-readable recording medium such as a memory, solid state drive, hard disk drive, or optical disc.
Further, as described above, each part of the information processing device 1 may be realized by an arithmetic processing device of a computer or the like. The arithmetic processing unit or the like is configured by an integrated circuit or the like, for example. Therefore, each part of the information processing device 1 may be implemented as a circuit that constitutes an arithmetic processing device or the like. That is, the first acquisition unit 11, the estimation unit 12, the group setting unit 13, the second acquisition unit 14, the identification unit 15, and the recommendation unit 16 (control unit 10) of the information processing device 1 constitute an arithmetic processing unit of a computer. may be implemented as a first acquisition circuit, an estimation circuit, a group setting circuit, a second acquisition circuit, a specification circuit, and a recommendation circuit (control circuit).
Further, the storage unit 21, the communication unit 22, and the display unit 23 (output unit) of the information processing device 1 may be implemented as, for example, a storage function including functions of an arithmetic processing unit, a communication function, and a display function (output function). good. Further, the storage unit 21, the communication unit 22, and the display unit 23 (output unit) of the information processing device 1 are realized as a storage circuit, a communication circuit, and a display circuit (output circuit) by being configured by an integrated circuit or the like, for example. may The storage unit 21, the communication unit 22, and the display unit 23 (output unit) of the information processing device 1 are configured as a storage device, a communication device, and a display device (output device), for example, by being configured by a plurality of devices. may
 情報処理装置1は、上述した複数の各部のうち1又は任意の複数を組み合わせることが可能である。
 本開示では、「情報」の文言を使用しているが、「情報」の文言は「データ」と言い換えることができ、「データ」の文言は「情報」と言い換えることができる。
The information processing apparatus 1 can combine any one of the plurality of units described above or an arbitrary plurality thereof.
Although the term "information" is used in this disclosure, the term "information" can be interchanged with "data" and the term "data" can be interchanged with "information."
[本実施形態の態様及び効果]
 次に、本実施形態の一態様及び各態様が奏する効果について説明する。なお、以下に記載する効果は一例であり、各態様が奏する効果は以下に記載するものに限定されることはない。
[Aspects and effects of the present embodiment]
Next, one aspect of the present embodiment and effects produced by each aspect will be described. Note that the effects described below are merely examples, and the effects of each aspect are not limited to those described below.
(態様1)
 一態様の情報処理装置は、文章と、その文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、文章と特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶部と、第1ユーザが文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得部と、第1取得部によって取得するユーザ情報と、記憶部に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定部と、推定部によって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定部と、を備える。
 これにより、情報処理装置は、文章の推薦に利用する、ユーザのグループ化を行うことができる。情報処理装置は、ユーザが所定単位毎に区切った文章に指標を付すため、ユーザそれぞれの好みをより反映させることができ、ユーザの分類をより適切に行うことができる。
(Aspect 1)
An information processing apparatus according to one aspect includes a text, a specific position where a user attaches an index to a predetermined unit when the text is divided into predetermined units, and a user classification estimated based on the text and the specific position. a first acquisition unit for acquiring user information about a specific position marked by the first user when the first user reads a sentence; an estimation unit for estimating a classification corresponding to a specific position of the first user based on the acquired user information and the corresponding information stored in the storage unit; and a plurality of first users for each classification estimated by the estimation unit. and a group setting unit that performs grouping.
Accordingly, the information processing apparatus can group users for use in recommending sentences. Since the information processing apparatus attaches an index to the sentences that the user has divided into predetermined units, it is possible to reflect the preferences of each user and to classify the users more appropriately.
(態様2)
 一態様の情報処理装置は、グループ設定部によってグループ化が行われた複数の第1ユーザのそれぞれのユーザ情報に基づいて、複数の第1ユーザが読んだ文章に関する文章情報を取得する第2取得部と、グループ設定部によってグループ化が行われた複数の第1ユーザのうちの第2ユーザ及び第3ユーザであって、第2取得部によって取得する第2ユーザの文章情報と第3ユーザの文章情報に基づいて、第2ユーザは読んだ文章のうち第3ユーザは読んでいない文章を特定する特定部と、特定部によって特定される文章を第3ユーザに推薦する推薦部と、を備えることとしてもよい。
 同一グループ内の複数のユーザは、文章に対して興味を持つ点及び文章の参考となる点等に類似性が有ると考えられる。したがって、情報処理装置は、同一グループ内のユーザに対して自身の好みにあった文章を推薦することができる。
(Aspect 2)
An information processing apparatus according to one aspect acquires text information related to text read by a plurality of first users based on user information of each of the plurality of first users grouped by a group setting unit. and a second user and a third user among a plurality of first users grouped by the group setting unit, wherein the text information of the second user and the text information of the third user acquired by the second acquisition unit Based on the sentence information, the second user has a specification unit that specifies sentences that the third user has not read among the sentences that the second user has read, and a recommendation unit that recommends the sentences specified by the specification unit to the third user. You can do it.
A plurality of users in the same group are considered to have similarities in terms of their interest in the text and the points of reference for the text. Therefore, the information processing apparatus can recommend sentences that suit the user's taste to users in the same group.
(態様3)
 一態様の情報処理装置では、記憶部が記憶する対応情報には、ユーザの分類毎に推薦する推薦文章がさらに対応付けられており、特定部は、グループ設定部によってグループ化が行われた複数の第1ユーザのうちの第4ユーザであって、記憶部に記憶される対応情報において対応付けられる推薦文章のうち、第2取得部によって取得する第4ユーザの文章情報に基づいて第4ユーザが読んでいない推薦文章を特定し、推薦部は、特定部によって特定される推薦文章を第4ユーザに推薦することとしてもよい。
 これにより、情報処理装置は、ユーザの分類毎、すなわちグループ毎に対応付けられる推薦文章を、対応するグループに分類するユーザに対して推薦することができる。
(Aspect 3)
In one aspect of the information processing device, the correspondence information stored in the storage unit is further associated with recommended sentences recommended for each user classification, and the specifying unit includes a plurality of sentences grouped by the group setting unit. A fourth user among the first users of the fourth user based on the text information of the fourth user acquired by the second acquisition unit among the recommended texts associated in the correspondence information stored in the storage unit The fourth user may specify the recommended sentences that the fourth user has not read, and the recommendation unit may recommend the recommended sentences specified by the specifying unit to the fourth user.
Accordingly, the information processing apparatus can recommend recommended sentences associated with each user classification, that is, each group, to the users classified into the corresponding groups.
(態様4)
 一態様の情報処理装置では、記憶部が記憶する対応情報に対応付けられる文章は、研究者をインタビューした際に作成されるインタビュー記事であり、対応情報に対応付けられる特定位置は、文章としてのインタビュー記事を所定単位としての文節毎に区切った際の文節の位置であり、第1取得部は、第1ユーザがインタビュー記事を読んだ際に、第1ユーザが参考となるとしてインタビュー記事の特定の文節に付した1又は複数の指標の特定位置に関するユーザ情報を取得することとしてもよい。
 これにより、情報処理装置は、ユーザが興味を持つインタビュー記事に応じてそのユーザを分類することができる。
(Aspect 4)
In one aspect of the information processing device, the sentence associated with the correspondence information stored in the storage unit is an interview article created when interviewing a researcher, and the specific position associated with the correspondence information is a sentence as a sentence. It is the position of the clause when the interview article is divided into clauses as a predetermined unit, and the first acquisition unit specifies the interview article as a reference for the first user when the first user reads the interview article. It is also possible to obtain user information on the specific position of one or more indicators attached to the clause.
Thereby, the information processing device can classify the user according to the interview article that the user is interested in.
(態様5)
 一態様の情報処理方法では、文章と、その文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、文章と特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶部を備えるコンピュータが、第1ユーザが文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得ステップと、第1取得ステップによって取得するユーザ情報と、記憶部に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定ステップと、推定ステップによって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定ステップと、を実行する。
 これにより、情報処理方法は、上述した一態様の情報処理装置と同様の効果を奏することができる。
(Aspect 5)
In one aspect of the information processing method, a sentence, a specific position where a user attaches an index to a predetermined unit when the sentence is divided into predetermined units, and a classification of the user estimated based on the sentence and the specific position a first obtaining step of obtaining user information about a specific position marked by the first user when the first user reads a sentence; an estimation step of estimating a classification according to the specific position of the first user based on the user information acquired by the acquisition step and the correspondence information stored in the storage unit; and a group setting step for grouping one user.
Accordingly, the information processing method can achieve the same effect as the information processing apparatus of one aspect described above.
(態様6)
 一態様の情報処理プログラムは、コンピュータに、文章と、その文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、文章と特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶機能と、第1ユーザが文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得機能と、第1取得機能によって取得するユーザ情報と、記憶機能に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定機能と、推定機能によって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定機能と、を実現させる。
 これにより、情報処理プログラムは、上述した一態様の情報処理装置と同様の効果を奏することができる。
(Aspect 6)
An information processing program of one aspect provides a computer with a sentence, a specific position marked by a user on a predetermined unit when the sentence is divided into predetermined units, and an estimation based on the sentence and the specific position. a storage function for storing correspondence information associated with user classifications; a first acquisition function for acquiring user information relating to a specific position marked by the first user when the first user reads a sentence; an estimation function for estimating a classification corresponding to a specific position of the first user based on the user information acquired by the acquisition function and the correspondence information stored in the storage function; and a group setting function for grouping one user.
Accordingly, the information processing program can achieve the same effect as the information processing apparatus of one aspect described above.
1 情報処理装置
10 制御部
11 第1取得部
12 推定部
13 グループ設定部
14 第2取得部
15 特定部
16 推薦部
21 記憶部
22 通信部
23 表示部
1 information processing device 10 control unit 11 first acquisition unit 12 estimation unit 13 group setting unit 14 second acquisition unit 15 identification unit 16 recommendation unit 21 storage unit 22 communication unit 23 display unit

Claims (6)

  1.  文章と、当該文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、前記文章と前記特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶部と、
     第1ユーザが前記文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得部と、
     前記第1取得部によって取得するユーザ情報と、前記記憶部に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定部と、
     前記推定部によって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定部と、
    を備える情報処理装置。
    A correspondence obtained by associating a sentence, a specific position where a user attaches an index to a predetermined unit when the sentence is divided into predetermined units, and a classification of the user estimated based on the sentence and the specific position. a storage unit that stores information;
    a first acquisition unit that acquires user information about a specific position marked by the first user when the first user reads the text;
    an estimation unit for estimating a classification corresponding to a specific position of the first user based on the user information acquired by the first acquisition unit and the correspondence information stored in the storage unit;
    a group setting unit that groups a plurality of first users for each classification estimated by the estimation unit;
    Information processing device.
  2.  前記グループ設定部によってグループ化が行われた複数の第1ユーザのそれぞれのユーザ情報に基づいて、複数の第1ユーザが読んだ文章に関する文章情報を取得する第2取得部と、
     前記グループ設定部によってグループ化が行われた複数の第1ユーザのうちの第2ユーザ及び第3ユーザであって、前記第2取得部によって取得する第2ユーザの文章情報と第3ユーザの文章情報に基づいて、第2ユーザは読んだ文章のうち第3ユーザは読んでいない文章を特定する特定部と、
     前記特定部によって特定される文章を第3ユーザに推薦する推薦部と、
    を備える請求項1に記載の情報処理装置。
    a second acquisition unit that acquires sentence information related to sentences read by a plurality of first users based on the user information of each of the plurality of first users grouped by the group setting unit;
    A second user and a third user among a plurality of first users grouped by the group setting unit, wherein text information of the second user and text of the third user acquired by the second acquisition unit a specifying unit for specifying, based on the information, sentences read by the second user but not read by the third user;
    a recommendation unit that recommends the text specified by the specifying unit to a third user;
    The information processing apparatus according to claim 1, comprising:
  3.  前記記憶部が記憶する対応情報には、ユーザの分類毎に推薦する推薦文章がさらに対応付けられており、
     前記特定部は、前記グループ設定部によってグループ化が行われた複数の第1ユーザのうちの第4ユーザであって、前記記憶部に記憶される対応情報において対応付けられる推薦文章のうち、前記第2取得部によって取得する第4ユーザの文章情報に基づいて第4ユーザが読んでいない推薦文章を特定し、
     前記推薦部は、前記特定部によって特定される推薦文章を第4ユーザに推薦する
    請求項2に記載の情報処理装置。
    The correspondence information stored in the storage unit is further associated with recommended sentences recommended for each user classification,
    The identifying unit selects a fourth user among the plurality of first users grouped by the group setting unit, and selects the recommended text corresponding to the corresponding information stored in the storage unit. Identifying recommended texts not read by the fourth user based on the text information of the fourth user acquired by the second acquisition unit;
    The information processing apparatus according to claim 2, wherein the recommendation unit recommends the recommended text specified by the specification unit to the fourth user.
  4.  前記記憶部が記憶する対応情報に対応付けられる前記文章は、研究者をインタビューした際に作成されるインタビュー記事であり、対応情報に対応付けられる特定位置は、前記文章としての前記インタビュー記事を所定単位としての文節毎に区切った際の文節の位置であり、
     前記第1取得部は、第1ユーザがインタビュー記事を読んだ際に、第1ユーザが参考となるとして前記インタビュー記事の特定の文節に付した1又は複数の指標の特定位置に関するユーザ情報を取得する
    請求項1~3のいずれか1項に記載の情報処理装置。
    The sentence associated with the correspondence information stored in the storage unit is an interview article created when the researcher was interviewed, and the specific position associated with the correspondence information is the interview article as the sentence. It is the position of the clause when it is divided for each clause as a unit,
    When the first user reads the interview article, the first acquisition unit acquires user information about a specific position of one or more indicators attached to a specific phrase of the interview article as a reference for the first user. The information processing apparatus according to any one of claims 1 to 3.
  5.  文章と、当該文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、前記文章と前記特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶部を備えるコンピュータが、
     第1ユーザが前記文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得ステップと、
     前記第1取得ステップによって取得するユーザ情報と、前記記憶部に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定ステップと、
     前記推定ステップによって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定ステップと、
    を実行する情報処理方法。
    A correspondence obtained by associating a sentence, a specific position where a user attaches an index to a predetermined unit when the sentence is divided into predetermined units, and a classification of the user estimated based on the sentence and the specific position. A computer comprising a storage unit for storing information,
    a first obtaining step of obtaining user information about a specific position marked by the first user when the first user reads the text;
    an estimation step of estimating a classification corresponding to a specific position of the first user based on the user information acquired by the first acquisition step and the correspondence information stored in the storage unit;
    a group setting step of grouping a plurality of first users for each classification estimated by the estimation step;
    Information processing method that performs
  6.  コンピュータに、
     文章と、当該文章を所定単位毎に区切った際にユーザによって所定単位に指標が付される特定位置と、前記文章と前記特定位置とに基づいて推定されるユーザの分類とを対応付けた対応情報を記憶する記憶機能と、
     第1ユーザが前記文章を読んだ際に第1ユーザが指標を付す特定位置に関するユーザ情報を取得する第1取得機能と、
     前記第1取得機能によって取得するユーザ情報と、前記記憶機能に記憶する対応情報とに基づいて、第1ユーザの特定位置に応じた分類を推定する推定機能と、
     前記推定機能によって推定される分類毎に複数の第1ユーザのグループ化を行うグループ設定機能と、
    を実現させる情報処理プログラム。
    to the computer,
    A correspondence obtained by associating a sentence, a specific position where a user attaches an index to a predetermined unit when the sentence is divided into predetermined units, and a classification of the user estimated based on the sentence and the specific position. a memory function for storing information;
    a first acquisition function for acquiring user information about a specific position marked by the first user when the first user reads the text;
    an estimation function for estimating a classification corresponding to a specific position of the first user based on the user information acquired by the first acquisition function and the correspondence information stored in the storage function;
    A group setting function for grouping a plurality of first users for each classification estimated by the estimation function;
    Information processing program that realizes
PCT/JP2022/025966 2021-08-10 2022-06-29 Information processing device, information processing method, and information processing program WO2023017690A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2021130738A JP7373213B2 (en) 2021-08-10 2021-08-10 Information processing device, information processing method, and information processing program
JP2021-130738 2021-08-10

Publications (1)

Publication Number Publication Date
WO2023017690A1 true WO2023017690A1 (en) 2023-02-16

Family

ID=85199929

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2022/025966 WO2023017690A1 (en) 2021-08-10 2022-06-29 Information processing device, information processing method, and information processing program

Country Status (2)

Country Link
JP (1) JP7373213B2 (en)
WO (1) WO2023017690A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013109734A (en) * 2011-11-24 2013-06-06 Canon Inc Document retrieval device, document retrieval method and program
JP2014235632A (en) * 2013-06-04 2014-12-15 キヤノン株式会社 Document management system, control method of document management system and program
US20200097509A1 (en) * 2018-04-30 2020-03-26 Innoplexus Ag System and method for providing recommendations of documents

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013109734A (en) * 2011-11-24 2013-06-06 Canon Inc Document retrieval device, document retrieval method and program
JP2014235632A (en) * 2013-06-04 2014-12-15 キヤノン株式会社 Document management system, control method of document management system and program
US20200097509A1 (en) * 2018-04-30 2020-03-26 Innoplexus Ag System and method for providing recommendations of documents

Also Published As

Publication number Publication date
JP7373213B2 (en) 2023-11-02
JP2023025470A (en) 2023-02-22

Similar Documents

Publication Publication Date Title
US20200250465A1 (en) Accurate tag relevance prediction for image search
Lu et al. Content-based collaborative filtering for news topic recommendation
US9965717B2 (en) Learning image representation by distilling from multi-task networks
US8027977B2 (en) Recommending content using discriminatively trained document similarity
JP5350472B2 (en) Product ranking method and product ranking system for ranking a plurality of products related to a topic
US8559682B2 (en) Building a person profile database
US20170236055A1 (en) Accurate tag relevance prediction for image search
US8805026B1 (en) Scoring items
US10311374B2 (en) Categorization of forms to aid in form search
US20080281764A1 (en) Machine Learning System
CN110147499B (en) Labeling method, recommendation method and recording medium
US11301528B2 (en) Selecting content objects for recommendation based on content object collections
JP2005317018A (en) Method and system for calculating importance of block in display page
US20130198192A1 (en) Author disambiguation
CN105874427B (en) Help information is identified based on application context
CA2632156A1 (en) Information retrieval system and method using a bayesian algorithm based on probabilistic similarity scores
JP2014197300A (en) Text information processor, text information processing method, and text information processing program
Jo Using K Nearest Neighbors for text segmentation with feature similarity
US9223854B2 (en) Document relevance determining method and computer program
CN112151187B (en) Information query method, device, computer equipment and storage medium
WO2022010579A1 (en) Document conversion engine
KR20190130410A (en) Apparatus and method for recommending customized content for smart senior
WO2023017690A1 (en) Information processing device, information processing method, and information processing program
TW201243627A (en) Multi-label text categorization based on fuzzy similarity and k nearest neighbors
Wei et al. Finding related publications: extending the set of terms used to assess article similarity

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22855754

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE