WO2015191063A1 - Systems and methods for content on-boarding - Google Patents
Systems and methods for content on-boarding Download PDFInfo
- Publication number
- WO2015191063A1 WO2015191063A1 PCT/US2014/041935 US2014041935W WO2015191063A1 WO 2015191063 A1 WO2015191063 A1 WO 2015191063A1 US 2014041935 W US2014041935 W US 2014041935W WO 2015191063 A1 WO2015191063 A1 WO 2015191063A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- recommendation
- modified
- candidate document
- data items
- program code
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 74
- 238000011160 research Methods 0.000 claims abstract description 39
- 238000004590 computer program Methods 0.000 claims abstract description 16
- 238000012552 review Methods 0.000 claims description 24
- 230000004044 response Effects 0.000 claims description 12
- 230000001960 triggered effect Effects 0.000 claims description 4
- 230000004931 aggregating effect Effects 0.000 claims 4
- 230000002730 additional effect Effects 0.000 claims 1
- KISFEBPWFCGRGN-UHFFFAOYSA-M sodium;2-(2,4-dichlorophenoxy)ethyl sulfate Chemical compound [Na+].[O-]S(=O)(=O)OCCOC1=CC=C(Cl)C=C1Cl KISFEBPWFCGRGN-UHFFFAOYSA-M 0.000 claims 1
- 230000008569 process Effects 0.000 description 22
- 238000004891 communication Methods 0.000 description 21
- 230000000737 periodic effect Effects 0.000 description 9
- 230000009471 action Effects 0.000 description 7
- 230000008859 change Effects 0.000 description 6
- 239000003973 paint Substances 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 230000002950 deficient Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000000153 supplemental effect Effects 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- IRLPACMLTUPBCL-KQYNXXCUSA-N 5'-adenylyl sulfate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OS(O)(=O)=O)[C@@H](O)[C@H]1O IRLPACMLTUPBCL-KQYNXXCUSA-N 0.000 description 2
- 206010057430 Retinal injury Diseases 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000008021 deposition Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000002085 persistent effect Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- RTAQQCXQSZGOHL-UHFFFAOYSA-N Titanium Chemical compound [Ti] RTAQQCXQSZGOHL-UHFFFAOYSA-N 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 229940000425 combination drug Drugs 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Definitions
- This disclosure relates generally to the collection of content. More specifically, the disclosure is directed towards systems and methods for recommending the uploading, also referred to as on-boarding, of candidate documents to an on-line research system.
- On-line research systems are invaluable tools that are used in nearly every business, legal, scientific and academic environment.
- online research systems are continually used by attorneys, court personnel and students in orde to continuously research and be kept informed of the most recent legal decisions, statutes and legislation, indeed, it is not uncommon for a judicial decision issued as recently as the previous day to have a substantial impact on an attorney's strateg or a court's legal, analysis. Further, it is not ' uncommon for a judicial decision from a lower court or a different jurisdiction to also have an impact on an attorney's strategy or a court's legal analysis.
- the candidate court, document may be lost forever, despite a change in circumstances or user requirements that may greatly increase the importance and relevancy of the candidate document.
- These risks are especially prevalent when looking io coart. documents from lower courts or unpopular jurisdictions, where this manual, process is employed more often to result in the candidate document not being on-boarded.
- One theoretical "solution” to this issue is to simply on-board all documents, including for example, small claim court decisions, local building code, and the like. Yet such a "solution” is simply not economically feasible given the high potential cost of on-boarding ail documents.
- the method includes receiving, from an electronic device, a set of data items associated with a candidate document, the candidate document being a document that is a candidate to be made available via the on-line research system and storing the set of da ta items in a first memory.
- the set of data items are then automatically analyzed using a computer program stored in the first memory and a recommendation as to whether to obtain or not obtain the candidate document is generated using the computer program , A signal is then generated based upon the
- the method further includes, in response to the step o f t ransmitting the signal and wherein the recommendation is an obtain recomm endation, receiving an. electronic version of the candidate documen t from the electronic device and storing the electronic version in a second memory.
- the method further includes, in response to the ste of transmitting the recommendation and wherein the recommendation is a do not obtain
- the method further includes storing the modified recommendation, in the first memory, generating a modified signal based upon the modified recommendation and transmitting the modified signal to the electronic device.
- the step of reviewing is ongoing and is triggered by at least one of an event and an end of a time period since the last time the step of reviewing was performed. I one embodiment, the method farther includes, in response to the step of transmitting the modified signal and wherein the modified
- recommendation is an obtain recommendation, receiving an electronic version of the candidate document from the electronic de vice and storing the electronic version in the second memory.
- a system as well as articles that include a machine-readable medium storing machine-readable program code for implementing the various techniques, are disclosed. Details of various embodiments are discussed in greater detail below.
- FIG. 1 is a schematic depicting an exemplary computer-based system for generating a. recommendation to on-board a candidate document to an on-line research system
- FIG . 2 is a flow diagram illustrating an exemplary computer-implemented method for generating a recommendation to on-board a candidate document to an on-line research system
- FIG. 3 is a flow diagram illustrating an exemplary computer-implemented method for on-boarding a candidate document to an on-line research system
- FIG. 4 is a flow diagram illustrating an exemplar computer-implemented method for generating a modified recommendation to an on-line research system during a subsequent review
- FIG. 5 is a flow diagram il lustrating a farther detailed exemplary computer- implemented method for generating a .recommendation to on-board a candidate document to an on-line research system; and [0017] FIGS. 6 and 6 A are flow diagrams illustrating a further detailed exemplary computer-implemented method for generating a modified recommendation to an on-line research system during a subsequent review.
- FIG. 1 an example of a suitable computing system 100 within which embodiments of the disclosure may be implemented is presented.
- the computing system 100 is only one example and is not intended to suggest any l imitation as to the scope of use or functionality of the disclosure. Neither should the computing system 100 be interpreted as having any dependency or requirement relating to any one or combina tion of illustrated components.
- the present disclosure is operational with numerous other general purpose or special purpose computing consumer electronics, network PCs, minicomputers- mainframe computers, laptop computers, as well as distributed computing environments that include any of the above systems or devices, and the like.
- the disclosure may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer.
- program modules include routines, programs, objects, components, data- structures, loop code segments and constructs, and other computer instruction known t those skilled in the art that perform particular tasks or implement particular abstract data types.
- the disclosure can be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network, to a distributed computing environment, program modules are located in both local and remote computer storage media including memory storage devices. Tasks performed by the programs and modules are described bel w and with the aid of figures. Those skilled in the art may implement the description and figures as processor executable instructions, which may be written on any form of a computer readable media.
- the computing system 100 includes a server device .1 10 configured to include a processor 1 12, such as a central processing unit
- the server 1 10 is part of an on-line research system. In another embodiment, the server 1 10 is separate from the on-line research system and transmits one or more candidate documents to be stored within the on-line research system.
- the non-volatile memory As shown in the Fig. 1. example, in one embodiment, the non-volatile memory
- the 120 is configured to include a recommendation module 122, a scoring module 124 and a communication module 126.
- the scoring module 124 is configured to analyze one or more data items associated with a candidate document on an iterative basis and generate a score for each of the data items using one or more of the rules maintained in a set of predefined scoring patterns 136, as well as a combined overall score for the candidate document using the individual data item scores.
- the recommendation module 122 is configured to generate a recommendation to obtain or to not obtain a candidate document, using one or more of the rules mai ntained in the set of predefined scoring patterns 1 36 and the scores generated by the scoring module 124.
- a communication module 126 is provided to receive the set of data items associated with the candidate document, as well as any additional information related to the set of data items, and to generate and transmit a signal associated with a recommendation to obtain or not obtain the candidate document. Additional details of modules 122, 124 and 126 are discussed in connection with FIGS, 2-5.
- a network 1 0 is provided that can include various devices such as routers, server, and switching elements connected in an Intranet, Extranet or internet, configuration.
- the network 150 uses wired
- the network 150 employs wireless communication protocols to transfer information between the access device 160, the server device 1 1 , the dat store 1 30 and the content servers 170 and 180.
- the network 1 0 may be a cellular or mobile network employing digital cellular standards including but not limited to the 3GPP, 3GPP2 and AMPS faniilv of standards such as Global System for Mobile Communications (GSM), Genera! Packet Radio Service (GPRS). CDMAOne, CDMA20O0, Evolution-Data Optimized (EV-DO), LIE Advanced, Enhanced Data Rates for GSM Evolution (EDGE), Universal Mobile Telecommunications System (UMTS), Digital
- GSM Global System for Mobile Communications
- GPRS Genera! Packet Radio Service
- DECT Enhanced Cordless Telecommunications
- IS-136 TDMA Digital AMPS
- the network 150 may also be a Wide Area Network (W AN), such as the internet, which employs one or more transmission protocols, e.g. TCP/IP.
- W AN Wide Area Network
- the network 150 may employ a combination of digital cellular standards and transmission protocols, in yet other embodiments, the network 150 may employ a combination of wired and wireless technologies to transfer information between the access device 160, the sewer device 1 10, the data store 130 and the content servers 170 and ⁇ 80.
- the data store 130 is a repository that maintains and stores information utilized by the before-mentioned modules 122, 124 and 126.
- the data store 130 is a relational database.
- the data store 130 is a director server, such as a Lightweight Directory Access Protocol f'LDAP").
- the data store 130 is an area of non-volatile memory 120 of the server device 1 10.
- the data store 1.30 includes a candidate data item database 132, an on-boarded document database 134 and the set of predefined scoring patterns 136.
- the on-boarded document database 134 maintains the electronic versions of the on-boarded documents, i.e. candidate documents that have been uploaded, or on-boarded, .from, the access device 1 0. Examples of on-boarded documents include, but are not limi ted to, court documents, such as judicial,
- the candidate data item database 132 maintains the set of data items received, by the communication module 126 and used by the scoring module 124 to analyze whether a candidate document should be on-boarded.
- the set of data items are derived from a candidate document and may include, but are not limited to, the candidate document type, level and jurisdiction of the court, identification of the parties, identification of the court officers, including the judge or judge and counsel for the parties, nature of the legal mater, and subject Biatter data, such as the legal concept at issue, damages data and fact pattern data.
- the candidate data item database 1 2 .maintains the set of data items in a structured dat store, such as a relational or hieratehal database.
- the set of predefined scoring patterns 136 includes one or more scoring rules used by the scoring module 124 to score individual data items and the candidate document itself and by the recommendation module 122 in order to make a
- the set of predefined scoring patterns 136 is maintained in a structured data store, such as a relational or hierarchal database, and is established by an administrator using the administrator device 190 to determine the scoring patterns to be utilized by the scoring module 124 in scoring individual dat items,
- the data store 130 shown in FIG. I is connected to the network 150. It will be appreciated by one skilled in the art that the data store 130 and/or any of the information shown therein, can be distributed across various servers and be accessible to the sewer 1 10 over the network 150; be coupled directly to the server 1 10; be configured as part of server 1 10 and interconnected to processor 1 12.
- RAM i 14 the one or more input-output devices 116 and the non-volatile memory 120 via the common bus i 18; or be configured in an area of non-volatile memory 120 of the server 1 10.
- Content servers 170 and 180 are each configured to include a content server processor, RAM, one or more input-output devices, such as a display device and keyboard, and non-volatile memory, all of which are interconnected via a common bus and controlled by the respective content server processor. According to one embodiment, content, servers 170 and 180 provide additional information to the scoring module 124 so that it can ' perform a subsequent analysis of the set of data items.
- the content server 170 may be a server that, provides news content, such as global or national news, financial news, sporting event news or entertainment news.
- the content server 180 may be a library of usage data regarding the activities of users of the on-line research system.
- the access device 160 is mobile device having graphical user interface ("GUI") 1 4, a digital signal processor 1 2, with an application module 2 A, internal and external storage components (not shown), a power management system (not shown), an audio component (not shown), audio input/output components (not shown), an image capture and process system (not shown), RF antenna (not shown) and a subscriber identification module (SIM) (not shown).
- GUI graphical user interface
- SIM subscriber identification module
- the access device 160 is a general purpose or special purpose computing device comprising a processor, transien and persistent storage devices, an input/output subsystem, a bus to provide a communications path between components comprising the general purpose or special purpose computer, and a web- based client application, such as a web browser, which allows a user to access the server 1 10 and the content servers 170 and ⁇ 80,
- a web- based client application such as a web browser
- Examples of web browsers are known in the art, and include well-known web browsers such as such as Microsoft® internet Explorer®, Google ChromeTM, ozilla Firefoxf and Apple® Safari®.
- the admi istrator device 190 is a general purpose or special purpose computing device comprising a processor, transient and persistent storage devices, an input/output subsystem, a bus to provide a communications path betwee
- administrator device .1 0 is a mobile device having a GUI (not shown), a digital signal processor with an application module (not shown), internal and external storage components (not shown), a power management:
- an audio component (not shown ), an audio input/output components (not shown), an image capture and process system (not shown), RF antenna (not shown), and a subscriber identification module (SIM) (not shown).
- SIM subscriber identification module
- system 100 shown in FIG. 1 is only one embodiment of the disclosure.
- Other system embodiments of the disclosure may include additional structures that are not shown, such as secondary storage and additional computational devices.
- various other embodiments of the disclosure include fewer structures than those shown in FIG. 1.
- the disclosure is implemented on a single computing device in a non-networked standalone configuration. Data input and requests are communicated to the computing device via an input device, such as a keyboard and/or mouse. Data output, such as the computed significance score, of the system is communicated from the computing device to a display device, such as a computer monitor.
- the communication module 126 receives a set of data items associated with a candidate document from the access device 160, step 210.
- a candidate document include, but are not limited, court documents, such as judicial decisions, orders and opinions, complaints, answers, briefs, legal memorandum, expert reports, deposition transcripts, trial transcripts, hearing tratiscripts and party contentions, as well as state and federal statutes, administrative codes, newspaper and magazine articles, public records, Saw journals, law review's, treatises and legal forms.
- the access device 160 is a mobile device in which a user sitting in a remote location, such as legal runner in a courthouse, uploads a set of data items associated with the candidate document located at the courthouse via the network 150 to the communication module 126 through the user interface 164.
- the access device 160 is a general purpose computer from which a user monitors publicly accessible databases and uploads, via the network 150 t the communication module 126, a set of data i tems associated with a candida te doc umen t the user located in the one of the public ly accessible databases.
- the candidate document is a court document and the set of data items associated with the candidate document includes the candidate document type, level and jurisdiction of the court, identification of the pat ties, identification of the court officers, including the judge and/or counsel for the parties, and subject matter data, such as the legal concept at issue, damages data and fact pattern data.
- the candidate document may be a complaint asserting a products liability cause of action filed in the Supreme Court of the State of New York and the set of dat items includes: "doc type: first filed complaint,"
- the user interface 164 in order to receive the set of data items associated wit a candidate document, provides a combination of vacant search fields configured to receive text inputs, e.g. a text box, and pre-defined fields, which include a plurality of pre-defined input values presented in a drop-down menu configuration.
- the user interface 164 may provide a pre-defined field for the "doc type” that includes a drop down menu with a listing of pre-defined document type values, such as "complaint,” “answer” “answer and eounter-ciaini(s)" from which the user may choose, while providing a vacant field for the deiendani party data.
- the user interface 164 provides only vacant search fields for a user to manually enter the set of data items associated with the candidate document, hi another embodiment, the user interface 164, provides a combination of predefined and vacant input fields, which are a presented to a user in a prioritized sequence, so that the user may input data values in an iterative step process as additional information is needed,
- the set of data items is the stored in memor within the candidate data item database 132.
- the set of data items associated with the candidate document is stored and maintained in a structured document, such as an. extensible Markup Language (XML) file.
- XML extensible Markup Language
- the set of received data items are then automatically analyzed in order to determine a
- the scoring module 124 analyzes the set of received data items associated with the candidate document and determines individual scores for each of the data items in order to subsequently determine an overall score for the candidate document. Details regarding the specific scoring methodology are discussed in connection with FIGS, 5, 6 and 6A. lite scoring module 124 makes its determination using the set of the pre-defined scoring patterns 136, In another embodiment, the scoring module 124 analyzes the set of received data items associated with the candidate document using a logic model.
- the recommendation module 122 to obtain or not obtain the candidate document.
- the recommendation module 122 receives an overall score for the candidate document and determines whether the candidate document should be presently obtained or not. obtained by the system 100.
- the recommendation module 122 makes its determination to obtain or not obtain the candidate doc ument, based on whether the overa ll score of the doc ument is greater than a threshold value.
- the recommendation module 122 makes its determination based on the overall outcome of a logic model. According to another
- the recommendation module 122 first makes a determinatio as to whether that is has scores for sufficient number data items to make a reconimendation and if not, generates a recommendation requesting additional data items.
- the communication module 126 once the recommendation is generated, the communication module 126 generates a signal associated with the
- the communication module 1.26 transmits the generated signal to the access device 160, In one embodiment, the communication module 126 transmits the signal immediately upon completion of the generation of the signal .
- the communication module 126 transmits the recommendation generated by the recommendation module 122 to the access device 160.
- the recommendation is one of two. possibilities; (i) ''Obtain Candidate Document" or (it) "Do Not Obtain Candidate
- the recommendatio is one of three
- step 320 a determination is made as to whether or not to obtain the candidate document. If the recommendation is to obtain the candidate document, process flow continues to step 330, in whi ch a electronic version of the candidate document is generated .
- the user of the access device 1.60 receives the recommendatio and undertakes the process of generating the electronic version of the document.
- an electroaic version the document is generated using techniques well-known in the an such as via use of a conventional paper scanner or a camera on a smartphone or tablet wi th a corresponding mobile application.
- the candidate document is a complaint asserting a produc ts liability cause of action filed in the Supreme Court of the State of New York
- a legal .runner sitting in the courthouse having received the recommendation on his mobile device to obtain the complaint, generates an electronic version of the complaint using a portable scanner connected to his mobile device.
- step 340 once the electronic version of the candidate document is generated, it is transmitted from the access device 1 0 to the communication module 126, step 340.
- the electronic version of the candidate document is then stored in memory within candidate document database 134,
- step 320 if the recommendation is to not obtain the candidate document, then the set of data items associated with the candidate document is maintained in the candidate data item database 132 for subsequent review as shown in step 360.
- step 370 the set of data items associated with the candidate document are later analyzed in view of additional information related to the set of data items.
- additional information related to the set of data item includes an increase in notoriety for a given data item. For example, where one of the data items for a candidate court document is the identity of a lawyer corporation, additional information discovered during a subsequent review would include the fact that the court corporation has filed documents with the Securities and Exchange Commission evidencing their intention to go public.
- Additional information related to the set of data items in one embodiment, is obtained from content servers 170 and 180.
- content server 170 may be a financial news wire that provides headlines regarding financial events globally,
- the scoring module 124 analyzes the set of received data items associated with the candidate document in view of the related additional information and determines a modification, e.g. an increase or decrease, to each of the indi vidual scores for each data item in order to subsequently determine an overall modified score for the candidate document. Details regarding the specific scoring methodology are discussed in connection with FIGS. 6 and 6A.
- the scoring module 124 makes its modified determination using the set of the pre-defined scoring patterns 1 6. in another embodiment, the scoring module 124 analyzes the set of received data items in view of the additional information using a logic model.
- the recommendation module 122 receives a modified overall score for the candidate document and determines at this juncture whether the candidate document should be presently obtained or not obtained by the system 100, The recommendation module 122 makes its modified determination t obtain or not obtain, the candidate document based on whether the overall score of the document is greater than a threshold value.
- a threshold value the increase in the score for the data item, "party identity" in turn increased the overall score of the candidate court document, making the overall score greater than a threshold value and changing the recommendatio from a "do not obtain candidate document" to an "obtain candidate document.”
- process flow returns to step 330, in which method 300 repeats the process of generating and transmitting an electronic version of the candidate document based on the modified recommendation. Conversely, if a determination is made to retrain from generating a modified recommendation to obtain the candidate document, process flow continues to step 360, in which the set of data items associated with the candidate document continue to be maintained i the candidate data item database 132 for subsequent review.
- a exemplary compnier-implemented method 400 for generating a modified recom endation to an on-line research system dining a subsequent review- is disclosed.
- the set of data items associated with the candidate document i maintained in the candidate data item database 132 for subsequent review step 410.
- the subsequent review is initiated by a triggering occurrence, such as the occurrence of an event or a scheduled periodic review.
- a scheduled periodic review is a subsequent review to occur on a periodic basis, such as annually, monthly, weekly, or daily.
- the defined periodic review rides, such as the frequency of the subsequent review, are maintained in the set of pre-defined scoring patterns 136. in one
- an administrator using the administrator device 190, defines the rules for the periodic review and has the ability to modify the periodic review rules to alter the frequency of the subsequent review as desired.
- an event is a significant occurrence relating to one or more of the data items associated with the candidate document, such as a national news event, financial news event, spotting news event, legal news event and or a legal concept event.
- a national news event such as a national news event, financial news event, spotting news event, legal news event and or a legal concept event.
- the data item for a candidate document is the identity of the plaintiff husband and wife, and an event would be a widespread news report that the plaintiff husband has been romantically involved with a famous Hollywood actress.
- the occurrence of an event is defined by a set of rides maintained within the set of pre-defined scoring patterns 136.
- a rule may be to obtain the candidate document if the number of times a data item appears in a news story is h igher titan a threshold value. For example, if a rule is that a judge's identity appears in more than five thousand news articles, where a judge is nominated to the United States Supreme Court and hence here name appears in more approximately twelve thousand news articles, the nomination would be defined as an event since the data i tem, in this example the judge's name, appeared in more than five thousand news articles.
- Another rule might be to obtain the candidate document if the data item is related to a change in a legal concept.
- the data item is the nature of the suit
- fair labor standards act a ruling by the United States Supreme Court regarding the definition of "principals activities" under the Fair Labor Standards Act is defined as an event as it involves a major ruling regarding the Fair Labor Standards Act by the Supreme Court, hi another example, the occurrence of an event may constitute a rule set up by an administrator at the administrator device 190 based on usage data, from an on-line research system, such as the data usage demonstrating that in the last year, users of the on-line research system have searched for three times more bankruptcy cases than in the previous two years.
- information regarding events is received from content servers
- content server 170 may be a newswire that provides a variety of headlines, such as headlines related to scandals in the entertainment industry or tie website of the United States Supreme Court posting their most recent decisions.
- Content server 180 may, for example, comprise a library of usage data regarding the activities of users with an on-line research system.
- the scoring module 124 identifies the occurrence of an event based on the rules maintained in the set of pre-defined scoring patterns 136. For example, the scoring module 124 determines an event occurrence of a party's name appeasing in the news more than threshoid value from the content server 170, which maintains such intormation from the newswire feed. If an event has not occurred, then process flow returns to step 410 and the set of data items continue to be maintained in the candidate data item database I 32. Alternatively, in step 414, a determination is made as to whether a periodic review is scheduled.
- scoring module 124 determines whether it been three months since the last periodic review or since the "do not obtain" recommendation was first transmitted based on information maintained in the set of pre-defined scoring patterns 136. IT the scheduled time has not occurred, then process flo w returns to step 410 and the set of data items continue to be m aintain ed in the candidate data item database 132.
- step 416 additional, .information related to the set. of data items associated with the candidate document is received.
- additional information related to the set of data item includes an increase in notoriety for a given data item. For example, the fact that a lawyer corporation has filed documents with the Securities and Exchange Commission evidencing their intention to go public.
- the set of data item associated with the candidate document is then analyzed in view of the additional information related to the set of data items.
- the scoring module 124 having analyzed the set of received data items in view of the related additional information, makes a determination as to whether the individual scores for each data item should be increased or decreased, in order to subsequently determine an overall modified score for the candidate document. [0056] In step 4:20, a determination is then made as whether to generate a modified recommendation to obtain the candidate document based upon the additional information.
- the recommendation module 122 receives a modified overall score for the candidate document and determines at this juncture whether the candidate document should be presently obtained or not obtained by the system 100, The recommendation module 122 makes its modified determination to obtain or not obtain the candidate document based on whether the overall score of the document is areafer than a threshold value, if a determination is made to refrain from generating a modified "obtain" recommendation, the process flow returns to step 410, in which the set of data items associated with the candidate document is then maintained in the candidate data item database .132 for subsequent review.
- the electronic version of the candidate document is stored in memor within the candidate document database 134.
- FIG. 5 a further detailed exemplary computer-implemented method 500 for generating a recommendation t on- oard a candidate document to an on-line research system is disclosed.
- the scoring module .124 analyzes the set of data items associated with a candidate document from the access device 160, step 510, In steps 512 though 51 , a series of scores is then determined for the candidate document from the set of associated data items.
- a case-type score, a damages score, a party score, a participant score and a uniqueness score are determined for the candidate document by the scoring module 124, via steps 512-51 , based on a plurality of defined rules ma intained in the set of pre-defined patterns 136.
- a case-type score is a score assi ned to the candidate document based on the data item for the nature of the suit within the court document.
- the set of pre-defined patterns 136 may include a set of defined rules that sets forth that all intellectual property suits, professional negligence suits, class action suits, medical malpractice suits, and products liability suits are to be assigned a case-type score of 3; all fraud, breach of contract, bankruptcy and employment discrimination suits are to receive a score of 2; and all divorce, premises liability and motor vehicle suits are to receive a score of 1.
- a party score is a score assigned to the candidate document based on the dat item for the identification of the party in the court document.
- the set of pre-defined patterns 136 may include a set of defined rules thai dictates that any party that is a Fortune 100 company or a specific government agency, such as the Securities and Exchange Commission or the United States Attorney General's office, receives a score of 3; any state government, Fortune 500 company, publicly traded company or individual with celebrity status receives a score of 2; and any private individual or small privately held company receives a score of 1.
- a participant score is a score assigned to the candidate document based on the data item for the identification of the any participants in the court document, such as the judge, counsel, a party's counsel, or a party's counsel's law firm.
- the set of pre-defined patterns 136 may include a set of defined rules that mandates that any large law firm, e.g. more than 500 attorneys, receives a score of 3; any medium-sized law firm receives a score of 2; and any small firm or solo practitioner receives a score of 1.
- a damages score is a score assigned to the candidate document based on the damages awarded or sought in the court document.
- the set of pre-defined patterns 136 may inciude a set of defined rules that cases wherein damages sought or awarded are in excess of US $10 Million are to receive a score of 3, cases wherein dama g es awarded or sonant are between US $ 1 Million and US $ 10 Million are to receive a score of 2, any cases wherein any damages awarded or sought are less than US SI Million are to receive a score of 1.
- a uniqueness score is a score assigned to the candidate document based on the data items associated with the fact pattern of the candidate document. For example, a uniqueness score may be based on the products at issue, the legal concept at issue or the geographic location at issue.
- the set of pre-defined patterns 136 may incl ude a set of defined rules that sets forth that varying numerical score values from 1 to 3 for these individual characteristics. For example, if the candidate document, involves a pharmaceutical drug, the uniqueness score is scored higher than a court document pertaining to a malfnnctioning appliance.
- the candidate document relates to a hotly contested legal concept, such as a criminal case invoking the- accidental discharge of a firearm by a minor, which touches on gun control
- the uniqueness score is scored higher than a court document pertaining to an assault and battery incident having taken place at a night club without the use of weapons.
- the number and types of scores determined for a candidate docimient are not limited to the number and types of scores described herein, which are being disclosed solely to serve as exemplary score types, and that other score types may be determined from the set of data items associated with the candidate document by the scoring module 124, Further, the examples of each of the score types disclosed herein are presented purely for illustrative purposes and are not intended to limit the exemplary score disclosed herein. Additionally, it is important to note that the scoring scale is not limited to a 1 to 3 numerical scale, but can utilize any variation of a scoring scale, as well any other methods known in the art for scoring,
- the scores are then aggregated at step 520 by the scoring module 124.
- the individual scores are aggregated by adding each of the individual scores. For example, the case-type score, the damages score, the party score, the participant score and the uniqueness score would be added with the total sum being the overall score for the candidate complaint document.
- the individual scores are aggregated using a weighted sum model , For example, using a weighted sum model, the case- type score, the damages score and the party score would be weighted higher than the participant score and the uniqueness score in determining the overali score of the candidate document
- the set of pre-defined patterns 136 includes a defined rule that sets forth the overali threshold numerical score that is to be used by the recommendation module 122 to generate an. "obtain.” or a "do not obtain” recommendation.
- the candidate document is a complaint asserting a products liability cause of action and the set of data items includes "nature of suit: tort products liability” being assigned a case-type score of 3; "plaintiffs): lames Smithson, Anna Sniithson” and “defendants): ABC Paint Supplies” being assigned a party score of 1 ; “damages: compliant seeks $5 M US” being assigned a damages score of 2 and "fact pattern data: allegation of ABC Prime -l paint causing retinal damage to husband, occupation painter” being assigned a uniqueness score of 1 , an overall score would be the stun total of the individual scores, m this example, an overall score of 7.
- the threshold value to obtain a document may be defined within the set of pre-defined patterns 136 may be 8, in which ease the recommendation module 122 would determine that the candidate document should not be obtained
- step 540 a signal associated with the "obtain” recommendation is generated and transmitted by the communication module 126 to the access device 160, Subsequently, at step 544, an electronic version of the candidate document is generated and received .from the access device 1 0 and stored in the on-boarded document database 134 and the process flow ends. It should be noted that once an "obtain" recommendation is generated and the candidate document is ultimately obtained and stored, the set of data items associated with the candidate document also continues to be maintained in the candidate item database 132 for subsequent analysis,
- step 550 In which the set of data items associated with the candidate document continue to be maintained in candidate item database 132,
- the scoring module 124 analyzes the set of data items associated with the candidate document in view of the additional information recei ved from the content servers 170. and 180,
- content server 170 may be a newswire that provides a variety of headlines, such as headlines related to scandals in the entertainment industry or the website of the United States Supreme Court posting their most recent decisions.
- Content server 180 may, for example, a library of usage data regarding the activities of users with an on-line research system.
- Individual data items from the set. of data items associated with the candidate document are then analyzed based upon the received additional information.
- the individual data items analyzed include the party data, court officer data, product data, fact pattern data and legal concept data in order to determin whether there has been any change to the set of data items.
- An example of this would be when the plaintiff or lawyer to the legal proceeding in the candidate document has gained popularity in the news.
- the party score is increased at step 620, Continuing with the previous example, the party score for the plaintiff wife in the divorce decree increased from a score of I to 3 because she has become a popular Hollywood actress often mentioned in the news.
- step 614 a determination is made as to whether any of the
- participant from the candidate document has gained notoriety. This may occur when the judge, counsel, counsel's firms, experts or relevant third-parties have gained notoriety. Examples of participants gaining notoriety include where a judge is elevated to a higher court or decides to abandon his judicial career to pursue a political career, or a pharmaceutical manufacturer invol ed in a products liability action files documents with the Securities and Exchange
- the participant score is increased at step 622.
- a product gaining notoriety Is where the candidate document pertains to a products liabilit case involving a defective tire, and later a series of class action suits are filed involving the defective tire. If a determination is made that the product at issue has gained notoriety, the uniqueness score is increased at step 624. Similarly, at step 6 I S, a determination is made as to whether the fact pattern from the candidate document has gained notoriety.
- the candidate documents pertain to a court's decisio on divided patent infringement and later the United States Supreme Court reverses the country's long standing doctrine on what is required to prove divided infringement. If a determination is made thai the Saria concept has changed, the c ase-type score is increased a t step 628.
- the scoring module 124 aggregates the individual scores, including any increased scores at step 640. As disclosed previously, the scores are aggregated by summing the individual scores according to one embodiment. In another embodiment, the scores are aggregated using a weighted sum mode. Next, at step 650, a determination is made by the recommendation module ⁇ 22 as to whether the combined score is greater than the threshold value. In one embodiment, the set.
- pre-defined patterns 136 includes a defined rule that sets forth the overall threshold numerical score that continues to be used by the recommendation module 122 to generate a modified "obtain” or a "do not obtain” recommendation, for example, where the candidate document is a complaint asserting a products liability cause of action for defective paint and the set of data items includes the identity of the plaintiff who has since been charged as a serial murder and is claiming an insanity, the party score would be increased because of the increase in notoriety of the plaintiff, and the uniqueness score would increase as well because its alleged in his defense that the insanity was caused by the use of the defective paint, having thereby increased the .notoriety of the product at issue in the candidate document.
- the uniqueness score and patty score would each be increased to a score of 3, modifying the overall score of die candidate document to an 1 1 , which would be the compared to threshold value of 8 maintained in the set of pre-defined scoring patterns 136.
- step 660 a signal associated with the "obtain” recommendatio is generated and transmitted by the communication module 126 to the access device 1 0.
- step 664 a electronic version of the candidate document is generated and received from the access device 160 and stored in the on-boarded document database 134, Alternatively, if the combined score is less than the threshold value, a "do not obtain” recommendation is generated at step 670 and process flow then continues to step 672, in which the set of data items associated with the candidate document continue to be maintained in candidate item database 1 2.
- step 630 in which "do not obtain” recommendation is generated and then the set of data items associated wit the candidate document continue to be maintained in candidate item database 132 via step 632.
- FIGS. 1 through 6A are conceptual illustrations allowing for an explanation of the present disclosure. It should be understood that various aspects of the embodiments of the present disclosure could be implemented in hardware, firmware, software, or combinations thereof, hi such embodiments, the various components and/or steps would be implemented in hardware, firmware, and/or software to perform the functions of the present disclosure. That is, the same piece of hardware, firmware, or module of software could perform one or more of the illustrated blocks (e.g., components o steps).
- computer software e.g., programs or other instructions
- data is stored on a machine -readable medium as part of a compu ter program product, and is loaded into a computer system or other device or machine via. a removable storage drive, hard drive, or communications interface.
- Computer programs also called
- machine readable medium e.g., a random access memory (RAM); a read only memory (ROM); a removable storage unit (e.g., a magnetic or optical disc, flash memory device, or the like); a hard disk; or the like.
- RAM random access memory
- ROM read only memory
- removable storage unit e.g., a magnetic or optical disc, flash memory device, or the like
- hard disk e.g., a hard disk; or the like.
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2014396820A AU2014396820B2 (en) | 2014-06-11 | 2014-06-11 | Systems and methods for content on-boarding |
CA2951519A CA2951519C (en) | 2014-06-11 | 2014-06-11 | Systems and methods for content on-boarding |
PCT/US2014/041935 WO2015191063A1 (en) | 2014-06-11 | 2014-06-11 | Systems and methods for content on-boarding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2014/041935 WO2015191063A1 (en) | 2014-06-11 | 2014-06-11 | Systems and methods for content on-boarding |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015191063A1 true WO2015191063A1 (en) | 2015-12-17 |
Family
ID=54834012
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2014/041935 WO2015191063A1 (en) | 2014-06-11 | 2014-06-11 | Systems and methods for content on-boarding |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU2014396820B2 (en) |
CA (1) | CA2951519C (en) |
WO (1) | WO2015191063A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111382560A (en) * | 2018-12-13 | 2020-07-07 | 鸿合科技股份有限公司 | Courseware making method and system and electronic equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040019846A1 (en) * | 2002-07-24 | 2004-01-29 | Xerox Corporation | System and method for managing document retention of shared documents |
US20120011427A1 (en) * | 2010-07-09 | 2012-01-12 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and Methods for Linking Items to a Matter |
US20120330946A1 (en) * | 2010-02-03 | 2012-12-27 | Arredondo Pablo D | Intuitive, contextual information search and presentation systems and methods |
-
2014
- 2014-06-11 CA CA2951519A patent/CA2951519C/en active Active
- 2014-06-11 WO PCT/US2014/041935 patent/WO2015191063A1/en active Application Filing
- 2014-06-11 AU AU2014396820A patent/AU2014396820B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040019846A1 (en) * | 2002-07-24 | 2004-01-29 | Xerox Corporation | System and method for managing document retention of shared documents |
US20120330946A1 (en) * | 2010-02-03 | 2012-12-27 | Arredondo Pablo D | Intuitive, contextual information search and presentation systems and methods |
US20120011427A1 (en) * | 2010-07-09 | 2012-01-12 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and Methods for Linking Items to a Matter |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111382560A (en) * | 2018-12-13 | 2020-07-07 | 鸿合科技股份有限公司 | Courseware making method and system and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
AU2014396820B2 (en) | 2017-08-17 |
CA2951519A1 (en) | 2015-12-17 |
CA2951519C (en) | 2023-09-26 |
AU2014396820A1 (en) | 2016-12-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9411879B2 (en) | Systems, methods, and interfaces for aggregating and providing information regarding legal professionals | |
US20160314549A1 (en) | Motion mapping and analytics | |
US20130346128A1 (en) | System and Method of Reviewing and Producing Documents | |
Issacharoff et al. | Unintended consequences of mandatory disclosure | |
Silva et al. | Process analytics approach for R&D project selection | |
CN111680153A (en) | Big data authentication method and system based on knowledge graph | |
Zhao et al. | Embedding-based recommender system for job to candidate matching on scale | |
Lizotte | Publish or Perish: The Electronic Availability of Summary Judgments by Eight District Courts | |
Zahra et al. | A framework for user characterization based on tweets using machine learning algorithms | |
Gottschalk et al. | Stages of knowledge management technology in the value shop: the case of police investigation performance | |
Friedland et al. | Finding tribes: identifying close-knit individuals from employment patterns | |
Liu et al. | What’s not trending on Weibo: China’s missing climate change discourse | |
Stavrakantonakis et al. | Linked Open Vocabulary ranking and terms discovery | |
US20150339772A1 (en) | System for buying and selling securities over a distributed communications network | |
Porwol et al. | Structuring e-participation perspectives: Mapping and aligning models to core facets | |
Dodonov et al. | Information Operations Recognition: From Nonlinear Analysis to Decision-Making | |
CN112837140A (en) | Data processing method, device, equipment and storage medium | |
CA2951519C (en) | Systems and methods for content on-boarding | |
Alkhred et al. | Sharing explicit knowledge: designing a peacekeeping operation databank | |
US20150363397A1 (en) | Systems and methods for content on-boarding | |
Sangeetha et al. | Modelling of E-governance framework for mining knowledge from massive grievance redressal data | |
CN116108955A (en) | Method, device, equipment and storage medium for upgrading and early warning of social contradiction disputes | |
Luthfi et al. | A framework for analyzing how governments open their data: Institution, technology, and process aspects influencing decision-making | |
Valsamidis et al. | Knowledge management for business intelligence measurement in an e-business system | |
Reed et al. | What's trending? mining topical trends in UGC systems with YouTube as a case study |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14894639 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2951519 Country of ref document: CA |
|
ENP | Entry into the national phase |
Ref document number: 2014396820 Country of ref document: AU Date of ref document: 20140611 Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 14894639 Country of ref document: EP Kind code of ref document: A1 |