WO2021026428A1 - Data entry feature for information tracking system - Google Patents

Data entry feature for information tracking system Download PDF

Info

Publication number
WO2021026428A1
WO2021026428A1 PCT/US2020/045353 US2020045353W WO2021026428A1 WO 2021026428 A1 WO2021026428 A1 WO 2021026428A1 US 2020045353 W US2020045353 W US 2020045353W WO 2021026428 A1 WO2021026428 A1 WO 2021026428A1
Authority
WO
WIPO (PCT)
Prior art keywords
keyword
character
indicate
processor
keywords
Prior art date
Application number
PCT/US2020/045353
Other languages
French (fr)
Inventor
Gabriel Enrique REINA
David HIRSCHFELD
Original Assignee
Zinatt Technologies, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zinatt Technologies, Inc. filed Critical Zinatt Technologies, Inc.
Priority to EP20850073.6A priority Critical patent/EP4010838A4/en
Priority to CN202311018897.6A priority patent/CN117112598A/en
Priority to US16/969,420 priority patent/US11783127B2/en
Priority to CN202080067387.6A priority patent/CN115210708B/en
Priority to JP2022507854A priority patent/JP2022543870A/en
Publication of WO2021026428A1 publication Critical patent/WO2021026428A1/en
Priority to US18/240,996 priority patent/US20240070391A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/243Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes

Definitions

  • the present disclosure generally relates to information tracking, and more particularly to systems and methods for processing data entries for investigative information tracking.
  • the present disclosure provides a method by a computing device for processing text data associated with an investigation.
  • the method includes analyzing the text data to identify a plurality of keywords.
  • the method also includes determining whether each of the plurality of keywords exists in one or more databases.
  • the method includes tagging the keyword with a plurality of characters for storage.
  • the plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
  • the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
  • Each of the first, second, and third characters can have than one character.
  • the second character is intermediate the first character and the third character, while the keyword is intermediate the second character and the third character.
  • the method includes highlighting the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword.
  • the method includes highlighting the tagged keyword with a second color to indicate that user input is needed to classify the two or more separate words.
  • the method includes highlighting the keyword with a third color to indicate that the keyword already exists.
  • the method includes storing the tagged keyword in the corresponding database. The tagged keyword can also be stored in multiple databases.
  • the present disclosure provides a system for processing text data associated with an investigation.
  • the system includes a processor, a memory, and one or more databases.
  • the memory includes instructions that, when executed by the processor, cause the processor to analyze the text data to identify a plurality of keywords.
  • the processor also determines whether each of the plurality of keywords exists in the one or more databases. When a keyword in the plurality of keywords is not found in the one or more databases, the processor tags the keyword with a plurality of characters for storage.
  • the plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
  • the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
  • Each of the first, second, and third characters can have than one character.
  • the second character is intermediate the first character and the third character, while the keyword is intermediate the second character and the third character.
  • the processor highlights the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword.
  • the processor highlights the tagged keyword with a second color to indicate that user input is needed to classify the two or more separate words.
  • the processor highlights the keyword with a third color to indicate that the keyword already exists.
  • the processor stores the tagged keyword in the corresponding database.
  • the present disclosure provides a non- transitory computer readable medium that has instructions stored thereon.
  • the instructions when executed by a processor, cause the processor to analyze text data to identify a plurality of keywords and determine whether each of the plurality of keywords exists in one or more databases.
  • the instructions when a keyword in the plurality of keywords is not found in the one or more databases, the instructions cause the processor to tag the keyword with a plurality of characters for storage.
  • the plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
  • the instructions also cause the processor to store the tagged keyword in the corresponding database.
  • the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
  • FIG. 1 is a block diagram illustrating an information tracking system
  • FIG. 2 is a flow chart illustrating a method of processing data entries for the information tracking system of FIG. 1;
  • FIG. 3 is a conceptual diagram illustrating various example databases
  • FIG. 4 is a conceptual diagram illustrating an example format for processing data entries
  • FIG. 5 is a conceptual diagram illustrating example processed data entries
  • FIGS. 6-8 are conceptual diagrams illustrating example user interfaces for processed data entries; and [0018] FIGS. 9-10 are conceptual diagrams illustrating other examples of processed data entries.
  • Coupled is used to include both arrangements wherein two or more components are in direct physical contact and arrangements wherein the two or more components are not in direct contact with each other (e.g., the components are “coupled” via at least a third component), but yet still cooperate or interact with each other.
  • numeric terminology such as first and second, is used in reference to various components or features. Such use is not intended to denote an ordering of the components or features. Rather, numeric terminology is used to assist the reader in identifying the component or features being referenced and should not be narrowly interpreted as providing a specific order of components or features.
  • FIG. 1 One of ordinary skill in the art will realize that the embodiments provided can be implemented in hardware, software, firmware, and/or a combination thereof.
  • Programming code according to the embodiments can be implemented in any viable programming language such as C, C++, HTML, XTML, JAVA or any other viable high-level programming language, or a combination of a high-level programming language and a lower level programming language.
  • FIG. 1 One of ordinary skill in the art will realize that the embodiments provided can be implemented in hardware, software, firmware, and/or a combination thereof.
  • Programming code according to the embodiments can be implemented in any viable programming language such as C, C++, HTML, XTML, JAVA or any other viable high-level programming language, or a combination of a high-level programming language and a lower level programming language.
  • FIG. 1 illustrates an information tracking system 100 that includes a computing device 102 (e.g., a desktop, a laptop, a mobile device, etc.) in communication, via a network 104 (e.g., local area network, wide area network, the Internet, etc.), with one or more databases 106A-106N implemented on non-transitory, computer-readable storage mediums (e.g., servers).
  • Databases 106A-106N are configured to store data associated with an investigation.
  • the investigation may be related to law enforcement (e.g., drug transfers, money laundering, wire fraud, identity theft, etc.), although other types of investigation (e.g., employment issues) or other non-investigation are also contemplated. While FIG.
  • databases 106A-106N may be separate units, in other embodiments, databases 106A-106N may be implement as a single unit. Additionally, in the present disclosure, database may include tables of information stored in any suitable manner, storage locations of data, or storage locations within the present system. Any type of accessible storage architecture is contemplated by the present disclosure.
  • Computing device 102 includes a processor 108 (e.g., a microprocessor, a microcontroller, logic circuitry, etc.), a memory 110, and a communication module 112.
  • Processor 108 is configured to receive and process the data associated with the investigation. Processing the data entails categorizing or tagging the data for storage in databases 106A-106N. Once processed, computing device 102 can transmit the data to databases 106A-106N using communication module 112 via network 104.
  • Processor 108 is also configured to analyze the data and generate an investigative report based on the analysis. While not shown, computing device 102 may include additional components (e.g., input/output devices) used for operating computing device 102.
  • a user operating computing device 102, can access databases 106A-106N to retrieve, save, and/or modify the data stored therein.
  • the data may include information such as personal identification information (e.g., names), location information (e.g., addresses), vehicle information, property information, financial information, and any other relevant information associated with the investigation.
  • the data may be in the form of text data (e.g., an email, a text message, a transcription of an audio file, a letter, etc.).
  • the data may be metadata which may or may not be viewable by the user.
  • Method 200 can be performed by computing device 102.
  • computing device 102 receives and analyzes the text data to identify a plurality of keywords. Keywords can be identified based on comparisons to stored terms. For example, computing device 102 can identify keywords related to a vehicle when words in the text data mention a particular vehicle make or model. As another example, computing device 102 can identify keywords related to an event when words in the text data mention a date, a time of day, a day of the week, etc. In still another example, computing device 102 can identify keywords based on special symbols, such as recognizing an email based on the “@” symbol.
  • computing device 102 determines whether each of the plurality of keywords exists in one or more databases (e.g., databases 106A-106N). In particular, computing device 102 can perform a search of databases 106A-106N to determine if an identified keyword is already present in any of databases 106A-106N.
  • databases 106A-106N e.g., databases 106A-106N.
  • FIG. 3 lists various example databases 302 that the keyword can be saved to. For example, if the keyword is related to a vehicle, then the keyword can be tagged with a character (“ve”) 304 for storage in a vehicles database (“Vehicles”) 306.
  • each of database 302 can be implemented as one of databases 106A-106N of FIG. 1.
  • the plurality of characters used to tag the keyword includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
  • Each of the first, second, and third characters may include more than one character.
  • FIG. 4 shows an example format for tagging a keyword represented by data 402.
  • a first character 404 in the form of a forward slash symbol (‘7”), is used to indicate the start of the tagging.
  • a second character 406, in the form of two letters (“mw”) is used to indicate a specific database that data 402 should be saved to.
  • the letters “mw” stand for “Money Wires,” which is the name of a database used to store information related to money wirings.
  • One or more blank spaces 410 may exist before and/or after data 402.
  • second character 406 is intermediate first character 404 and third character 408, while data 402 (or keyword) is intermediate second character 406 and third character 408.
  • the tagging format is not limited to the illustration shown in FIG. 4 as other formats may be contemplated in other embodiments.
  • a keyword may be saved to more than one database by including additional characters to indicate multiple databases.
  • the plurality of characters used for tagging a keyword can include any number and combination of letters, numbers, punctuation marks, and special symbols found on a standard keyboard.
  • Tagging keywords by computing device 102 allows the keywords to be accurately and efficiently stored in databases 106A-106N. This in turn enables easier information searching, information retrieval, information association, and information forecasting during the course of the investigation.
  • the tagging can be performed by a user.
  • FIG. 5 illustrates an example text data 500 that has been processed by computing device 102 to identify and tag keywords.
  • Text data 500 may be a description of a phone call between two parties.
  • computing device 102 determines that a keyword is found to not exist in any of the databases (e.g., any of databases 106A-106N), that keyword is tagged for storage using a plurality of characters according to the format shown in FIG. 4.
  • keywords 502-508 are tagged for storage.
  • keyword 502 is determined to be a slang word, and as such, keyword 502 is tagged to be stored in a “Vocabulary” database designated “vo.”
  • keyword 504 is determined to be a person’s name, and as such, keyword 504 is tagged to be stored in a “Names Mentioned” database designated “nm.”
  • Other examples include keyword 506 tagged to be stored in an “Addresses” database designated “ad,” and keyword 508 tagged to be stored in a “Vehicles” database designated “ve.”
  • a tagged keyword may require additional information to describe the keyword (e.g., from a user).
  • computing device 102 highlights the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword.
  • keyword 502 is highlighted with a green color to indicate additional user input.
  • an example user interface 602 is generated when a user selects (e.g., double-clicks on) keyword 502.
  • User interface 602 is in the form of a data entry window that allows the user to enter information (e.g., definition, notes, etc.) for keyword 502.
  • a tagged keyword may comprise a combination of two or more separate words.
  • computing device 102 highlights the tagged keyword with a second different color to indicate that user input is needed to categorize or classify the two or more separate words.
  • keyword 508 is highlighted with a red color to indicate additional user input.
  • Keyword 508 includes two separate words 704, 706 that describe a color and a make of a vehicle, respectively.
  • an example user interface 702 is generated when a user selects (e.g., double-clicks on) keyword 508.
  • User interface 702 is in the form of a mapping window that allows the user to match words 704, 706 to their corresponding descriptions.
  • User interface 702 also includes labels 708 that indicate auto-populated related information.
  • computing device 102 may change the highlighting of keyword 508 to a different color (e.g., to a green color).
  • a keyword is found to exist in one of the databases (e.g., one of databases 106A-106N)
  • that keyword is not tagged and is highlighted with a third color to indicate that the keyword already exists. For example, referring back to FIG. 5, keywords 510, 512 are highlighted in yellow to indicate that computing device 102 found these keywords in the databases.
  • an example user interface 802 is generated when a user chooses (e.g., double-clicks on) an already existing keyword 804.
  • User interface 802 is in the form of a data entry window that allows the user to view and/or modify any of the information associated with keyword 804.
  • FIGS. 9-10 illustrate other examples of text data processed by computing device 102.
  • text data 900 includes a tagged keyword 902 which is in the form of a phrase with multiple words.
  • Tagged keyword 902 describes a place but without an actual address.
  • computing device 102 has highlighted tagged keyword 902 in red indicate that additional user input is required (e.g., to determine the actual address through a different source).
  • computing device 102 can provisionally tag a keyword but will not activate it until or unless a user reviews the tagging.
  • text data 1000 includes provisionally tagged keywords 1002, 1004.
  • Computing device 102 has recognized the keywords but has not permanently tagged them.
  • a user can review provisionally tagged keywords 1002, 1004, and upon selecting them (e.g., double-clicking), computing device 102 can activate their tagging by highlighting keywords 1002, 1004 in red. This also indicates that additional user input (e.g., information to describe the keywords) is needed.
  • text data 100 can be used to create an event.
  • an event may be added to the system calendar by typing “/up tomorrow A ” which corresponds to upcoming events / calendar event.
  • System and methods disclosed herein allow for improved efficiency in terms of processing and integrating data associated with an investigation into one or more databases.
  • a machine such as a general-purpose processor (e.g., a microprocessor, a microcontroller, a state machine, etc.), a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein.
  • a general-purpose processor e.g., a microprocessor, a microcontroller, a state machine, etc.
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA field programmable gate array
  • a software module can reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of computer-readable storage medium known in the art.
  • An exemplary storage medium can be coupled to the processor such that the processor can read information from, and write information to, the storage medium.
  • the storage medium can be integral to the processor.
  • references to “one embodiment,” “an embodiment,” “an example embodiment,” etc. indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to affect such feature, structure, or characteristic with the benefit of this disclosure in connection with other embodiments whether or not explicitly described. After reading the description, it will be apparent to one skilled in the relevant art(s) how to implement the disclosure in alternative embodiments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

A method for processing text data includes analyzing the text data to identify a plurality of keywords. The method also includes determining whether each of the plurality of keywords already exists in one or more databases. When a keyword in the plurality of keywords is not found in the one or more databases, the method includes tagging the keyword with a plurality of characters for storage. The plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging. The method also includes storing the tagged keyword in the corresponding database.

Description

DATA ENTRY FEATURE FOR INFORMATION TRACKING SYSTEM
CROSS-REFERENCE TO RELATED APPLICATIONS [0001] The present application claims the benefit of priority to U.S. Provisional Application No. 62/883,917, filed on August 7, 2019, the entire disclosure of which is hereby expressly incorporated herein by reference.
FIELD OF THE DISCLOSURE
[0002] The present disclosure generally relates to information tracking, and more particularly to systems and methods for processing data entries for investigative information tracking.
BACKGROUND OF THE DISCLOSURE
[0003] Current investigations and reports by law enforcement are typically manually processed by using conventional word processing and/or spreadsheet programs. However, such system lacks the ability to efficiently manage and integrate the information into a database. For example, a drug trafficking investigation may have a large amount of uncategorized or untagged data that needs to be preprocessed before storage in the database. Conventional methods require extensive man-hours to sort and clean up the various entries, which may be prone to errors. Due to the nature of some errors, opportunities to gather more evidence, prevent further crimes, and apprehend suspects can be frustrated. Therefore, a need exists to better process and manage the data obtained during the course of an investigation.
SUMMARY
[0004] According to one embodiment, the present disclosure provides a method by a computing device for processing text data associated with an investigation. The method includes analyzing the text data to identify a plurality of keywords. The method also includes determining whether each of the plurality of keywords exists in one or more databases. When a keyword in the plurality of keywords is not found in the one or more databases, the method includes tagging the keyword with a plurality of characters for storage. The plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
[0005] In a further aspect, the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard. Each of the first, second, and third characters can have than one character. The second character is intermediate the first character and the third character, while the keyword is intermediate the second character and the third character.
[0006] In another aspect, the method includes highlighting the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword. When the tagged keyword includes two or more separate words, the method includes highlighting the tagged keyword with a second color to indicate that user input is needed to classify the two or more separate words. When the keyword in the plurality of keywords is found in the one or more databases, the method includes highlighting the keyword with a third color to indicate that the keyword already exists. Moreover, the method includes storing the tagged keyword in the corresponding database. The tagged keyword can also be stored in multiple databases.
[0007] According to another embodiment, the present disclosure provides a system for processing text data associated with an investigation. The system includes a processor, a memory, and one or more databases. The memory includes instructions that, when executed by the processor, cause the processor to analyze the text data to identify a plurality of keywords.
The processor also determines whether each of the plurality of keywords exists in the one or more databases. When a keyword in the plurality of keywords is not found in the one or more databases, the processor tags the keyword with a plurality of characters for storage. The plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
[0008] In a further aspect, the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard. Each of the first, second, and third characters can have than one character. The second character is intermediate the first character and the third character, while the keyword is intermediate the second character and the third character.
[0009] In another aspect, the processor highlights the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword. When the tagged keyword includes two or more separate words, the processor highlights the tagged keyword with a second color to indicate that user input is needed to classify the two or more separate words. When the keyword in the plurality of keywords is found in the one or more databases, the processor highlights the keyword with a third color to indicate that the keyword already exists. Moreover, the processor stores the tagged keyword in the corresponding database.
[0010] According to yet another embodiment, the present disclosure provides a non- transitory computer readable medium that has instructions stored thereon. The instructions, when executed by a processor, cause the processor to analyze text data to identify a plurality of keywords and determine whether each of the plurality of keywords exists in one or more databases. When a keyword in the plurality of keywords is not found in the one or more databases, the instructions cause the processor to tag the keyword with a plurality of characters for storage. The plurality of characters includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging. The instructions also cause the processor to store the tagged keyword in the corresponding database. The plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
BRIEF DESCRIPTION OF THE DRAWINGS
[0011] The above-mentioned and other features and advantages of this disclosure, and the manner of attaining them, will become more apparent and the invention itself will be better understood by reference to the following description of embodiments of the invention taken in conjunction with the accompanying drawings, wherein:
[0012] FIG. 1 is a block diagram illustrating an information tracking system;
[0013] FIG. 2 is a flow chart illustrating a method of processing data entries for the information tracking system of FIG. 1;
[0014] FIG. 3 is a conceptual diagram illustrating various example databases;
[0015] FIG. 4 is a conceptual diagram illustrating an example format for processing data entries;
[0016] FIG. 5 is a conceptual diagram illustrating example processed data entries;
[0017] FIGS. 6-8 are conceptual diagrams illustrating example user interfaces for processed data entries; and [0018] FIGS. 9-10 are conceptual diagrams illustrating other examples of processed data entries.
[0019] Corresponding reference characters indicate corresponding parts throughout the several views. The exemplifications set out herein illustrate exemplary embodiments of the disclosure and such exemplifications are not to be construed as limiting the scope of the disclosure in any manner.
DETAILED DESCRIPTION
[0020] For the purposes of promoting an understanding of the principles of the present disclosure, reference is now made to the embodiments illustrated in the drawings, which are described below. The exemplary embodiments disclosed herein are not intended to be exhaustive or to limit the disclosure to the precise form disclosed in the following detailed description. Rather, these exemplary embodiments were chosen and described so that others skilled in the art may utilize their teachings.
[0021] The terms “couples,” “coupled,” and variations thereof are used to include both arrangements wherein two or more components are in direct physical contact and arrangements wherein the two or more components are not in direct contact with each other (e.g., the components are “coupled” via at least a third component), but yet still cooperate or interact with each other.
[0022] Throughout the present disclosure and in the claims, numeric terminology, such as first and second, is used in reference to various components or features. Such use is not intended to denote an ordering of the components or features. Rather, numeric terminology is used to assist the reader in identifying the component or features being referenced and should not be narrowly interpreted as providing a specific order of components or features.
[0023] One of ordinary skill in the art will realize that the embodiments provided can be implemented in hardware, software, firmware, and/or a combination thereof. Programming code according to the embodiments can be implemented in any viable programming language such as C, C++, HTML, XTML, JAVA or any other viable high-level programming language, or a combination of a high-level programming language and a lower level programming language. [0024] FIG. 1 illustrates an information tracking system 100 that includes a computing device 102 (e.g., a desktop, a laptop, a mobile device, etc.) in communication, via a network 104 (e.g., local area network, wide area network, the Internet, etc.), with one or more databases 106A-106N implemented on non-transitory, computer-readable storage mediums (e.g., servers). Databases 106A-106N are configured to store data associated with an investigation. The investigation may be related to law enforcement (e.g., drug transfers, money laundering, wire fraud, identity theft, etc.), although other types of investigation (e.g., employment issues) or other non-investigation are also contemplated. While FIG. 1 shows databases 106A-106N as being separate units, in other embodiments, databases 106A-106N may be implement as a single unit. Additionally, in the present disclosure, database may include tables of information stored in any suitable manner, storage locations of data, or storage locations within the present system. Any type of accessible storage architecture is contemplated by the present disclosure.
[0025] Computing device 102 includes a processor 108 (e.g., a microprocessor, a microcontroller, logic circuitry, etc.), a memory 110, and a communication module 112. Processor 108 is configured to receive and process the data associated with the investigation. Processing the data entails categorizing or tagging the data for storage in databases 106A-106N. Once processed, computing device 102 can transmit the data to databases 106A-106N using communication module 112 via network 104. Processor 108 is also configured to analyze the data and generate an investigative report based on the analysis. While not shown, computing device 102 may include additional components (e.g., input/output devices) used for operating computing device 102.
[0026] A user, operating computing device 102, can access databases 106A-106N to retrieve, save, and/or modify the data stored therein. The data may include information such as personal identification information (e.g., names), location information (e.g., addresses), vehicle information, property information, financial information, and any other relevant information associated with the investigation. In one embodiment, the data may be in the form of text data (e.g., an email, a text message, a transcription of an audio file, a letter, etc.). In other embodiments, the data may be metadata which may or may not be viewable by the user.
[0027] Referring now to FIG. 2, a method 200 of processing text data associated with an investigation is shown. Method 200 can be performed by computing device 102. At block 202, computing device 102 receives and analyzes the text data to identify a plurality of keywords. Keywords can be identified based on comparisons to stored terms. For example, computing device 102 can identify keywords related to a vehicle when words in the text data mention a particular vehicle make or model. As another example, computing device 102 can identify keywords related to an event when words in the text data mention a date, a time of day, a day of the week, etc. In still another example, computing device 102 can identify keywords based on special symbols, such as recognizing an email based on the “@” symbol.
[0028] At block 204, computing device 102 determines whether each of the plurality of keywords exists in one or more databases (e.g., databases 106A-106N). In particular, computing device 102 can perform a search of databases 106A-106N to determine if an identified keyword is already present in any of databases 106A-106N.
[0029] At block 206, when the keyword not found in any of databases 106A-106N, computing device 102 tags the keyword with a plurality of characters for storage in a particular database. FIG. 3 lists various example databases 302 that the keyword can be saved to. For example, if the keyword is related to a vehicle, then the keyword can be tagged with a character (“ve”) 304 for storage in a vehicles database (“Vehicles”) 306. In one embodiment, each of database 302 can be implemented as one of databases 106A-106N of FIG. 1.
[0030] The plurality of characters used to tag the keyword includes at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging. Each of the first, second, and third characters may include more than one character. As an illustration, FIG. 4 shows an example format for tagging a keyword represented by data 402. A first character 404, in the form of a forward slash symbol (‘7”), is used to indicate the start of the tagging. A second character 406, in the form of two letters (“mw”), is used to indicate a specific database that data 402 should be saved to. In this example, the letters “mw” stand for “Money Wires,” which is the name of a database used to store information related to money wirings. A third character 408, in the form of a caret symbol (“L”), is used to indicate the end of the tagging. One or more blank spaces 410 may exist before and/or after data 402. In this manner, second character 406 is intermediate first character 404 and third character 408, while data 402 (or keyword) is intermediate second character 406 and third character 408. It should be noted that the tagging format is not limited to the illustration shown in FIG. 4 as other formats may be contemplated in other embodiments. For example, a keyword may be saved to more than one database by including additional characters to indicate multiple databases. In general, the plurality of characters used for tagging a keyword can include any number and combination of letters, numbers, punctuation marks, and special symbols found on a standard keyboard.
[0031] Tagging keywords by computing device 102 allows the keywords to be accurately and efficiently stored in databases 106A-106N. This in turn enables easier information searching, information retrieval, information association, and information forecasting during the course of the investigation. In one embodiment, the tagging can be performed by a user.
[0032] FIG. 5 illustrates an example text data 500 that has been processed by computing device 102 to identify and tag keywords. Text data 500 may be a description of a phone call between two parties. When computing device 102 determines that a keyword is found to not exist in any of the databases (e.g., any of databases 106A-106N), that keyword is tagged for storage using a plurality of characters according to the format shown in FIG. 4. In FIG. 5, keywords 502-508 are tagged for storage. For example, keyword 502 is determined to be a slang word, and as such, keyword 502 is tagged to be stored in a “Vocabulary” database designated “vo.” In another example, keyword 504 is determined to be a person’s name, and as such, keyword 504 is tagged to be stored in a “Names Mentioned” database designated “nm.” Other examples include keyword 506 tagged to be stored in an “Addresses” database designated “ad,” and keyword 508 tagged to be stored in a “Vehicles” database designated “ve.”
[0033] In some embodiments, a tagged keyword may require additional information to describe the keyword (e.g., from a user). As such, computing device 102 highlights the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword. In FIG. 5, keyword 502 is highlighted with a green color to indicate additional user input. Referring to FIG. 6, an example user interface 602 is generated when a user selects (e.g., double-clicks on) keyword 502. User interface 602 is in the form of a data entry window that allows the user to enter information (e.g., definition, notes, etc.) for keyword 502.
[0034] In some embodiments, a tagged keyword may comprise a combination of two or more separate words. As such, computing device 102 highlights the tagged keyword with a second different color to indicate that user input is needed to categorize or classify the two or more separate words. In FIG. 5, keyword 508 is highlighted with a red color to indicate additional user input. Keyword 508 includes two separate words 704, 706 that describe a color and a make of a vehicle, respectively. Referring to FIG. 7, an example user interface 702 is generated when a user selects (e.g., double-clicks on) keyword 508. User interface 702 is in the form of a mapping window that allows the user to match words 704, 706 to their corresponding descriptions. User interface 702 also includes labels 708 that indicate auto-populated related information. Once the user has successfully classified words 704, 706, computing device 102 may change the highlighting of keyword 508 to a different color (e.g., to a green color). [0035] When computing device 102 determines that a keyword is found to exist in one of the databases (e.g., one of databases 106A-106N), that keyword is not tagged and is highlighted with a third color to indicate that the keyword already exists. For example, referring back to FIG. 5, keywords 510, 512 are highlighted in yellow to indicate that computing device 102 found these keywords in the databases.
[0036] When a keyword already exists, a user can also access information about that keyword. Referring to FIG. 8, an example user interface 802 is generated when a user chooses (e.g., double-clicks on) an already existing keyword 804. User interface 802 is in the form of a data entry window that allows the user to view and/or modify any of the information associated with keyword 804.
[0037] FIGS. 9-10 illustrate other examples of text data processed by computing device 102. In FIG. 9, text data 900 includes a tagged keyword 902 which is in the form of a phrase with multiple words. Tagged keyword 902 describes a place but without an actual address. As such, computing device 102 has highlighted tagged keyword 902 in red indicate that additional user input is required (e.g., to determine the actual address through a different source).
[0038] In some embodiments, computing device 102 can provisionally tag a keyword but will not activate it until or unless a user reviews the tagging. In FIG. 10, text data 1000 includes provisionally tagged keywords 1002, 1004. Computing device 102 has recognized the keywords but has not permanently tagged them. A user can review provisionally tagged keywords 1002, 1004, and upon selecting them (e.g., double-clicking), computing device 102 can activate their tagging by highlighting keywords 1002, 1004 in red. This also indicates that additional user input (e.g., information to describe the keywords) is needed.
[0039] In certain embodiments, text data 100 can be used to create an event. For example, when keyword 1004 is encountered during the data entry process described above, an event may be added to the system calendar by typing “/up tomorrowA” which corresponds to upcoming events / calendar event.
[0040] System and methods disclosed herein allow for improved efficiency in terms of processing and integrating data associated with an investigation into one or more databases.
Such efficiencies can result in reduced man-hours and lower costs for the investigation. Moreover, such systems and methods may reduce data entry errors that are typically present with conventional systems and methods in the industry. [0041] The various illustrative modules and logical blocks described in connection with the embodiments disclosed herein can be implemented or performed by a machine, such as a general-purpose processor (e.g., a microprocessor, a microcontroller, a state machine, etc.), a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein.
[0042] The steps of a method, process, or algorithm described in connection with the embodiments disclosed herein can be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module can reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of computer-readable storage medium known in the art. An exemplary storage medium can be coupled to the processor such that the processor can read information from, and write information to, the storage medium.
Alternatively, the storage medium can be integral to the processor.
[0043] While this invention has been described as having exemplary designs, the present invention can be further modified within the spirit and scope of this disclosure. This application is therefore intended to cover any variations, uses, or adaptations of the invention using its general principles. Further, this application is intended to cover such departures from the present disclosure as come within known or customary practice in the art to which this invention pertains and which fall within the limits of the appended claims.
[0044] Furthermore, the connecting lines shown in the various figures contained herein are intended to represent exemplary functional relationships and/or physical couplings between the various elements. It should be noted that many alternative or additional functional relationships or physical connections may be present in a practical system. However, the benefits, advantages, solutions to problems, and any elements that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as critical, required, or essential features or elements. The scope is accordingly to be limited by nothing other than the appended claims, in which reference to an element in the singular is not intended to mean “one and only one” unless explicitly so stated, but rather “one or more.”
[0045] Moreover, where a phrase similar to “at least one of A, B, or C” is used in the claims, it is intended that the phrase be interpreted to mean that A alone may be present in an embodiment, B alone may be present in an embodiment, C alone may be present in an embodiment, or that any combination of the elements A, B or C may be present in a single embodiment; for example, A and B, A and C, B and C, or A and B and C.
[0046] Systems, methods and apparatus are provided herein. In the detailed description herein, references to “one embodiment,” “an embodiment,” “an example embodiment,” etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to affect such feature, structure, or characteristic with the benefit of this disclosure in connection with other embodiments whether or not explicitly described. After reading the description, it will be apparent to one skilled in the relevant art(s) how to implement the disclosure in alternative embodiments.
[0047] Furthermore, no element, component, or method step in the present disclosure is intended to be dedicated to the public regardless of whether the element, component, or method step is explicitly recited in the claims. As used herein, the terms “comprises”, “comprising”, or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.

Claims

CLAIMS What is claimed is:
1. A method for processing text data comprising: analyzing, by a computing device, the text data to identify a plurality of keywords; determining, by the computing device, whether each of the plurality of keywords exists in one or more databases; and when a keyword in the plurality of keywords is not found in the one or more databases, tagging, by the computing device, the keyword with a plurality of characters for storage, the plurality of characters including at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
2. The method of claim 1, wherein the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
3. The method of claim 1, wherein each of the first, second, and third characters comprises more than one character.
4. The method of claim 1, wherein the second character is intermediate the first character and the third character, and the keyword is intermediate the second character and the third character.
5. The method of claim 1, further comprising highlighting the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword.
6. The method of claim 5, wherein the tagged keyword comprises two or more separate words, and the method further comprises highlighting the tagged keyword with a second color to indicate that user input is needed to classify the two or more separate words.
7. The method of claim 6, wherein when the keyword in the plurality of keywords is found in the one or more databases, highlighting the keyword with a third color to indicate that the keyword already exists.
8. The method of claim 1, further comprising storing the tagged keyword in the corresponding database.
9. The method of claim 1, wherein the tagged keyword is stored in multiple databases.
10. A system comprising: a processor; one or more databases; and a memory including instructions that, when executed by the processor, cause the processor to: analyze text data to identify a plurality of keywords; determine whether each of the plurality of keywords exists in the one or more databases; and when a keyword in the plurality of keywords is not found in the one or more databases, tag the keyword with a plurality of characters for storage, the plurality of characters including at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
11. The system of claim 10, wherein the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
12. The system of claim 10, wherein each of the first, second, and third characters comprises more than one character.
13. The system of claim 10, wherein the second character is intermediate the first character and the third character, and the keyword is intermediate the second character and the third character.
14. The system of claim 10, wherein the instructions, when executed by the processor, further cause the processor to highlight the tagged keyword with a first color to indicate that user input is needed to add information about the tagged keyword.
15. The system of claim 14, wherein the tagged keyword comprises two or more separate words, and the instructions, when executed by the processor, further cause the processor to highlight the tagged keyword with a second color to indicate that user input is needed to classify the two or more separate words.
16. The system of claim 15, wherein when the keyword in the plurality of keywords is found in the one or more databases, the instructions, when executed by the processor, further cause the processor to highlight the keyword with a third color to indicate that the keyword already exists.
17. The system of claim 10, wherein the instructions, when executed by the processor, further cause the processor to store the tagged keyword in the corresponding database.
18. A non-transitory computer readable medium having stored thereon instructions that, when executed by a processor, cause the processor to: analyze text data to identify a plurality of keywords; determine whether each of the plurality of keywords exists in one or more databases; and when a keyword in the plurality of keywords is not found in the one or more databases, tag the keyword with a plurality of characters for storage, the plurality of characters including at least a first character to indicate a start of the tagging, a second character to indicate a corresponding database for storing the keyword, and a third character to indicate an end of the tagging.
19. The non-transitory computer readable medium of claim 18, wherein the instructions, when executed by the processor, further cause the processor to store the tagged keyword in the corresponding database.
20. The non-transitory computer readable medium of claim 18, wherein the plurality of characters includes one or more letters, numbers, punctuation marks, and special symbols found on a keyboard.
PCT/US2020/045353 2019-08-07 2020-08-07 Data entry feature for information tracking system WO2021026428A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
EP20850073.6A EP4010838A4 (en) 2019-08-07 2020-08-07 Data entry feature for information tracking system
CN202311018897.6A CN117112598A (en) 2019-08-07 2020-08-07 Method and system for processing text data, and non-transitory computer readable medium
US16/969,420 US11783127B2 (en) 2019-08-07 2020-08-07 Data entry feature for information tracking system
CN202080067387.6A CN115210708B (en) 2019-08-07 2020-08-07 Method and system for processing text data, and non-transitory computer readable medium
JP2022507854A JP2022543870A (en) 2019-08-07 2020-08-07 Data entry function for information tracking system
US18/240,996 US20240070391A1 (en) 2019-08-07 2023-08-31 Data entry feature for information tracking system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201962883917P 2019-08-07 2019-08-07
US62/883,917 2019-08-07

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US16/969,420 A-371-Of-International US11783127B2 (en) 2019-08-07 2020-08-07 Data entry feature for information tracking system
US18/240,996 Continuation US20240070391A1 (en) 2019-08-07 2023-08-31 Data entry feature for information tracking system

Publications (1)

Publication Number Publication Date
WO2021026428A1 true WO2021026428A1 (en) 2021-02-11

Family

ID=74503740

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2020/045353 WO2021026428A1 (en) 2019-08-07 2020-08-07 Data entry feature for information tracking system

Country Status (5)

Country Link
US (2) US11783127B2 (en)
EP (1) EP4010838A4 (en)
JP (1) JP2022543870A (en)
CN (2) CN117112598A (en)
WO (1) WO2021026428A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117112598A (en) * 2019-08-07 2023-11-24 齐纳特科技公司 Method and system for processing text data, and non-transitory computer readable medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999034307A1 (en) * 1997-12-29 1999-07-08 Infodream Corporation Extraction server for unstructured documents
US20110022941A1 (en) 2006-04-11 2011-01-27 Brian Osborne Information Extraction Methods and Apparatus Including a Computer-User Interface
US20120150866A1 (en) * 2008-06-20 2012-06-14 Lexisnexis Group Systems and methods for document searching
US20130144863A1 (en) * 2011-05-25 2013-06-06 Forensic Logic, Inc. System and Method for Gathering, Restructuring, and Searching Text Data from Several Different Data Sources

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567365B (en) 2010-12-26 2016-07-06 上海量明科技发展有限公司 A kind of it is directed to input method and the system that key word is labeled
US9501455B2 (en) * 2011-06-30 2016-11-22 The Boeing Company Systems and methods for processing data
CN105550298B (en) 2015-12-11 2019-12-10 北京搜狗科技发展有限公司 Keyword fuzzy matching method and device
CN117112598A (en) * 2019-08-07 2023-11-24 齐纳特科技公司 Method and system for processing text data, and non-transitory computer readable medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999034307A1 (en) * 1997-12-29 1999-07-08 Infodream Corporation Extraction server for unstructured documents
US20110022941A1 (en) 2006-04-11 2011-01-27 Brian Osborne Information Extraction Methods and Apparatus Including a Computer-User Interface
US20120150866A1 (en) * 2008-06-20 2012-06-14 Lexisnexis Group Systems and methods for document searching
US20130144863A1 (en) * 2011-05-25 2013-06-06 Forensic Logic, Inc. System and Method for Gathering, Restructuring, and Searching Text Data from Several Different Data Sources

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4010838A4

Also Published As

Publication number Publication date
US11783127B2 (en) 2023-10-10
EP4010838A1 (en) 2022-06-15
CN117112598A (en) 2023-11-24
JP2022543870A (en) 2022-10-14
US20240070391A1 (en) 2024-02-29
US20230141184A1 (en) 2023-05-11
EP4010838A4 (en) 2023-08-30
CN115210708A (en) 2022-10-18
CN115210708B (en) 2023-09-01

Similar Documents

Publication Publication Date Title
US11663254B2 (en) System and engine for seeded clustering of news events
US8296284B2 (en) Guided navigation system
US8108413B2 (en) Method and apparatus for automatically discovering features in free form heterogeneous data
US20090006391A1 (en) Automatic categorization of document through tagging
US8484194B1 (en) Training set construction for taxonomic classification
US20100082657A1 (en) Generating synonyms based on query log data
US20120124029A1 (en) Cross media knowledge storage, management and information discovery and retrieval
US20100114899A1 (en) Method and system for business intelligence analytics on unstructured data
US11609959B2 (en) System and methods for generating an enhanced output of relevant content to facilitate content analysis
EA001738B1 (en) Method and system for evaluating a data set
WO2021068932A1 (en) Method based on electronic book for presenting information associated with entity
US20240070391A1 (en) Data entry feature for information tracking system
US10699112B1 (en) Identification of key segments in document images
US20220343353A1 (en) Identifying Competitors of Companies
US20220229854A1 (en) Constructing ground truth when classifying data
CN109783612B (en) Report data positioning method and device, storage medium and terminal
CN101894158B (en) Intelligent retrieval system
US20170132275A1 (en) Query handling in search systems
CN115099922A (en) Financial data query method, system, readable storage medium and computer equipment
CN104899755A (en) Multi-dimensional complex condition advertisement indexing method
US10643227B1 (en) Business lines
CN112133308A (en) Method and device for multi-label classification of voice recognition text
US11966421B2 (en) System, method, and computer program for a context-based data-driven classifier
CN116467347B (en) Stock questioning and answering method
CN117493996A (en) Construction method of police situation cascade classification model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20850073

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022507854

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2020850073

Country of ref document: EP

Effective date: 20220307