US12443844B2 - Neural network trained using ordinal loss function - Google Patents
Neural network trained using ordinal loss functionInfo
- Publication number
- US12443844B2 US12443844B2 US17/509,840 US202117509840A US12443844B2 US 12443844 B2 US12443844 B2 US 12443844B2 US 202117509840 A US202117509840 A US 202117509840A US 12443844 B2 US12443844 B2 US 12443844B2
- Authority
- US
- United States
- Prior art keywords
- sample
- ordinal
- computer
- neural network
- mapping
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2113—Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2137—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on criteria of topology preservation, e.g. multidimensional scaling or self-organising maps
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/2431—Multiple classes
Definitions
- a computer-implemented method can include receiving multiple samples, each sample a computer-processable data structure corresponding to a real-world object and including a data element indicating the sample's class, the class one of n predefined classes to which each sample is linked.
- the method can include feeding each sample to an ordinal mapping deep neural network that maps each sample to a sample point of a multidimensional space.
- the method can include predicting the class of each sample based on an ordinal mapping of each sample point by the ordinal mapping deep neural network.
- the method can include iteratively adjusting parameters of the ordinal mapping deep neural network in response to misclassifying one or more of the samples by the ordinal mapping deep neural network.
- the iteratively adjusting can be based on an expected ordinal mapping loss determined by an ordinal mapping loss function that measures (a) distances between a hyperplane extending through each sample point in the multidimensional space and each other sample point of a same class and (b) overlap between sample points of different classes.
- a system includes a processor configured to initiate operations.
- the operations can include receiving multiple samples, each sample a computer-processable data structure corresponding to a real-world object and including a data element indicating the sample's class, the class one of n predefined classes to which each sample is linked.
- the operations can include feeding each sample to an ordinal mapping deep neural network that maps each sample to a sample point of a multidimensional space.
- the operations can predict the class of each sample based on an ordinal mapping of each sample point by the ordinal mapping deep neural network.
- the operations can include iteratively adjusting parameters of the ordinal mapping deep neural network in response to misclassifying one or more of the samples by the ordinal mapping deep neural network.
- the iteratively adjusting can be based on an expected ordinal mapping loss determined by an ordinal mapping loss function that measures (a) distances between a hyperplane extending through each sample point in the multidimensional space and each other sample point of a same class and (b) overlap between sample points of different classes.
- a computer program product includes one or more computer readable storage media having instructions stored thereon.
- the instructions are executable by a processor to initiate operations.
- the operations can include receiving multiple samples, each sample a computer-processable data structure corresponding to a real-world object and including a data element indicating the sample's class, the class one of n predefined classes to which each sample is linked.
- the operations can include feeding each sample to an ordinal mapping deep neural network that maps each sample to a sample point of a multidimensional space.
- the operations can include predicting the class of each sample based on an ordinal mapping of each sample point by the ordinal mapping deep neural network.
- the operations can include iteratively adjusting parameters of the ordinal mapping deep neural network in response to misclassifying one or more of the samples by the ordinal mapping deep neural network.
- the iteratively adjusting can be based on an expected ordinal mapping loss determined by an ordinal mapping loss function that measures (a) distances between a hyperplane extending through each sample point in the multidimensional space and each other sample point of a same class and (b) overlap between sample points of different classes.
- FIG. 1 illustrates an example ordinal mapping deep neural network system.
- FIG. 2 illustrates an example method of training the ordinal mapping deep neural network system of FIG. 1 .
- FIG. 3 illustrates an example ordinal mapping
- FIGS. 4 A and 4 B are example matrix transformations for determining an ordinal mapping loss.
- FIG. 5 illustrates an example computer system that uses an ordinal mapping deep neural network to rank items of text.
- FIGS. 6 A and 6 B illustrate example screen shots of a display generated with the computer system of FIG. 5 .
- FIG. 7 illustrates an example a computing node.
- FIG. 8 illustrates an example cloud computing environment.
- FIG. 9 illustrates example abstraction model layers.
- a neural network is a highly versatile vehicle for classifying samples of various kinds.
- the samples can be images, documents, or a host of other real-world objects that can be represented by feature vectors, the elements of which correspond to characteristics of the real-world objects.
- the feature vector elements of an image can correspond to the pixel data of the image.
- Each element of a feature vector corresponding to a document for example, can correspond to a specific word of a predetermined vocabulary, the value of which indicates whether the word occurs in the document.
- a neural network can classify a sample of a real-world object (e.g., image, document) based on a feature vector whose elements characterize the real-world object.
- the classification is a nominal classification.
- Some classification models generate a probability indicating the likelihood that a sample belongs to a category, the summation of the probabilities across every category summing to one. But a probability regarding the likely accuracy of a single classification does not provide an ordinal ranking among different classes.
- Some feedforward neural networks for multiclass classification problems have ordered classes but only one output unit, the values restricted to the unit interval [0, 1]. The interval must be subdivided into K subintervals (one for each class), according to a specific probabilistic model.
- example methods, systems, and computer program products are provided that are capable of training a neural network using an inventive ordinal mapping loss function.
- neural networks are trained by minimizing the expected value of a loss function.
- the ordinal mapping loss function disclosed herein calculates, during model training, the model error upon completion of a feedforward cycle.
- An aspect of the inventive arrangements disclosed herein is that the ordinal mapping deep neural network is trained using the ordinal mapping loss function and provides a mapping to a one-dimensional space.
- the ordinal mapping deep neural network establishes an ordering and a distance metric that provides both a prediction of ordinal classes and rank ordering or ranking.
- the ordinal mapping deep neural network trained using the ordinal mapping loss function can provide an effective average of ranking of the ordinal classes.
- One aspect of the ordinal mapping deep neural network trained using the ordinal mapping loss function is sample-to-sample pairwise comparisons. Based on the pairwise comparisons the ordinal mapping deep neural network maximizes distances between pairs of sample points (e.g., vectors) classified in different classes, the distance measured in a direction determined by the ordinal mapping deep neural network. At the same time sample points classified in the same classes are drawn closer thereby minimizing distances within each class.
- the process also provides the ordinal mapping deep neural network with “regularization.” Regularization involves selecting or adjusting a neural network's complexity. Too many parameters make for poor generalization of the neural network due to overfitting. Too few jeopardizes the neural network's accuracy.
- the ordinal mapping deep neural network disclosed herein provides regularization that largely resolves the trade-off. By reducing the number of parameters (weights and biases) while preserving predictive accuracy, moreover, the ordinal mapping deep neural network also reduces transfers between memory and processor, lessening strain on both memory capacity and bandwidth thus further enhancing computation efficiency.
- Other neural networks break classifying into multiple binary classifications, which typically cannot replicate an ordered metric (or estimated average ranking) without invoking a number of restrictive assumptions. These deep neural networks typically require piecing together multiple components. Unlike the ordinal mapping deep neural network trained using the ordinal mapping loss function disclosed herein, other deep neural networks must piece together multiple components and do not generate an optimized metric. The ordinal mapping deep neural network trained using the ordinal mapping loss function, as disclosed herein, does generate an optimized metric, an ordering and distance metric defined on a one-dimensional space.
- the ordinal mapping deep neural network is integrated within a system that scans documents and automatically extracts predetermined data points (e.g., statistics, factual assertions, predictions).
- the system enables a user to search, select, rate, and export to a device the data points, which the system pulls from various sources (e.g., publications, news feeds, websites).
- the ordinal mapping deep neural network provides a mapping of the data points to a one-dimensional space. Based on the mapping the data points can be ranked on a predetermined scale (e.g., ⁇ 1 to 5).
- a predetermined scale e.g., ⁇ 1 to 5
- One aspect of the ranking using the ordinal mapping deep neural network is the generation of directed distances (part of the distance metric described above).
- a sample point classified in class two and a sample point classified in class five not only differ by three units, but (as vectors) when projected on a number line (one-dimensional space), the mapping space value for the class five sample point also ranks three units greater than that of the class two sample point.
- ordinal mapping deep neural network One of the advantages of the ordinal mapping deep neural network disclosed herein over approaches that pre-date the ordinal hyperplane loss (OHPL) approach, is the ordinal mapping deep neural network's enablement of direct estimation of ordered classes.
- pre-OHPL ordinal hyperplane loss
- Another earlier approach required complex sampling, whereby a classification was treated as a complex binary problem.
- the ordinal mapping deep neural network's enablement of direct estimation of ordered classes disclosed herein overcomes these obstacles and limitations.
- the ordinal mapping deep neural network disclosed herein is an advance over the OHPL approach, as well.
- the ordinal mapping deep neural network is not restricted by the need to calculate hyperplane centroids.
- the ordinal mapping loss function used in the ordinal mapping deep neural network employs a centroid-free loss function.
- centroid-free means that a loss function defined as being centroid free does not use centroids in the calculation. Accordingly, using the centroid-free ordinal mapping loss function, the ordinal mapping deep neural network, unlike the OHPL approach, does not requiring calculating hyperplane centroids in the loss calculation for training the ordinal mapping deep neural network. Removal of hyperplane centroids from the calculation enables processing data in smaller batches.
- the ordinal mapping deep neural network can minimize computational cost, while ensuring that individual sample points (described below) are compared to as many points as possible within a full dataset. Accordingly, the ordinal mapping deep neural network is an improvement in computational efficiency over the OHPL approach as well as the pre-OHPL approach.
- the ordinal mapping deep neural network has been shown to predict ordinal classifications better than those made using the OHPL approach. For example, in performing ordinal classifications of verbal statements according to predetermined criteria, while both achieved greater than 90 percent accuracy, OHPL never achieved better than 93 percent whereas the ordinal mapping deep neural network achieved a 95 percent accuracy.
- FIGS. 1 and 2 respectively, illustrate example ordinal mapping system (system) 100 and example methodology 200 performed by system 100 .
- System 100 illustratively includes ordinal mapping deep neural network (DNN 102 ).
- Ordinal mapping deep DNN 102 illustratively comprises an input layer 104 comprising one or more units (not explicitly shown) that each outputs the result of an operation that feeds into one or more units of the succeeding layer, one or more hidden layers 106 comprising one or more logically organized units (not explicitly shown) that each outputs the result of an operation that feeds into unit(s) of a succeeding layer, and output layer 108 , also comprising one or more units (not explicitly shown) that outputs the result of the operations performed by each of the successive layers.
- input layer 104 comprising one or more units (not explicitly shown) that each outputs the result of an operation that feeds into one or more units of the succeeding layer
- hidden layers 106 comprising one or more logically organized units (not explicitly shown) that each outputs the result of
- system 100 can be implemented in hardware (e.g., dedicated hardwired circuitry), software (e.g., program code executed by one or more processors), or a combination thereof.
- system 100 in certain embodiments is implemented in a computing node, such as computing node 700 (e.g., cloud-based server), in which system 100 comprises program code that is electronically stored in a memory, such as memory 728 , and executes on one or more processors, such as processor 716 of computer system 712 ( FIG. 7 ).
- computing node 700 e.g., cloud-based server
- system 100 comprises program code that is electronically stored in a memory, such as memory 728 , and executes on one or more processors, such as processor 716 of computer system 712 ( FIG. 7 ).
- system 100 receives samples 110 (a set of labeled training examples) comprising training data for training ordinal mapping DNN 102 .
- Each sample is a computer-processable data structure corresponding to a real-world object and includes a data element indicating the sample's class, the class being one of n predefined classes to which each sample is linked.
- the data structure can represent a vector, a matrix, or higher-order tensor, each element of which is a feature that characterizes the real-world object. For example, if the real-world object is text, then the sample may be an n-tuple, or vector, x , in which each element indicates by a zero or one depending on whether the text includes a word from an n-word vocabulary. If the real-world object is an image, for example, then the sample may be a multi-dimensional tensor whose features correspond to the image's pixel representation.
- Ordinal mapping DNN 102 is a classifier trained using supervised learning, and accordingly, each of the samples 110 are labeled with a target value.
- the target value of a sample indicates the correct class to which the sample belongs.
- ordinal mapping DNN 102 predicts the class of a sample.
- Ordinal mapping DNN 102 misclassifies the sample if the predicted class differs from the sample's target value. For example, if the samples are text, the target value can correspond to how well the text supports a certain point of view, rank ordered on an arbitrary scale. Using samples thus labeled, ordinal mapping DNN 102 can learn to identify portions of text (e.g., sentences) that, based on their underlying features, would likely garner the same ranking.
- each image's target value can be a ranked response that the image elicited when viewed by a select group of viewers.
- Ordinal mapping DNN 102 accordingly can learn to recognize images and rank each image according to the reaction each is predicted to elicit. Learning by ordinal mapping DNN 102 entails generating a prediction for each of set of training samples and comparing the prediction for each sample with the target value of each sample.
- each sample is fed into ordinal mapping DNN 102 .
- Ordinal mapping DNN 102 maps each sample x (vector, matrix, or higher order tensor) to a sample point ⁇ ( x ) in a multidimensional space n , x ⁇ ( x ) ⁇ n .
- FIG. 3 illustrates a mapping, ⁇ , that maps each sample x from space 300 to space 302 in 2 .
- Ordinal mapping DNN 102 learns the mapping ⁇ from the same or a comparable set of training samples. It is explicitly noted that although the two-dimensional space can be presented graphically for purpose of illustration, ordinal mapping DNN 102 can generate mappings to much higher-ordered multidimensional spaces n . Illustratively, in FIG.
- each of the samples corresponds to one of three classes.
- the data structure representation includes a data element indicating each sample's class.
- Ordinal mapping DNN 102 illustratively maps the samples in a direction (from the origin) such that the sample points in 2 , at least partially, are separated by class, the separation commensurate with each sample point's distance from the origin.
- the direction of separation as generated by ordinal mapping DNN 102 , extends outwardly in a direction of greatest separation of the samples by class.
- ordinal mapping DNN 102 predicts the class of each sample x based on an ordinal mapping of each sample point, ⁇ ( x ) ⁇ n .
- the output of ordinal mapping DNN 102 is w ′ ⁇ ( x ), where w ′ is a transpose vector of w , ⁇ ( x ) is the mapping of sample x (also a vector) to a sample point in n , and w ′ ⁇ ( x ) is the inner, or dot, product of the vectors.
- the scalar w ′ ⁇ ( x ) is the predicted value (ordinal class) generated by ordinal mapping DNN 102 .
- hyperplanes 304 and 306 separate the classes, albeit not perfectly.
- Hyperplanes 304 and 306 are generalized representations of the sample points and chosen to be perpendicular to the direction of greatest separation.
- Hyperplanes 304 and 306 are determined by and correspond to classification parameters w , which are determined by ordinal mapping DNN 102 .
- the purpose of training ordinal mapping DNN 102 is to enhance the prediction accuracy of the output w ′ ⁇ ( x ).
- parameters w of ordinal mapping DNN 102 are iteratively adjusted in response to a misclassifying one or more of the samples by the ordinal mapping DNN 102 .
- Ordinal mapping DNN 102 iteratively adjusts the parameters based on an expected loss determined by an ordinal mapping loss function in response to the misclassifying.
- the ordinal mapping loss function measures (a) distances between a hyperplane extending through each sample point in a class in the multidimensional space and each other sample point of the same class (within class differences) and (b) the overlap between sample points of different classes (between class differences).
- the ordinal mapping loss can be computed using matrix algebra.
- a matrix Z is based on the w ′ ⁇ ( x ) outputs of the ordinal mapping DNN 102 (predicted class of each sample x ), and a matrix C comprises the elements that are corresponding target values (the actual class of each sample x ).
- Z ′ ( z 1 z 1 z 1 z 2 z 2 z 2 z 3 z 3 )
- T ( z 1 z 1 z 1 z 2 z 2 z 3 z 3 z 3 )
- the matrix C replicates each column vector, whose i-th element, c i , is the target value (class) of the i-th predicted value, w′ ⁇ ( x ) i , of the i-th sample x i of a training set.
- C is also a square matrix.
- the corresponding target values are the elements vector of the vector (c 1 , c 2 , c 3 ) T .
- the transpose matrix Z T is subtracted from the matrix Z, generating
- Z - Z ′ ( z 1 - z 1 z 1 - z 2 z 1 - z 3 z 2 - z 1 z 2 - z 2 z 2 - z 3 z 3 - z 1 z 3 - z 2 z 3 - z 3 - z 1 z 3 - z 2 z 3 - z 3 )
- the difference between predicted values can be computed as the distance between hyperplanes through each sample point z i and z j . If class c i (the target value (or actual class) corresponding to predicted class z i ) is different from class c j (the target value (or actual class) corresponding to predicted class z j ), then the pair of samples for which ordinal mapping DNN 102 generates predicted values z i and z j are from different classes.
- Between-class errors are differences between predicted values z i ⁇ z j , i ⁇ j, from different classes that are less than the difference between the corresponding target values (actual classes) (c i ⁇ c j ).
- a difference z i ⁇ z j between predicted classes that is less than the difference between the actual classes (c i ⁇ c j ), if different (c i ⁇ c j , of two samples connotes an overlap between sample points from separate classes. That is, the ordinal mapping DNN 102 is not providing predictions that sufficiently distinguish between the different ordering of the classes.
- the sum includes only errors that are greater than a predetermined margin, such that the each remaining element of (C ⁇ C′) ⁇ (Z ⁇ Z′) is replaced by Max ⁇ [(c i ⁇ c j ) ⁇ (z i ⁇ z j )] ⁇ margin, 0 ⁇ .
- the other component of the ordinal mapping loss is the in-class error.
- the between-class error tests the sufficiency of the separation between predictions for samples from different classes
- the in-class error tests the closeness of predictions of samples in the same class.
- the matrix (C ⁇ C′) ⁇ (Z ⁇ Z′) is also transformed to determine in-class error.
- the margin typically is less than one. For example, in certain arrangements, the margin is 0.3. All remaining elements are replaced by zero.
- the in-class error is the sum each of the resulting non-zero terms of the transformed matrix.
- the ordinal mapping loss is the sum of both between-class and in-class errors.
- ordinal mapping loss has been described without loss of generality in the context of only three predictions ( w ′ ⁇ ( x ) 1 , w ′ ⁇ ( x ) 2 , w ′ ⁇ ( x ) 3 ) T and three corresponding target values (actual classes) (c 1 , c 2 , c 3 ) T , whose square matrices are each 3-by-3 matrices, it is expressly noted that in most applications the predictions and corresponding target values will be much greater (e.g., a thousand or more).
- the transformed matrix of FIG. 4 A is obtained by creating a square matrix for the class differences, C ⁇ C′, subtracting the square matrix created from differences between corresponding predicted values, Z ⁇ Z′, and modifying each element according to the above-described process for determining between-class errors.
- the non-zero elements of the transformed matrix of FIG. 4 A correspond to elements in which between-class differences are less than the actual class differences—that is, there is overlap indicating insufficient separation.
- the ordinal mapping loss due to between-class error is the sum of the non-zero elements.
- the transformed matrix of FIG. 4 B is obtained by creating the square matrix for the class differences, C ⁇ C′, subtracting the square matrix created from differences between corresponding predicted values, Z ⁇ Z′, and modifying each according to the above-described process for determining in-class errors.
- a predetermined margin e.g., 0.3
- In-class differences greater than the predetermined margin correspond to too great a difference between predictions belonging to the same class.
- the ordinal mapping loss due to in-class error is the sum of the non-zero elements.
- ordinal mapping DNN 102 iteratively adjusts the parameters by incrementally changing each parameter in a direction that generates two changes.
- One change tends to minimize, with respect to each sample, the distances between a hyperplane (perpendicular to the direction of separation) extending through the sample in the multidimensional space and each other sample of a same class.
- the other change tends to minimize overlap between samples of different classes, or equivalently, tends to maximize distances between samples in different classes.
- ordinal mapping DNN 102 can generate an ordering and distance metric that not only predicts the ordinal classes of newly presented sample, but also provides a one-dimensional mapping that generates an average ranking.
- Ordinal mapping DNN 102 can rank each newly presented sample's data structure (e.g., representing a vector, matrix, or higher order tensor) by mapping each data structure to a one-dimensional space having a defined ordering and distance metric.
- An ordinal mapping or ranking differs, say, from a confidence score that only provides a measure of statistical confidence that a classification is accurate. The confidence indicates that one classification is more likely correct than another, but it does not provide comparative information that on classification ranks, relative to some scale, higher than another.
- a score based on the number of keywords that appear in a document does not rank one document relative to another beyond the number of keywords occurring or number of times a keyword occurs in the respective documents.
- the score does not indicate that one document ranks higher relative to another, based for example on how well each document supports a particular point of view.
- Ordinal mapping DNN 102 is capable of providing such rankings.
- ordinal mapping DNN 102 trained using the optimal mapping loss function does not require an enormous number of samples (e.g., millions in the context of survey responses) for learning to calculate the average ranking.
- Other deep neural networks break the problem into multiple binary problems and cannot replicate ordinal mapping DNN 102 's ordered metric (estimated average), without invoking a number of additional restrictive assumptions, which require piecing together multiple components rather than generating an optimized metric.
- Ordinal mapping DNN 102 in certain arrangements, is capable of ranking factual statements identified in scanned-in text. Using the ordinal mapping loss function and a training set of ranked samples, ordinal mapping DNN 102 can learn to rank factual statements based on a predetermined scale that ranks factual statements according to how well each supports a predefined point of view.
- FIG. 5 illustrates example computer system 500 that receives text from database 502 . The text can be received from various sources, such as web sites, news feeds, scanned-in or saved documents, and the like. Computer system 500 extracts from the text relevant portions (data points), such as statistics, facts and predictions generated by various sources using application 504 .
- Application 504 enables users to search for, find, save, rate, and export highly rated portions of text from published reports, abstracts, professional reports, news feeds, and various websites that are submitted by multiple users.
- ordinal mapping DNN 102 can generate ordinal rankings
- ordinal mapping DNN 102 can rank text (or portions thereof) on an arbitrary scale (e.g., ⁇ 1 to 5). With a scale, say, of ⁇ 1 to 5, factual statements receiving a ⁇ 1 rating are deemed worthless, unreadable, or otherwise unusable. Those receiving a 0 rating are readable, but do not represent items (e.g., fact, prediction, statistic, key quote) that support a predetermined point of view. Remaining items receive a ranking between 1 and 5. Those receiving a 5 are of the highest quality and thus, based on ordinal mapping DNN 102 's ranking, likely to be highly useful to users.
- computer system 500 applies ordinal mapping DNN 102 with respect to sentences that are extracted from text obtained from one or more sources identified as credible and that are useful in supporting a point of view.
- a sentence may contain a specific fact or statement that is supported by research, data and/or analytics.
- Such sentences can be qualitative in nature, but may typically be quantitative in nature, containing portions that reference percentages, time frames, currency, sizes, volumes, capacity amounts, or the like.
- An aspect of ordinal mapping DNN 102 a capability to link users directly to specific, highly rated sentences in documents or text rather than linking them solely to the text or document.
- Rankings determined by ordinal mapping DNN 102 can enable a user to filter out all but text, including portions thereof or specific sentences, that receive ranking greater than a predetermined, user-specified threshold.
- a user is also able to add the user's own rankings to individual items, such that ordinal mapping DNN 102 can leverage newly added rankings to refine the predictions generated by ordinal mapping DNN 102 .
- FIG. 6 A illustrates example screen presentation 600 of example sentences 602 and 604 , which are extracted from credible text and provide quantitative data pertaining to the user-selected subject matter regarding the proliferation of blockchains.
- Sentences 602 and 604 are rendered by computer system 500 on display 506 .
- Screen presentation 600 presents the factual statements (dealing with blockchains) that have been filtered in response to a user request to include only those receiving a rank generated by ordinal mapping DNN 102 that is 4 or greater on the predetermined scale from ⁇ 1 to 5.
- the user request in some arrangements, creates a link between at least one factual statement (e.g., sentence) and text that contains the at least one factual statement.
- the link can comprise a data structure that couples the factual statement with metadata indicating a computer storage location (e.g., database element), at which is electronically stored the text (e.g., document) from which the factual statement is extracted by application 504 .
- Sentences 602 and 604 (factual statements) presented on display 506 are those identified as supporting a predetermined point of view (rapid growth in use of blockchains) and are ranked 4 or higher by ordinal mapping DNN 102 .
- FIG. 6 B illustrates example screen presentation 606 which displays sentence 608 (factual statement regarding growth of blockchain use), which is shown in expanded form within the display of text 610 from which sentence 608 is extracted.
- computer system 500 is capable of showing a sentence (expanded or otherwise highlighted) that contains the factual statement within a display of the text from which the factual statement supporting a point of view is extracted.
- a user is able to download and read through text (e.g., news feed, article, report, or the like), within which computer system 500 , based on a ranking provided by ordinal mapping DNN 102 , visually distinguishes key elements (e.g., sentences) of the text.
- computer system 500 can display portions of text (e.g., sentences or factual statement regarding a user-selected subject) in bold, color contrasted, enlarged, or otherwise distinguished from other portions of text.
- the sentence(s) or other key element(s) of the text can be ones that computer system 500 visually distinguishes in response to determining that the ordinal mapping DNN 102 ranking of the sentence(s) or other key element(s) exceeds a minimal threshold specified by the user.
- computer system 500 can display a visually distinguished factual statement whose ranking indicates that the factual statement strongly supports a predetermined point of view.
- ordinal mapping DNN 102 can be trained to rank order other types of newly presented samples. For example, presented with samples comprising a training set of labeled images, ordinal mapping DNN 102 can learn using the ordinal mapping loss function to rank order newly presented images.
- the ranking can correspond, for example, to the appeal (e.g., determined by a specific group of viewers) of each image.
- a computer system such as computer system 500 with display 506 , one or more such images can be displayed by superimposing on each of the one or more images the ranking of each of the one or more images generated, the ranking according to a predefined criterion (e.g., interest or enjoyment likely generated in a viewer).
- FIG. 7 illustrates a schematic of an example of a computing node 700 .
- computing node 700 is an example of a suitable cloud computing node.
- Computing node 700 is not intended to suggest any limitation as to the scope of use or functionality of embodiments of the invention described herein.
- Computing node 700 is capable of performing any of the functionality described within this disclosure.
- Computing node 700 includes a computer system 712 , which is operational with numerous other general-purpose or special-purpose computing system environments or configurations.
- Examples of well-known computing systems, environments, and/or configurations that may be suitable for use with computer system 712 include, but are not limited to, personal computer systems, server computer systems, thin clients, thick clients, hand-held or laptop devices, multiprocessor systems, microprocessor-based systems, set top boxes, programmable consumer electronics, network PCs, minicomputer systems, mainframe computer systems, and distributed cloud computing environments that include any of the above systems or devices, and the like.
- Computer system 712 may be described in the general context of computer system-executable instructions, such as program modules, being executed by a computer system.
- program modules may include routines, programs, objects, components, logic, data structures, and so on that perform particular tasks or implement particular abstract data types.
- Computer system 712 may be practiced in distributed cloud computing environments where tasks are performed by remote processing devices that are linked through a communications network.
- program modules may be located in both local and remote computer system storage media including memory storage devices.
- computer system 712 is shown in the form of a general-purpose computing device.
- the components of computer system 712 may include, but are not limited to, one or more processors 716 , a memory 728 , and a bus 718 that couples various system components including memory 728 to processor 716 .
- processor means at least one hardware circuit configured to carry out instructions.
- the hardware circuit may be an integrated circuit.
- Examples of a processor include, but are not limited to, a central processing unit (CPU), an array processor, a vector processor, a digital signal processor (DSP), a field-programmable gate array (FPGA), a programmable logic array (PLA), an application specific integrated circuit (ASIC), programmable logic circuitry, and a controller.
- CPU central processing unit
- DSP digital signal processor
- FPGA field-programmable gate array
- PDA programmable logic array
- ASIC application specific integrated circuit
- programmable logic circuitry and a controller.
- the carrying out of instructions of a computer program by a processor comprises executing or running the program.
- “run” and “execute” comprise a series of actions or events performed by the processor in accordance with one or more machine-readable instructions.
- “Running” and “executing,” as defined herein refer to the active performing of actions or events by the processor.
- the terms run, running, execute, and executing are used synonymously herein.
- Bus 718 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures.
- bus architectures include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, Peripheral Component Interconnect (PCI) bus, and PCI Express (PCIe) bus.
- Computer system 712 typically includes a variety of computer system-readable media. Such media may be any available media that is accessible by computer system 712 , and may include both volatile and non-volatile media, removable and non-removable media.
- Memory 728 may include computer system readable media in the form of volatile memory, such as random-access memory (RAM) 730 and/or cache memory 732 .
- Computer system 712 may further include other removable/non-removable, volatile/non-volatile computer system storage media.
- storage system 734 can be provided for reading from and writing to a non-removable, non-volatile magnetic media and/or solid-state drive(s) (not shown and typically called a “hard drive”).
- a magnetic disk drive for reading from and writing to a removable, non-volatile magnetic disk (e.g., a “floppy disk”)
- an optical disk drive for reading from or writing to a removable, non-volatile optical disk such as a CD-ROM, DVD-ROM or other optical media
- each can be connected to bus 718 by one or more data media interfaces.
- memory 728 may include at least one program product having a set (e.g., at least one) of program modules that are configured to carry out the functions of embodiments of the invention.
- Program/utility 740 having a set (at least one) of program modules 742 , may be stored in memory 728 by way of example, and not limitation, as well as an operating system, one or more application programs, other program modules, and program data. Each of the operating system, one or more application programs, other program modules, and program data or some combination thereof, may include an implementation of a networking environment.
- Program modules 742 generally carry out the functions and/or methodologies of embodiments of the invention as described herein.
- one or more of the program modules may include an ECD system, such as ordinal mapping DNN system 100 , or portions thereof.
- Program/utility 740 is executable by processor 716 .
- Program/utility 740 and any data items used, generated, and/or operated upon by computer system 712 are functional data structures that impart functionality when employed by computer system 712 .
- a “data structure” is a physical implementation of a data model's organization of data within a physical memory. As such, a data structure is formed of specific electrical or magnetic structural elements in a memory. A data structure imposes physical organization on the data stored in the memory as used by an application program executed using a processor.
- Computer system 712 may also communicate with one or more external devices 714 such as a keyboard, a pointing device, a display 724 , etc.; one or more devices that enable a user to interact with computer system 712 ; and/or any devices (e.g., network card, modem, etc.) that enable computer system 712 to communicate with one or more other computing devices. Such communication can occur via input/output (I/O) interfaces 722 . Still, computer system 712 can communicate with one or more networks such as a LAN, a WAN, and/or a public network (e.g., the Internet) via network adapter 720 .
- networks such as a LAN, a WAN, and/or a public network (e.g., the Internet) via network adapter 720 .
- Computer system 712 also can communicate with other devices via a wireless communication subsystem integrated in computing node 7 100 or operating as a type of external device 714 operatively coupled with computer system 712 .
- the wireless communication subsystem can be designed to operate over one or more mobile networks, Wi-Fi networks, short range wireless networks (e.g., a Bluetooth, UWB), and/or any combination of the foregoing.
- network adapter 720 communicates with the other components of computer system 712 via bus 718 . It should be understood that although not shown, other hardware and/or software components could be used in conjunction with computer system 712 . Examples, include, but are not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data archival storage systems, etc.
- computing node 700 is used to illustrate an example of a computing node, such as a cloud computing node, it should be appreciated that a computer system using an architecture the same as or similar to that described in connection with FIG. 7 may be used in either cloud or non-cloud computing implementations in performing the various operations described herein. In this regard, the example embodiments described herein are not intended to be limited to any single type of computing environment.
- Computing node 700 is an example of a data processing system.
- “data processing system” means one or more hardware systems configured to process data, each hardware system including at least one processor programmed to initiate operations and memory.
- Computing node 700 is an example of computer hardware. Computing node 700 may include fewer components than shown or additional components not illustrated in FIG. 7 depending upon the particular type of device and/or system that is implemented. The particular operating system and/or application(s) included may vary according to device and/or system type as may the types of I/O devices included. Further, one or more of the illustrative components may be incorporated into, or otherwise form a portion of, another component. For example, a processor may include at least some memory.
- Computing node 700 is also an example of a server (e.g., cloud-based server).
- server means a data processing system configured to share services with one or more other data processing systems and/or devices, including client devices.
- client device means a data processing system that requests shared services from a server, and with which a user directly interacts. Examples of a client device include, but are not limited to, a workstation, a desktop computer, a computer terminal, a mobile computer, a laptop computer, a netbook computer, a tablet computer, a smart phone, and the like.
- the various user devices described herein may be client devices.
- Network infrastructure such as routers, firewalls, switches, access points and the like, are not client devices as the term “client device” is defined herein.
- Cloud computing is a model of service delivery for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, network bandwidth, servers, processing, memory, storage, applications, virtual machines, and services) that can be rapidly provisioned and released with minimal management effort or interaction with a provider of the service.
- This cloud model may include at least five characteristics, at least three service models, and at least four deployment models.
- On-demand self-service a cloud consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with the service's provider.
- Resource pooling the provider's computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to demand. There is a sense of location independence in that the consumer generally has no control or knowledge over the exact location of the provided resources but may be able to specify location at a higher level of abstraction (e.g., country, state, or datacenter).
- Rapid elasticity capabilities can be rapidly and elastically provisioned, in some cases automatically, to quickly scale out and rapidly released to quickly scale in. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be purchased in any quantity at any time.
- Measured service cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported providing transparency for both the provider and consumer of the utilized service.
- level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts).
- SaaS Software as a Service: the capability provided to the consumer is to use the provider's applications running on a cloud infrastructure.
- the applications are accessible from various client devices through a thin client interface such as a web browser (e.g., web-based e-mail).
- a web browser e.g., web-based e-mail
- the consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, storage, or even individual application capabilities, with the possible exception of limited user-specific application configuration settings.
- PaaS Platform as a Service
- the consumer does not manage or control the underlying cloud infrastructure including networks, servers, operating systems, or storage, but has control over the deployed applications and possibly application hosting environment configurations.
- IaaS Infrastructure as a Service
- the consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, deployed applications, and possibly limited control of select networking components (e.g., host firewalls).
- Private cloud the cloud infrastructure is operated solely for an organization. It may be managed by the organization or a third party and may exist on-premises or off-premises.
- Public cloud the cloud infrastructure is made available to the general public or a large industry group and is owned by an organization selling cloud services.
- Hybrid cloud the cloud infrastructure is a composition of two or more clouds (private, community, or public) that remain unique entities but are bound together by standardized or proprietary technology that enables data and application portability (e.g., cloud bursting for load-balancing between clouds).
- a cloud computing environment is service oriented with a focus on statelessness, low coupling, modularity, and semantic interoperability.
- An infrastructure comprising a network of interconnected nodes.
- cloud computing environment 800 includes one or more cloud computing nodes 810 with which local computing devices used by cloud consumers, such as, for example, personal digital assistant (PDA) or cellular telephone 840 a , desktop computer 840 b , laptop computer 840 c , and/or automobile computer system 840 n may communicate.
- Computing nodes 810 may communicate with one another. They may be grouped (not shown) physically or virtually, in one or more networks, such as Private, Community, Public, or Hybrid clouds as described hereinabove, or a combination thereof.
- cloud computing environment 800 to offer infrastructure, platforms and/or software as services for which a cloud consumer does not need to maintain resources on a local computing device. It is understood that the types of computing devices 840 a - n shown in FIG. 8 are intended to be illustrative only and that computing nodes 810 and cloud computing environment 800 can communicate with any type of computerized device over any type of network and/or network addressable connection (e.g., using a web browser).
- FIG. 9 a set of functional abstraction layers provided by cloud computing environment 800 ( FIG. 8 ) is shown. It should be understood in advance that the components, layers, and functions shown in FIG. 9 are intended to be illustrative only and embodiments of the invention are not limited thereto. As depicted, the following layers and corresponding functions are provided:
- Hardware and software layer 960 includes hardware and software components.
- hardware components include mainframes 961 ; RISC (Reduced Instruction Set Computer) architecture-based servers 962 ; servers 963 ; blade servers 964 ; storage devices 965 ; and networks and networking components 966 .
- software components include network application server software 967 and database software 968 .
- Virtualization layer 970 provides an abstraction layer from which the following examples of virtual entities may be provided: virtual servers 971 ; virtual storage 972 ; virtual networks 973 , including virtual private networks; virtual applications and operating systems 974 ; and virtual clients 975 .
- management layer 980 may provide the functions described below.
- Resource provisioning 981 provides dynamic procurement of computing resources and other resources that are utilized to perform tasks within the cloud computing environment.
- Metering and Pricing 982 provide cost tracking as resources are utilized within the cloud computing environment, and billing or invoicing for consumption of these resources. In one example, these resources may include application software licenses.
- Security provides identity verification for cloud consumers and tasks, as well as protection for data and other resources.
- User portal 983 provides access to the cloud computing environment for consumers and system administrators.
- Service level management 984 provides cloud computing resource allocation and management such that required service levels are met.
- Service Level Agreement (SLA) planning and fulfillment 985 provide pre-arrangement for, and procurement of, cloud computing resources for which a future requirement is anticipated in accordance with an SLA.
- SLA Service Level Agreement
- Workloads layer 990 provides examples of functionality for which the cloud computing environment may be utilized. Examples of workloads and functions which may be provided from this layer include: mapping and navigation 991 ; software development and lifecycle management 992 ; virtual classroom education delivery 993 ; data analytics processing 994 ; transaction processing 995 ; and ordinal mapping DNN system 996 .
- another means at least a second or more.
- each of the expressions “at least one of A, B and C,” “at least one of A, B, or C,” “one or more of A, B, and C,” “one or more of A, B, or C,” and “A, B, and/or C” means A alone, B alone, C alone, A and B together, A and C together, B and C together, or A, B and C together.
- if means “in response to” or “responsive to,” depending upon the context.
- the phrase “if it is determined” may be construed to mean “in response to determining” or “responsive to determining” depending on the context.
- the phrase “if [a stated condition or event] is detected” may be construed to mean “upon detecting [the stated condition or event]” or “in response to detecting [the stated condition or event]” or “responsive to detecting [the stated condition or event]” depending on the context.
- one embodiment means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment described within this disclosure.
- appearances of the aforementioned phrases and/or similar language throughout this disclosure may, but do not necessarily, all refer to the same embodiment.
- the phrases “in response to” and “responsive to” mean responding or reacting readily to an action or event. Thus, if a second action is performed “in response to” or “responsive to” a first action, there is a causal relationship between an occurrence of the first action and an occurrence of the second action. The phrases “in response to” and “responsive to” indicate the causal relationship.
- real-time means a level of processing responsiveness that a user or system senses as sufficiently immediate for a particular process or determination to be made, or that enables the processor to keep up with some external process.
- substantially means that the recited characteristic, parameter, or value need not be achieved exactly, but that deviations or variations, including for example, tolerances, measurement error, measurement accuracy limitations, and other factors known to those of skill in the art, may occur in amounts that do not preclude the effect the characteristic was intended to provide.
- the present invention may be a system, a method, and/or a computer program product at any possible technical detail level of integration
- the computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention
- the computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device.
- the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.
- RAM random access memory
- ROM read-only memory
- EPROM or Flash memory erasable programmable read-only memory
- SRAM static random access memory
- CD-ROM compact disc read-only memory
- DVD digital versatile disk
- memory stick a floppy disk
- a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon
- a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
- Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
- the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
- a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages.
- the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
- the remote computer may be connected to the user's computer through any type of network, including a LAN or a WAN, or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
- electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
- These computer readable program instructions may be provided to a processor of a computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
- These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- the computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
- each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
- the functions noted in the blocks may occur out of the order noted in the Figures.
- two blocks shown in succession may, in fact, be accomplished as one step, executed concurrently, substantially concurrently, in a partially or wholly temporally overlapping manner, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (20)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/509,840 US12443844B2 (en) | 2021-10-25 | 2021-10-25 | Neural network trained using ordinal loss function |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/509,840 US12443844B2 (en) | 2021-10-25 | 2021-10-25 | Neural network trained using ordinal loss function |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20230132127A1 US20230132127A1 (en) | 2023-04-27 |
| US12443844B2 true US12443844B2 (en) | 2025-10-14 |
Family
ID=86055932
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/509,840 Active 2044-07-12 US12443844B2 (en) | 2021-10-25 | 2021-10-25 | Neural network trained using ordinal loss function |
Country Status (1)
| Country | Link |
|---|---|
| US (1) | US12443844B2 (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12288562B2 (en) * | 2021-12-27 | 2025-04-29 | Nice Ltd. | System and method for spoofing detection |
Citations (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6772149B1 (en) | 1999-09-23 | 2004-08-03 | Lexis-Nexis Group | System and method for identifying facts and legal discussion in court case law documents |
| US20170200205A1 (en) | 2016-01-11 | 2017-07-13 | Medallia, Inc. | Method and system for analyzing user reviews |
| CN109710574A (en) | 2018-12-25 | 2019-05-03 | 东软集团股份有限公司 | A kind of method and apparatus for extracting key message from document |
| US20200065656A1 (en) * | 2016-11-15 | 2020-02-27 | Google Llc | Training neural networks using a clustering loss |
| US20200125803A1 (en) | 2018-02-14 | 2020-04-23 | Capital One Services, Llc | Utilizing machine learning models to identify insights in a document |
| US20200265327A1 (en) | 2017-10-27 | 2020-08-20 | Google Llc | Selecting answer spans from electronic documents using neural networks |
| US10789363B1 (en) | 2018-09-28 | 2020-09-29 | NortonLifeLock Inc. | Identifying and protecting against computer security threats while preserving privacy of individual client devices using condensed local differential privacy (CLDP) |
| US20200342214A1 (en) * | 2018-06-05 | 2020-10-29 | Tencent Technology (Shenzhen) Company Limited | Face recognition method and apparatus, classification model training method and apparatus, storage medium and computer device |
| US10832171B2 (en) | 2017-09-29 | 2020-11-10 | Oracle International Corporation | System and method for data visualization using machine learning and automatic insight of outliers associated with a set of data |
| US20200356952A1 (en) | 2016-11-11 | 2020-11-12 | Iodine Software, Llc. | High fidelity clinical documentation improvement (cdi) smart scoring systems and methods |
| US20210024094A1 (en) * | 2019-07-22 | 2021-01-28 | Perceptive Automata, Inc. | Filtering user responses for generating training data for machine learning based models for navigation of autonomous vehicles |
| US20210034980A1 (en) * | 2016-03-18 | 2021-02-04 | Amazon Technologies, Inc. | Real-time visualization of machine learning models |
| US20210103814A1 (en) * | 2019-10-06 | 2021-04-08 | Massachusetts Institute Of Technology | Information Robust Dirichlet Networks for Predictive Uncertainty Estimation |
| US20210311996A1 (en) * | 2020-04-03 | 2021-10-07 | International Business Machines Corporation | Providing causality augmented information responses in a computing environment |
| US20230040110A1 (en) * | 2021-07-26 | 2023-02-09 | Booz Allen Hamilton Inc. | Continuously generalized ordinal regression |
| US11599927B1 (en) * | 2018-01-17 | 2023-03-07 | Amazon Technologies, Inc. | Artificial intelligence system using deep neural networks for pairwise character-level text analysis and recommendations |
| US20230122392A1 (en) * | 2020-03-30 | 2023-04-20 | Verily Life Sciences Llc | Artificial Intelligence-Based Assistant For Concurrent Review Of Needle Core Prostate Biopsies |
-
2021
- 2021-10-25 US US17/509,840 patent/US12443844B2/en active Active
Patent Citations (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6772149B1 (en) | 1999-09-23 | 2004-08-03 | Lexis-Nexis Group | System and method for identifying facts and legal discussion in court case law documents |
| US20170200205A1 (en) | 2016-01-11 | 2017-07-13 | Medallia, Inc. | Method and system for analyzing user reviews |
| US20210034980A1 (en) * | 2016-03-18 | 2021-02-04 | Amazon Technologies, Inc. | Real-time visualization of machine learning models |
| US20200356952A1 (en) | 2016-11-11 | 2020-11-12 | Iodine Software, Llc. | High fidelity clinical documentation improvement (cdi) smart scoring systems and methods |
| US20200065656A1 (en) * | 2016-11-15 | 2020-02-27 | Google Llc | Training neural networks using a clustering loss |
| US10832171B2 (en) | 2017-09-29 | 2020-11-10 | Oracle International Corporation | System and method for data visualization using machine learning and automatic insight of outliers associated with a set of data |
| US20200265327A1 (en) | 2017-10-27 | 2020-08-20 | Google Llc | Selecting answer spans from electronic documents using neural networks |
| US11599927B1 (en) * | 2018-01-17 | 2023-03-07 | Amazon Technologies, Inc. | Artificial intelligence system using deep neural networks for pairwise character-level text analysis and recommendations |
| US20200125803A1 (en) | 2018-02-14 | 2020-04-23 | Capital One Services, Llc | Utilizing machine learning models to identify insights in a document |
| US20200342214A1 (en) * | 2018-06-05 | 2020-10-29 | Tencent Technology (Shenzhen) Company Limited | Face recognition method and apparatus, classification model training method and apparatus, storage medium and computer device |
| US10789363B1 (en) | 2018-09-28 | 2020-09-29 | NortonLifeLock Inc. | Identifying and protecting against computer security threats while preserving privacy of individual client devices using condensed local differential privacy (CLDP) |
| CN109710574A (en) | 2018-12-25 | 2019-05-03 | 东软集团股份有限公司 | A kind of method and apparatus for extracting key message from document |
| US20210024094A1 (en) * | 2019-07-22 | 2021-01-28 | Perceptive Automata, Inc. | Filtering user responses for generating training data for machine learning based models for navigation of autonomous vehicles |
| US20210103814A1 (en) * | 2019-10-06 | 2021-04-08 | Massachusetts Institute Of Technology | Information Robust Dirichlet Networks for Predictive Uncertainty Estimation |
| US20230122392A1 (en) * | 2020-03-30 | 2023-04-20 | Verily Life Sciences Llc | Artificial Intelligence-Based Assistant For Concurrent Review Of Needle Core Prostate Biopsies |
| US20210311996A1 (en) * | 2020-04-03 | 2021-10-07 | International Business Machines Corporation | Providing causality augmented information responses in a computing environment |
| US20230040110A1 (en) * | 2021-07-26 | 2023-02-09 | Booz Allen Hamilton Inc. | Continuously generalized ordinal regression |
Non-Patent Citations (4)
| Title |
|---|
| Mell, P. et al., The NIST Definition of Cloud Computing, National Institute of Standards and Technology, U.S. Dept. of Commerce, Special Publication 800-145, Sep. 2011, 7 pg. |
| Nayak, P., "Understanding Searches Better Than Ever Before," [online] Google, The Keyword, Oct. 25, 2019, retrieved from the Internet: <https://blog.google/products/search/search-language-understanding-bert/>, 2 pg. |
| Vanderheyden, B. et al., "Ordinal Hyperplane Loss," In Proceedings of 2018 IEEE International Conference on Big Data 2018. |
| Vanderheyden, B., "Ordinal HyperPlane Loss," Analytics and Data Science Dissertations, Kennesaw State University, Oct. 1, 2019, 164 pg. |
Also Published As
| Publication number | Publication date |
|---|---|
| US20230132127A1 (en) | 2023-04-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11631029B2 (en) | Generating combined feature embedding for minority class upsampling in training machine learning models with imbalanced samples | |
| US11853877B2 (en) | Training transfer-focused models for deep learning | |
| US11048718B2 (en) | Methods and systems for feature engineering | |
| US10762439B2 (en) | Event clustering and classification with document embedding | |
| US11941496B2 (en) | Providing predictions based on a prediction accuracy model using machine learning | |
| US20220122000A1 (en) | Ensemble machine learning model | |
| US11720565B2 (en) | Automated query predicate selectivity prediction using machine learning models | |
| US11048564B2 (en) | API evolution and adaptation based on cognitive selection and unsupervised feature learning | |
| US20210012156A1 (en) | Explanation guided learning | |
| US12572592B2 (en) | Automated graph embedding recommendations based on extracted graph features | |
| US11676075B2 (en) | Label reduction in maintaining test sets | |
| US11334935B2 (en) | Method, system, and manufacture for light hypergraph based recommendation | |
| US11775894B2 (en) | Intelligent routing framework | |
| US11816127B2 (en) | Quality assessment of extracted features from high-dimensional machine learning datasets | |
| US11803374B2 (en) | Monolithic computer application refactoring | |
| US12536475B2 (en) | Feature segmentation-based ensemble learning for classification and regression | |
| US20160004664A1 (en) | Binary tensor factorization | |
| US12293393B2 (en) | Predictive service orchestration using threat modeling analytics | |
| US11109085B2 (en) | Utilizing one hash permutation and populated-value-slot-based densification for generating audience segment trait recommendations | |
| US20220343217A1 (en) | Intelligent support framework | |
| US11741128B2 (en) | Model-free high confidence data clustering | |
| US12443844B2 (en) | Neural network trained using ordinal loss function | |
| US12327256B2 (en) | Technical support service location recommendation using machine learning | |
| EP4654017A1 (en) | Composite risk score for cloud software deployments | |
| US20230206114A1 (en) | Fair selective classification via a variational mutual information upper bound for imposing sufficiency |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW YORK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:VANDERHEYDEN, ROBERT;CHAMBERLIN, WILLIAM;HANDY BOSMA, JOHN;SIGNING DATES FROM 20211019 TO 20211025;REEL/FRAME:057903/0131 |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: AWAITING TC RESP., ISSUE FEE NOT PAID |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |