Latent Semantic Index or Analysis (LSI or LSA) Patents (Class 707/739)
  • Patent number: 10642881
    Abstract: A method of emotive autography includes calculating a plurality of classifiers associated with an individual user. Each of the classifiers indicates a preference of the user for an associated type of multimedia content. Multimedia data is received including video data, audio data and/or image data. The multimedia data is divided into semantically similar segments. A respective preference score is assigned to each of the semantically similar segments by use of the classifiers. The semantically similar segments are arranged in a sequential order dependent upon the preference scores. An emotive autograph is presented based on the semantically similar segments arranged in the sequential order.
    Type: Grant
    Filed: June 30, 2016
    Date of Patent: May 5, 2020
    Assignee: Intel Corporation
    Inventors: Sudhir K. Singh, Abhishek Narain, Jose M. Rodriguez, Prasad Modali
  • Patent number: 10642891
    Abstract: Relational graphs may be used to extract information. Similarities between the relational graphs and the items they represent may be determined. For example, when applied to video searching, relational graphs may be obtained from searching videos to extract objects, events and/or relations therebetween. Each relational graph may comprise a plurality of nodes and edges, wherein at least some of the detected objects and events are represented by each node, and wherein each edge represents a relationship between two nodes. Subgraphs may be extracted from each relational graph and dimension reduction may be performed on the subgraphs to obtain a reduced variable set which may then be used to perform searches, such as similarity analyses of videos.
    Type: Grant
    Filed: April 14, 2014
    Date of Patent: May 5, 2020
    Inventors: Tae Eun Choe, Hongli Deng, Mun Wai Lee, Feng Guo
  • Patent number: 10642875
    Abstract: A processor-implemented method generates a plurality of smoothed transition vectors from a plurality of training data. The method receives a plurality of text and a query. The method converts the plurality of received text to a word embedding space. The method converts the received query to a set of coordinates from the word embedding space and a set of the plurality of determined smoothed transition vectors. The method determines a plurality of candidate answers based on adding the set of the smoothed transition vectors to the set of coordinates in the word embedding space. The method determines an answer to the received query, based on applying a filter, wherein the filter is selected from a group consisting of a type filtering, a conflicting type filtering, and an equivalence filtering, and the method displays the determined answer.
    Type: Grant
    Filed: April 28, 2017
    Date of Patent: May 5, 2020
    Assignee: International Business Machines Corporation
    Inventors: Brendan Bull, Paul Lewis Felt
  • Patent number: 10642845
    Abstract: Systems and methods are disclosed for improving search results returned to a user from one or more domains, utilizing query features learned locally on the user's device. One or more domains can inform a computing device of one or more features related to a search query upon which the computing device can apply local learning. A local search system can include a local database, a local search history and feedback history database, and a local learning system to identify features about query terms. The features can be learned from the user's interaction with both local search results and remote search results, without sending the user interaction information or other user identification information to a remote search engine. A locally learned feature can be used to extend a query, bias a query term, or filter query results.
    Type: Grant
    Filed: September 30, 2014
    Date of Patent: May 5, 2020
    Assignee: Apple Inc.
    Inventors: John M. Hornkvist, Gaurav Kapoor
  • Patent number: 10621648
    Abstract: In some embodiments, apparatuses and methods are provided herein useful to recommending products to a customer based on derived subjective attributes for the products. In some embodiments, a system for recommending products to a customer comprises a customer prompt module configured to receive an indication of a category, determine, based on derived subjective attributes of the customer, an initial set of products, select a prompt based on an estimated number of products in the initial set of products that can be eliminated, incorporate, with a script, the prompt, present the prompt, receive the response to the prompt, a scoring module configured to calculate a score for the response to the prompt, a product recommendation module configured to eliminate at least a portion of the initial set of products, determine that a threshold has been reached, and a presentation module configured to generate a GUI including the remaining products.
    Type: Grant
    Filed: May 30, 2019
    Date of Patent: April 14, 2020
    Assignee: Walmart Apollo, LLC
    Inventor: Randall E. Davis
  • Patent number: 10607042
    Abstract: A computing server configured to process data of a domain from unstructured data sources to generate natural language phrases describing relationships between entities identified from the unstructured data. The computing server may receive master data schema and domain knowledge ontology of a domain including relationship definitions in the domain. The computing server may identify targeted types of named entities of the domain from the master data schema according to the relationship definitions in the domain knowledge ontology. The computing server may extract a plurality of named entities from unstructured data of the domain. The computing server may generate one or more sequences of named entities and assign entity labels to the named entities. The computing server may, based on the entity labels, generate natural language phrases describing relationships of sets of named entities.
    Type: Grant
    Filed: October 31, 2019
    Date of Patent: March 31, 2020
    Assignee: Live Objects, Inc.
    Inventors: Sudipto Shankar Dasgupta, Kamesh Raghavendra
  • Patent number: 10592541
    Abstract: Technologies for dynamic automated content discovery include a computing device that determines a contextual part of a document selected by a user and extracts one or more key terms from the contextual part of the document using an automated key phrase extraction algorithm. The computing device may perform a syntactic algorithm, named entity recognition, or the TextRank algorithm. The computing device may calculate a vagueness score for terms of the document by querying a semantic database and select the key terms based on the corresponding vagueness scores. The computing device performs a content search based on the key terms to generate one or more search results and presents the search results to the user. The computing device may associate each of the search results with the corresponding key term of the contextual part of the document, for example by visually highlighting the key term. Other embodiments are described and claimed.
    Type: Grant
    Filed: May 29, 2015
    Date of Patent: March 17, 2020
    Assignee: Intel Corporation
    Inventors: Elliot Smith, Max Waterman, Plamena Manolova, Karolina Kret, Mikael Metthey, Alok Barsode
  • Patent number: 10558985
    Abstract: There are disclosed marketing analytics apparatus and processes. There is a data store of interactions between a plurality of customers and one or more vendors over a plurality of channels. Pathing and attribution of these interactions may be obtained as marketing analytics. Pathing may be obtained in part with a match programming statement which identifies all of the paths in the data store matching criteria specified in the match programming statement. Pathing may be obtained in part with a split programming statement which splits all of the journeys in the data store into paths. After pathing, an attribution pathing statement may be used to attribute conversions to other events.
    Type: Grant
    Filed: November 23, 2016
    Date of Patent: February 11, 2020
    Assignee: Impact Radius, Inc.
    Inventors: Paul Mazak, Kevin Gipe
  • Patent number: 10534808
    Abstract: A visual query such as a photograph, a screen shot, a scanned image, a video frame, or an image created by a content authoring application is submitted to a visual query search system. The search system processes the visual query by sending it to a plurality of parallel search systems, each implementing a distinct visual query search process. These parallel search systems may include but are not limited to optical character recognition (OCR), facial recognition, product recognition, bar code recognition, object-or-object-category recognition, named entity recognition, and color recognition. Then at least one search result is sent to the client system. In some embodiments, when the visual query is an image containing a text element and a non-text element, at least one search result includes an optical character recognition result for the text element and at least one image-match result for the non-text element.
    Type: Grant
    Filed: February 18, 2014
    Date of Patent: January 14, 2020
    Assignee: GOOGLE LLC
    Inventor: David Petrou
  • Patent number: 10521510
    Abstract: When retrieving specific text from a search target document, an information processing apparatus receives text, generates a semantic structure indicating a meaning of a word that is included in the received text, by subjecting the received text to semantic analysis, identifies a word that is associated with the generated semantic structure, by referring to a synonym dictionary that stores a word and semantic structure indicating a meaning of the word in an associated manner, determines whether the identified word is included in the search target document, and outputs information according to a determination result.
    Type: Grant
    Filed: August 31, 2017
    Date of Patent: December 31, 2019
    Inventors: Nobuko Takase, Kazuo Mineno, Naohiro Itou
  • Patent number: 10521670
    Abstract: A system includes a plurality of summarization engines, each summarization engine to receive video content, via a processing system, and to provide a summary of the video content, thereby providing a plurality of summaries of the video content. The system includes a plurality of meta-algorithmic patterns, each meta-algorithmic pattern to be applied to at least two of the summaries to provide, via the processing system, a meta-summary of the video content using the at least two summaries, thereby providing a plurality of meta-summaries of the video content. The system includes an evaluator to evaluate the plurality of summaries and the plurality of meta-summaries and to determine similarity measures of the video content over each given class of a plurality of classes of video content, and to select a class of the plurality of classes based on the determined similarity measures.
    Type: Grant
    Filed: October 30, 2015
    Date of Patent: December 31, 2019
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Steven J Simske, Tong Zhang, Manav Das
  • Patent number: 10496650
    Abstract: Video segments related to an annotation term are identified from a target video. A video dataset and an image data set are searched using the annotation term to generate a video set and an image set. The video set and the image set are iteratively refined to generate a set of iconic images. A frame level model is generated using the set of iconic images and video segments related to the annotation term are identified from the target video by applying the frame level model to frames of the target video.
    Type: Grant
    Filed: February 25, 2015
    Date of Patent: December 3, 2019
    Assignee: Google LLC
    Inventors: Chen Sun, Sanketh Shetty
  • Patent number: 10482119
    Abstract: A method for assigning a topic to a collection of microblog posts may include, by an acquisition module, receiving from at least one messaging service server, a plurality of posts, wherein each of the plurality of posts comprises post content; by a generation module, analyzing the posts and extracting, from at least one of the posts, a link with an address to an external document; and, by the acquisition module, accessing the external document that is associated with the address and fetching external content associated with the document. The method may also include, by the generation module: analyzing the post content to identify at least one label for each post, for each post that includes a link, analyzing the external content to identify a topic, and using a topic modeling technique to generate a trained topic model comprising a plurality of topics and a plurality of associated words.
    Type: Grant
    Filed: April 14, 2016
    Date of Patent: November 19, 2019
    Assignee: Conduent Business Services, LLC
    Inventors: Saurabh Kataria, Arvind Agarwal
  • Patent number: 10474700
    Abstract: A document stream filtering system includes computing hardware which is operable to execute one or more software products recorded on machine-readable data storage media. The computing hardware is operable to receive and store one or more documents of a document stream in a search index, receive one or more reference documents, calculate global document frequencies for the one or more documents of the document stream, generate a set of relevant terms and corresponding weights, generate a query by classifying the relevant terms into first and second categories, retrieve one or more documents from the search index based on the query, and filter the one or more retrieved documents based on a cut-off score.
    Type: Grant
    Filed: February 9, 2015
    Date of Patent: November 12, 2019
    Assignee: NEKTOON AG
    Inventors: Alexander Sennhauser, Felix Hürlimann
  • Patent number: 10445163
    Abstract: Computer system drift can occur when a computer system or a cluster of computer systems deviates from ideal and/or desired behavior. In a server farm, for example, many different machines may be identically configured to work in conjunction with each other to provide an electronic service (serving web pages, processing electronic payment transactions, etc.). Over time, however, one or more of these systems may drift from previous behavior. Early drift detection can be important, especially in large enterprises, to avoiding costly downtime. Changes in a computer's configuration files, network connections, and/or executable processes can indicate ongoing drift, but collecting this information at scale can be difficult. By using certain hashing and min-Hash techniques, however, drift detection can be streamlined and accomplished for large scale operations. Velocity of drift may also be tracked using a decay function.
    Type: Grant
    Filed: September 28, 2017
    Date of Patent: October 15, 2019
    Assignee: PAYPAL, INC.
    Inventors: Omri Moshe Lahav, Raoul Christopher Johnson, David Tolpin
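The min-Hash idea mentioned in the abstract above can be illustrated with a minimal sketch. This is not the patented method: the function names, the use of SHA-256 as the hash family, the signature length of 64, and the example configuration items are all assumptions chosen for illustration. Each machine's observed items (for example configuration entries or process names) are reduced to a short signature, and the fraction of matching signature slots estimates the Jaccard similarity between two machines, so a falling estimate may hint at drift.

```python
import hashlib


def _hash(seed, item):
    """Deterministic 64-bit hash of an item under a given seed (assumed hash family)."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.sha256(data).digest()[:8], "big")


def minhash_signature(items, num_hashes=64):
    """Keep the minimum hash value per seed; `items` must be non-empty."""
    return [min(_hash(seed, it) for it in items) for seed in range(num_hashes)]


def estimated_similarity(sig_a, sig_b):
    """Fraction of signature slots that agree, an estimate of Jaccard similarity."""
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)


# Hypothetical snapshots of two servers that should be configured identically.
baseline = {"nginx.conf:v3", "port:443", "proc:nginx", "proc:sshd"}
current = {"nginx.conf:v3", "port:443", "proc:nginx", "proc:cryptominer"}

score = estimated_similarity(minhash_signature(baseline), minhash_signature(current))
print(f"estimated similarity to baseline: {score:.2f}")
```

Comparing fixed-length signatures rather than full item sets is what keeps the comparison cheap at scale; the decay function for drift velocity mentioned in the abstract is not modeled here.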
  • Patent number: 10423723
    Abstract: In accordance with a first exemplary embodiment, there is provided a method for extracting semantic topics from document sets in which opinions about an object are described using an apparatus capable of calculating a probability distribution. The method includes (a) extracting word distributions about sentiment global topics and sentiment local topics; (b) extracting a global topic distribution, a local topic distribution and sentiment distributions about the global and local topics from the document sets; (c) performing statistical inference about each of the distributions extracted in the step (a) and the step (b); (d) extracting a global or local topic and a sentiment relevant to the global or local topic from the distributions of the inference performed in the step (c); and (e) extracting a word from the word distributions about sentiment global topics or sentiment local topics on the basis of the topic and sentiment extracted in the step (d).
    Type: Grant
    Filed: June 3, 2015
    Date of Patent: September 24, 2019
    Inventors: Sang Keun Lee, Md Hijbul Alam
  • Patent number: 10404816
    Abstract: Determining browsing activities is described. In one or more implementations, browsing history data, indicating navigation to websites using a web platform, is analyzed to determine a browsing activity, such as shopping, planning a trip, and so forth. The websites navigated to using the web platform as part of the browsing activity are then stored with the browsing activity to enable subsequent access to the websites. In one or more implementations, for each browsing activity, one or more suggested websites which are related to the browsing activity are determined and stored with the browsing activity to enable access to the suggested websites.
    Type: Grant
    Filed: December 5, 2014
    Date of Patent: September 3, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Paula J. Chuchro, Michael John Patten, Akriti Dokania
  • Patent number: 10380649
    Abstract: In accordance with an embodiment, described herein is a system and method for logistic matrix factorization of implicit feedback data, with application to media environments or streaming services. While users interact with an environment or service, for example a music streaming service, usage data reflecting implicit feedback can be collected in an observation matrix. A logistic function can be used to determine latent factors that indicate whether particular users are likely to prefer particular items. Exemplary use cases include providing personalized recommendations, such as personalized music recommendations, or generating playlists of popular artists.
    Type: Grant
    Filed: March 3, 2015
    Date of Patent: August 13, 2019
    Assignee: SPOTIFY AB
    Inventor: Christopher Johnson
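As a rough illustration of how a logistic function can turn latent factors into a preference estimate, the sketch below scores one user-item pair. It assumes the common formulation in which the probability is the sigmoid of the dot product of a user vector and an item vector plus optional bias terms; the bias terms, the function name, and the example vectors are assumptions, and the training of the factors from the observation matrix is omitted entirely.

```python
import math


def preference_probability(user_factors, item_factors, user_bias=0.0, item_bias=0.0):
    """Sigmoid of (user . item + biases): estimated chance the user prefers the item."""
    score = sum(u * v for u, v in zip(user_factors, item_factors)) + user_bias + item_bias
    return 1.0 / (1.0 + math.exp(-score))


# Hypothetical 3-dimensional latent factors for one listener and two artists.
listener = [0.9, -0.2, 0.4]
artist_a = [1.1, 0.0, 0.5]    # points in a similar direction to the listener
artist_b = [-0.8, 0.6, -0.3]  # points in a roughly opposite direction

print(preference_probability(listener, artist_a))
print(preference_probability(listener, artist_b))
```

Ranking items by this probability for a given user is one plausible way to produce the personalized recommendations the abstract describes.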
  • Patent number: 10380187
    Abstract: A method, system, and recording medium for knowledge graph augmentation using data based on a statistical analysis of attributes in the data, including mapping classes, attributes, and instances of the classes of the data, indexing semantically similar input data elements based on the mapped data using at least one of a label-based analysis, a content-based analysis, and an attribute-based clustering, and ranking the semantically similar input data elements to create a ranked list.
    Type: Grant
    Filed: October 30, 2015
    Date of Patent: August 13, 2019
    Inventors: Oktie Hassanzadeh, Oliver Lehmberg, Mohammad Sadoghi Hamedani
  • Patent number: 10372714
    Abstract: A candidate document is received, for example, by a document filter. A determination is made based on the content of the candidate document, whether the candidate document is relevant to a document corpus. A determination is made based on the content of the candidate document, whether the candidate document is novel with respect to the document corpus. In response to determining that the candidate document is relevant to the document corpus and novel with respect to the document corpus, the candidate document is added to the document corpus to make at least a portion of the content of the candidate document available for a response to a search query.
    Type: Grant
    Filed: February 5, 2016
    Date of Patent: August 6, 2019
    Assignee: International Business Machines Corporation
    Inventors: Charles Evan Beller, William G Dubyak, Palani Sakthi, Kristen Maria Summers
  • Patent number: 10372745
    Abstract: A method, and associated computer system and computer program product, for computing a value between two concepts in a schema containing concepts which are linked to each other through associations. In a schema S of n concepts C1, C2 . . . Ci . . . Cn, the concepts are linked by associations, each association having a semantic distance set in a range between a minimum and a maximum indicating the concepts are completely similar or dissimilar respectively. An information value is determined between concepts from their semantic distance and informational distance. For dissimilar concepts, the informational distance is computed according to a closeness of the concepts. For similar concepts, the informational distance is computed according to a remoteness of the concepts. Both the first and second functions increase with a number of links between C1 and another concept. The number of links is a topological distance between C1 and the other concept.
    Type: Grant
    Filed: October 3, 2016
    Date of Patent: August 6, 2019
    Assignee: International Business Machines Corporation
    Inventor: Freddy Lorge
  • Patent number: 10372823
    Abstract: Described is a system for generating a semantic space based on the lexical relations between words. The system determines synonym and antonym relations between a set of words. A lexical graph is generated based on the synonym and antonym relations. Manifold embedding of the lexical graph is determined, and Laplacian coordinates of the manifold embedding are assigned as semantic features of the set of words. A quantitative representation of the set of words is generated using the semantic features.
    Type: Grant
    Filed: October 21, 2016
    Date of Patent: August 6, 2019
    Assignee: HRL Laboratories, LLC
    Inventors: Hankyu Moon, Rajan Bhattacharyya, James Benvenuto
  • Patent number: 10353938
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for aggregating task data for multiple users. In one aspect, a method includes accessing action trail data that corresponds to a task and resources related to that task, wherein each task relates to one or more related topics and is defined by a sequence of user actions corresponding to the resources related to that task; clustering the action trails based on the action trail data such that each action trail cluster corresponds to a particular task and includes the action trails corresponding to that particular task; and for each action trail cluster, ranking the resources that correspond to the included action trails according to the topics of the particular task.
    Type: Grant
    Filed: February 27, 2013
    Date of Patent: July 16, 2019
    Assignee: Google LLC
    Inventors: Radhika Malpani, Elin R. Pedersen
  • Patent number: 10331676
    Abstract: Items of interest within digital information may be detected and associated with a label that provides context to the item of interest. The label may describe an item category of the item of interest. The knowledge base of item categories may be limited. Additional item categories may be learned by accessing sets of vocabulary that may relate to the known item categories.
    Type: Grant
    Filed: April 13, 2016
    Date of Patent: June 25, 2019
    Assignee: Disney Enterprises, Inc.
    Inventors: Yanwei Fu, Leonid Sigal
  • Patent number: 10282672
    Abstract: A processing device determines a plurality of visual concepts for visual data based on at least one of visual entities in the visual data or feature-level attributes in the visual data, wherein the visual entities are based on the feature-level attributes, and wherein each of the plurality of visual concepts comprises a subject visual entity related to an object visual entity by a predicate. The processing device further determines one or more visual semantics for the visual data based on the plurality of visual concepts, wherein the one or more visual semantics define relationships between the plurality of visual concepts.
    Type: Grant
    Filed: June 26, 2014
    Date of Patent: May 7, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Pragyana K. Mishra, Danny Guan
  • Patent number: 10212572
    Abstract: The present invention extends to methods, systems, and computer program products for detecting and validating planned event information. A plurality of normalized signals is accessed. Planned event data across the plurality of normalized signals is checked for inconsistencies. Any inconsistencies are resolved in an automated fashion, for example, through reference to databases containing additional information. A planned event can be detected/validated from concurring and/or resolved planned event data. A validator can refer to an event history database and/or a planning system to validate a possible planned event as an actual planned event.
    Type: Grant
    Filed: September 5, 2018
    Date of Patent: February 19, 2019
    Assignee: Banjo, Inc.
    Inventors: Damien Patton, Joshua J. Newman, Tilmann Bruckhaus
  • Patent number: 10159106
    Abstract: An automatic wireless docking system includes a source device that includes a source device display screen, a display device, and a sink device that is coupled to the display device. The sink device determines a location of the source device and determines a motion of the source device. The sink device then identifies a wireless docking intent of the source device with the sink device based on the location of the source device and the motion of the source device. In response to identifying the wireless docking intent, the sink device establishes a current wireless docking session between the source device and the sink device.
    Type: Grant
    Filed: January 29, 2018
    Date of Patent: December 18, 2018
    Assignee: Dell Products L.P.
    Inventors: Joseph Paul Marquardt, Todd Farrell Basche
  • Patent number: 10142774
    Abstract: Systems, methods, and computer-readable storage media for invitational content geofencing. A system first sends, to a server, location data associated with the system, the location data being calculated at the system. The system then receives a listing of places of interest within a geofence including a geographical perimeter for identifying places of interest in the listing, the geofence being based on the location data associated with the system. Next, the system selects a place of interest from the listing based on a location of the system. The system then presents a content item associated with the place of interest.
    Type: Grant
    Filed: July 3, 2017
    Date of Patent: November 27, 2018
    Assignee: Apple Inc.
    Inventors: Thomas Alsina, David T. Wilson, Kenley Sun, Sagar Joshi
  • Patent number: 10078697
    Abstract: Computer-implemented method of and system for searching an inverted index having a plurality of posting lists, comprising: Receiving a search query including a plurality of search terms to be searched. Multithreadedly searching a plurality of complementary sets of corresponding interspaced segments of each of the plurality of posting lists corresponding to the plurality of search terms, each set being searched via a separate thread to yield per-thread search results. Aggregating the per-thread search results to yield aggregated search results. Transmitting at least a portion of the aggregated search results.
    Type: Grant
    Filed: February 25, 2013
    Date of Patent: September 18, 2018
    Assignee: Yandex Europe AG
    Inventor: Petr Sergeevich Popov
  • Patent number: 10073890
    Abstract: A comparison engine configured to utilize combined semantic-probabilistic algorithms to differentiate and compare an input to obtain enumerated results of similarity (items that are similar to other patent-related references), differences (items that are different from other patent-related references), and uniquenesses (how the input text is distinct from other patent-related references).
    Type: Grant
    Filed: August 3, 2015
    Date of Patent: September 11, 2018
    Inventors: Mahmoud Azmi Khamis, Bruce Golden, Rami Ikhreishi
  • Patent number: 10049164
    Abstract: Provided is a multidimensional-range search apparatus (10) including: an acquisition unit (11) that acquires a target index key indicating arbitrary point on a space-filling curve; an extraction unit (12) that extracts prefix data capable of indicating a bit string of an index key included in an unsearched section on the space-filling curve on the basis of a bit string of the target index key; a determination unit (13) that determines overlapping of an inquiry section of a multidimensional-range search and a prefix section on the space-filling curve which is indicated by the prefix data; a specification unit (14) that specifies, as a search point, an index key indicating a minimum point or a maximum point in an overlap section in which the inquiry section overlaps the prefix section, which is closest to the target index key on the space-filling curve, and which is determined to overlap the inquiry section by the determination unit; and a search unit (15) that searches an index storage unit (16) for page infor
    Type: Grant
    Filed: June 2, 2014
    Date of Patent: August 14, 2018
    Assignee: NEC Corporation
    Inventor: Shoji Nishimura
  • Patent number: 10027774
    Abstract: A method of obtaining information on navigation behavior of users accessing web pages, includes obtaining information on web page sessions and correlating the information of at least two web page sessions for one user based on the obtained information on web page sessions. The method further includes extracting information on links from the correlated information of the at least two web page sessions, and inferring information on navigation behavior of the user based on the extracted information on links and the correlated information of the at least two web page sessions for one user.
    Type: Grant
    Filed: October 15, 2013
    Date of Patent: July 17, 2018
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Icaro L. J. Da Silva, Åsa Bertze, Jing Fu
  • Patent number: 10007690
    Abstract: A time series data stager that receives input data sets and outputs output data blocks for ingestion into a time series database, with the output data blocks being sent at timings according to a sliding window based on a predetermined time.
    Type: Grant
    Filed: September 26, 2014
    Date of Patent: June 26, 2018
    Assignee: International Business Machines Corporation
    Inventor: Ulrich A. Finkler
  • Patent number: 10007649
    Abstract: A content management system is disclosed. The system includes at least one server, non-transitory storage, documents, entity-specific section weights, and entity-specific review thresholds. The system further includes at least two client computer systems that enable a user to access a document for at least one of review or modification. The system will, in response to receipt of an indication that changes have been made to one or more sections of a document, A) determine a change value indicative of a quantity of changes made within each section, B) calculate an entity-specific provenance value by multiplying, on a section basis, the change value within each section by the assigned entity-specific weight value for each section, to produce an entity-specific section value for each section, and then summing the entity-specific section values; and C) when any entity-specific provenance value satisfies a review threshold value, to construct and send a review notification.
    Type: Grant
    Filed: September 29, 2017
    Date of Patent: June 26, 2018
    Inventors: Kenytt D. Avery, Edward L. Bader, Jean-Marc Costecalde, Chi M. Nguyen, Kevin N. Trinh
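The provenance calculation in steps A to C of the abstract above amounts to a weighted sum followed by a threshold check, which can be sketched as follows. The section names, weights, threshold, and the choice of "greater than or equal to" as the meaning of "satisfies" are assumptions made for illustration only.

```python
def provenance_value(change_values, section_weights):
    """Sum over sections of (change value * entity-specific section weight)."""
    return sum(
        change_values[section] * section_weights.get(section, 0.0)
        for section in change_values
    )


def needs_review(change_values, section_weights, review_threshold):
    """True when the entity's provenance value reaches its review threshold (assumed >=)."""
    return provenance_value(change_values, section_weights) >= review_threshold


# Hypothetical change counts per section and one entity's weights.
changes = {"intro": 10, "pricing": 3}
legal_weights = {"intro": 0.1, "pricing": 2.0}

print(provenance_value(changes, legal_weights))                    # 10*0.1 + 3*2.0
print(needs_review(changes, legal_weights, review_threshold=5.0))
```

Because the weights are per entity, the same set of edits can trigger a review notification for one reviewer and not for another.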
  • Patent number: 9911211
    Abstract: Provided is a process of adjusting a visualization of a graph in response to user interactions with the visualization, the process including: obtaining a graph; causing a visualization of the graph to be presented on one or more displays having a display area; receiving a request for a lens to be applied to the visualization; selecting a first portion of the graph based on the first portion being presented within the region specified by the lens; and transforming the first portion of the graph.
    Type: Grant
    Filed: April 13, 2017
    Date of Patent: March 6, 2018
    Assignee: Quid, Inc.
    Inventors: Sashikanth Damaraju, Grant Titus
  • Patent number: 9785833
    Abstract: A method for efficiently grouping electronic documents that are likely textual near-duplicates includes processing first and second electronic documents to determine respective sets of character sequence counts. The processing may include, for each document, identifying a plurality of non-contiguous character sequences expressed within the document text, with each character sequence including at least one character from each of at least two different words in the text, and determining character sequence counts for each unique character sequence within the identified character sequences. The method also includes generating one or more similarity metrics, at least by comparing the sets of character sequence counts determined for the first and second electronic documents. The method may also include using the similarity metric(s) to calculate a similarity score, and assigning, based on the similarity score, the second electronic document to a same document group as the first electronic document.
    Type: Grant
    Filed: April 1, 2016
    Date of Patent: October 10, 2017
    Inventor: Robert Jenson Price
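    The general idea in the abstract above, comparing per-document counts of character sequences that span more than one word, can be sketched roughly as follows. This is only an illustration under strong assumptions: the sequence used here (the first characters of two adjacent words), the overlap-ratio metric, and the 0.5 grouping threshold are all hypothetical choices, not the patented method.

```python
from collections import Counter


def sequence_counts(text):
    """Count two-character sequences built from the first letters of adjacent words.

    Each sequence takes one character from each of two different words, so the
    characters are non-contiguous in the original text (an assumed construction).
    """
    words = text.lower().split()
    return Counter(a[0] + b[0] for a, b in zip(words, words[1:]))


def count_similarity(c1, c2):
    """Overlap ratio of two count sets: shared counts divided by combined counts."""
    shared = sum((c1 & c2).values())   # element-wise minimum of counts
    total = sum((c1 | c2).values())    # element-wise maximum of counts
    return shared / total if total else 1.0


doc_a = "the quick brown fox jumps over the lazy dog"
doc_b = "the quick brown fox leaps over the lazy dog"
doc_c = "completely unrelated text about patents"

sim_ab = count_similarity(sequence_counts(doc_a), sequence_counts(doc_b))
sim_ac = count_similarity(sequence_counts(doc_a), sequence_counts(doc_c))

GROUP_THRESHOLD = 0.5  # hypothetical cutoff for "same document group"
same_group_ab = sim_ab >= GROUP_THRESHOLD
same_group_ac = sim_ac >= GROUP_THRESHOLD
print(sim_ab, same_group_ab, sim_ac, same_group_ac)
```

    With these sample strings, the one-word edit leaves most sequence counts shared, so the first pair would be grouped together while the unrelated text would not.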
  • Patent number: 9779291
    Abstract: As visual recognition scales up to ever larger numbers of categories, maintaining high accuracy is increasingly difficult. Embodiments of the present invention include methods for optimizing accuracy-specificity trade-offs in large scale recognition where object categories form a semantic hierarchy consisting of many levels of abstraction.
    Type: Grant
    Filed: October 12, 2015
    Date of Patent: October 3, 2017
    Assignee: The Board of Trustees of the Leland Stanford Junior University
    Inventors: Fei-Fei Li, Jia Deng, Jonathan Krause, Alexander C. Berg
  • Patent number: 9769733
    Abstract: Methods, systems, and devices are described for wireless communication. A first method includes receiving, at a user equipment (UE), a first set of system information; determining, based at least in part on the first set of system information, that additional system information is available; transmitting a request for the additional system information; and receiving the additional system information at the UE. A second method includes transmitting, from a base station, a first set of system information; receiving a request for additional system information; and transmitting the additional system information based at least in part on the request.
    Type: Grant
    Filed: July 20, 2015
    Date of Patent: September 19, 2017
    Assignee: QUALCOMM Incorporated
    Inventors: Keiichi Kubota, Gavin Bernard Horn
  • Patent number: 9741058
    Abstract: Embodiments provide a computer-executable method, computer system and non-transitory computer-readable medium for programmatically analyzing a consumer review. The method includes programmatically accessing, via a network device, one or more consumer reviews for a commercial entity or a commercial object. The method also includes executing a consumer review processing engine to programmatically identify an attribute descriptor in the one or more consumer reviews, and executing the consumer review processing engine to programmatically generate a sentiment score associated with the one or more consumer reviews. The method further includes storing, on a non-transitory computer-readable storage device, the attribute descriptor and the sentiment score in association with the commercial entity or the commercial object.
    Type: Grant
    Filed: March 17, 2016
    Date of Patent: August 22, 2017
    Assignee: Groupon, Inc.
    Inventors: Gaston L'Huillier, Francisco Jose Larrain, Hernan Enrique Arroyo Garcia, Juzheng Li, Daniel Langdon, Jonathan Esterhazy, Srinivasa Raghavan Vedanarayanan, Shawn Jeffery, Feras Karablieh, Bhupesh Bansal, Dor Levi, Amit Koren
  • Patent number: 9734146
    Abstract: Systems, methods and computer-readable media are provided for facilitating patient health care by providing discovery, validation, and quality assurance of nomenclatural linkages between pairs of terms or combinations of terms in databases extant on multiple different health information systems that do not share a set of unified codesets, nomenclatures, or ontologies, or that may in part rely upon unstructured free-text narrative content instead of codes or standardized tags. Embodiments discover semantic structures existing naturally in documents and records, including relationships of synonymy and polysemy between terms arising from disparate processes, and maintained by different information systems. In some embodiments, this process is facilitated by applying Latent Semantic Analysis in concert with decision-tree induction and similarity metrics.
    Type: Grant
    Filed: September 4, 2014
    Date of Patent: August 15, 2017
    Assignee: Cerner Innovation, Inc.
    Inventors: Douglas S. McNair, John Christopher Murrish, Kanakasabha Kailasam
  • Patent number: 9734181
    Abstract: The present invention extends to methods, systems, and computer program products for understanding tables for search. Aspects of the invention include identifying a subject column for a table, detecting a column header using other tables, and detecting a column header using a knowledge base. Implementations can be utilized in a structured data search system (SDSS) that indexes structured information, such as tables in a relational database or HTML tables extracted from web pages. The SDSS allows users to search over the structured information (tables) using different mechanisms including keyword search and data finding data.
    Type: Grant
    Filed: October 2, 2014
    Date of Patent: August 15, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Zhongyuan Wang, Kanstantsyn Zoryn, Zhimin Chen, Kaushik Chakrabarti, James P. Finnigan, Vivek R. Narasayya, Surajit Chaudhuri, Kris Ganjam
  • Patent number: 9575952
    Abstract: Topics are determined for short text messages using an unsupervised topic model. In a training corpus created from a number of short text messages, a vocabulary of words is identified, and for each word a distributed vector representation is obtained by processing windows of the corpus having a fixed length. The corpus is modeled as a Gaussian mixture model in which Gaussian components represent topics. To determine a topic of a sample short text message, a posterior distribution over the corpus topics is obtained using the Gaussian mixture model.
    Type: Grant
    Filed: October 21, 2014
    Date of Patent: February 21, 2017
    Inventor: Vivek Kumar Rangarajan Sridhar
  • Patent number: 9558165
    Abstract: A method and system for summarizing messages from a message stream is disclosed in which association analysis is applied to a stream of short data messages comprising words in a spoken language, such as English. Clusters of words are identified that provide a summary of the several conversations (short data messages originating from different human sources) that are imbedded in the message stream. Each word cluster may represent a set of messages that are its instances. The word clusters may collectively constitute a summary of the entire message stream. The word clusters that have been extracted from the message stream may also be grouped into topics. Also, an identity of one or more message originators may be listed based on their influence on the messages being analyzed. The short data messages may also be sorted based on a geographical location of one or more originators of messages.
    Type: Grant
    Filed: August 19, 2012
    Date of Patent: January 31, 2017
    Assignee: EMICEN CORP.
    Inventors: Roy Marsten, Russell Caldwell, Radhika Subramanian
  • Patent number: 9558265
    Abstract: Provided is a process including: obtaining a graph comprising nodes and edges, each of the edges having a value indicating an amount of similarity between objects corresponding to the two linked nodes; selecting a parameter for influencing the graph; assessing each of the nodes based on the selected influencing parameter, wherein assessing comprises, with respect to each adjacent node in the graph sharing an edge with the node: determining the value indicating the amount of similarity between the object corresponding to the node and the object corresponding to the adjacent node; and determining a score related to the edge shared with the node, the score determined based on the similarity-amount value and a value of the selected influencing parameter for the node, such that edges are removed, weakened, added, or strengthened; and preparing, based on the graph, instructions to display at least part of the graph.
    Type: Grant
    Filed: May 12, 2016
    Date of Patent: January 31, 2017
    Assignee: Quid, Inc.
    Inventors: Ruggero Altair Tacchi, Fabio Ciulla
  • Patent number: 9479839
    Abstract: Provided is a method and system for providing a representative phrase with respect to a real time popular keyword, which may determine programs including a popular keyword from broadcast information, and may generate a representative phrase with respect to the popular keyword using the determined programs, thereby providing the representative phrase by combining the generated representative phrase and the popular keyword.
    Type: Grant
    Filed: July 6, 2011
    Date of Patent: October 25, 2016
    Assignee: NHN Corporation
    Inventors: Jae Seung Shin, Young Sub Park, Jae Keol Choi, Won Sook Noh
  • Patent number: 9477751
    Abstract: A system and method for displaying relationships between concepts to provide classification suggestions via injection is provided. A reference set of concepts each associated with a classification code is designated. Clusters of uncoded concepts are designated. One or more of the uncoded concepts from at least one cluster are compared to the reference set. At least one of the concepts in the reference set that is similar to the one or more uncoded concepts is identified. The similar concepts are injected into the at least one cluster. Relationships between the uncoded concepts and the similar concepts in the at least one cluster are visually depicted as suggestions for classifying the uncoded concepts.
    Type: Grant
    Filed: July 27, 2010
    Date of Patent: October 25, 2016
    Assignee: FTI Consulting, Inc.
    Inventors: William C. Knight, Nicholas I. Nussbaum, John W. Conwell
  • Patent number: 9472115
    Abstract: Mechanisms for evaluating a link between information concept entities are provided. A set of evidential data specifying a plurality of information concept entities is received and a link between at least two information concept entities in the set of evidential data is generated. The set of evidential data is evaluated with regard to whether or not the set of evidential data supports or refutes the link. The evaluation of the set of evidential data comprises analyzing language of natural language statements in the set of evidential data to identify certainty terms within the natural language statements. A confidence value for the link is calculated based on results of the evaluation of the set of evidential data and a knowledge output is generated based on the link and the confidence value associated with the link.
    Type: Grant
    Filed: November 19, 2014
    Date of Patent: October 18, 2016
    Assignee: International Business Machines Corporation
    Inventors: Darryl M. Adderly, Corville O. Allen, Robert K. Tucker
  • Patent number: 9454602
    Abstract: A device may analyze text to identify a set of text portions of interest, and may analyze the text to identify a set of terms included in the set of text portions. The device may perform a similarity analysis to determine a similarity score. The similarity score may be determined between each term, included in the set of terms, and each text portion, included in the set of text portions, or the similarity score may be determined between each term and each other term included in the set of terms. The device may determine a set of dominant terms based on performing the similarity analysis. The set of dominant terms may include at least one term with a higher average degree of similarity than at least one other term. The device may provide information that identifies the set of dominant terms.
    Type: Grant
    Filed: August 29, 2013
    Date of Patent: September 27, 2016
    Assignee: Accenture Global Services Limited
    Inventors: Janardan Misra, Shubhashis Sengupta, Subhabrata Das
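    The term-to-term variant described in the abstract above (score each term against every other term, then keep the terms with the highest average similarity) can be sketched as follows. The similarity measure is an assumption: this sketch uses the Jaccard overlap of the text portions in which two terms occur, and the function names, whitespace tokenization, and sample portions are hypothetical.

```python
def average_similarity(portions):
    """Map each term to its average co-occurrence similarity with all other terms."""
    # Record which text portions each term appears in.
    occurrences = {}
    for index, portion in enumerate(portions):
        for term in set(portion.lower().split()):
            occurrences.setdefault(term, set()).add(index)

    averages = {}
    for term, where in occurrences.items():
        sims = [
            len(where & other_where) / len(where | other_where)  # Jaccard overlap
            for other, other_where in occurrences.items()
            if other != term
        ]
        averages[term] = sum(sims) / len(sims) if sims else 0.0
    return averages


def dominant_terms(portions, k=2):
    """Return the k terms with the highest average similarity (ties broken by name)."""
    averages = average_similarity(portions)
    return sorted(averages, key=lambda t: (-averages[t], t))[:k]


portions = ["login page error", "login error timeout", "report export"]
top = dominant_terms(portions, k=2)
print(top)
```

    Here "error" and "login" come out on top because they co-occur across two portions, whereas terms confined to a single portion have a lower average similarity to the rest of the vocabulary.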
  • Patent number: 9430485
    Abstract: An information processor coupled to a storage apparatus that stores information, includes: a creation unit configured to create a snapshot of a file system that manages first information stored in the storage apparatus and to output the snapshot to the storage apparatus; a writing unit configured to write second information stored in cache memory onto the storage apparatus after the snapshot has been created; and a replication instruction unit configured to instruct the storage apparatus to create a replication of the first information stored in the storage apparatus after the second information has been written and the snapshot.
    Type: Grant
    Filed: November 6, 2013
    Date of Patent: August 30, 2016
    Inventors: Norihito Kato, Nobuhiro Takano, Norichika Imamura
  • Patent number: 9384233
    Abstract: Methods and systems for automatically synthesizing product information from multiple data sources into an on-line catalog are disclosed, and in particular, for automatically synthesizing the product information based on attribute-value pairs. Information for a product may be obtained, via entity extraction, feed ingestion, and other mechanisms, from a plurality of structured and unstructured data sources having different taxonomies and schemas. Product information may additionally or alternatively be obtained or derived based on popularity data. The product information may be cleansed, segmented and normalized. The product information may be clustered so closest products, attribute names and attribute values are associated. A representative value for an attribute name may be determined, and the on-line catalog may be updated so that entries are comprehensive, meaningful and useful to a catalog user.
    Type: Grant
    Filed: December 4, 2012
    Date of Patent: July 5, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ariel Fuxman, Hoa Nguyen, Juliana Freire de Lima e Silva, Stelios Paparizos, Rakesh Agrawal, Zhimin Chen, Lawrence William Colagiovanni, Prakash Sikchi