Application Patents (Class 704/270)

Speech assisted network (Class 704/270.1)

Handicap aid (Class 704/271)

Novelty item (Class 704/272)

Security system (Class 704/273)

Warning/alarm system (Class 704/274)

Speech controlled system (Class 704/275)

Pattern display (Class 704/276)

Translation (Class 704/277)

Sound editing (Class 704/278)

Audio speech signal analysis for fraud detection

Patent number: 12380896

Abstract: A device, system and method for analyzing audio speech signals to detect fraudulent calls to a contact center comprising splitting an audio recording of a call in real-time into a foreground speech signal attributed to a main speaker and a background audio signal, extracting audio features from the foreground speech signal and background audio signal, inputting the extracted audio features into an ensemble model comprising multiple different machine learning models co-trained to cumulatively detect fraud, wherein the multiple different machine learning models include: a speaker audio model to detect audio speech anomalies, a speaker intent model to classify intent of the main speaker, a synthetic voice detection model to identify a non-human entity, and a prosody model to detect voice intonation of the main speaker. A prediction may be output, by the ensemble model, indicating whether the call is fraudulent.

Type: Grant

Filed: April 30, 2025

Date of Patent: August 5, 2025

Assignee: Morgan Stanley Services Group Inc.

Inventors: Sushil Ninawe, Jayati Tripathi, Cheryl Fernandes, Mehak Mehta, Aratrika Sarkar, Melissa Kagaju
Individual recognition using voice detection

Patent number: 12374340

Abstract: A method, a computer program product, and a computer system determine a name of an individual based on voice detection. The method includes determining voice characteristics of a voice of the individual based on an audio input received and recorded via an audio input device. The method includes comparing the voice characteristics of the voice to further voice characteristics of voice profiles. The method includes as a result of the voice and one of the voice profiles meeting a similarity threshold, determining a name associated with the one of the voice profile. The method includes providing the name to a user who is having a conversation with the individual.

Type: Grant

Filed: June 10, 2022

Date of Patent: July 29, 2025

Assignee: International Business Machines Corporation

Inventors: Ethan S. Headings, Feng-wei Chen, Neha S Deshpande, Madhavi Kolachala
Automatically generating feedback about content shared during a videoconference

Patent number: 12360727

Abstract: Feedback can be automatically generated for visual content presented during a videoconference. For example, a system can receive a user selection of a file containing visual content to be presented to members of a video conference, wherein the visual content includes pages that are to be sequentially presented to the members during the video conference. The system can then facilitate presentation of the visual content to the members of the video conference. The system can also obtain metadata associated with at least one page of the visual content presented during the video conference, determine feedback about the at least one page by analyzing the metadata, and provide the feedback to an editor of the visual content.

Type: Grant

Filed: May 21, 2024

Date of Patent: July 15, 2025

Assignee: Zoom Communications, Inc.

Inventors: Vi Dinh Chau, Graeme Lambourne Geddes
Adaptive simulation of celebrity and legacy avatars

Patent number: 12354187

Abstract: A device, computer-readable medium, and method for adaptive simulation of celebrity and legacy avatars in extended reality environments is disclosed. In one example, a method performed by a processing system including at least one processor includes acquiring preferences from a user with respect to a virtual interaction, matching the preferences to an individual for whom an avatar is available, rendering an extended reality environment in which the virtual interaction will occur, rendering the avatar in the extended reality environment, receiving an input from the user, extracting a meaning from the input, and controlling the avatar to present an output that is responsive to the meaning, wherein the output is generated dynamically using at least one of: an image of the individual, an audio of the individual, or biographical data of the individual.

Type: Grant

Filed: December 23, 2022

Date of Patent: July 8, 2025

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Rashmi Palamadai, Brian Novack, Eric Zavesky
Information processing apparatus and information processing method for processing sound of real environment for displaying virtual experience

Patent number: 12315055

Abstract: An information processing apparatus according to an embodiment of the present technology includes a generation unit and an operation control unit. The generation unit generates environmental information of a real space around a user on the basis of a detection result of a microphone unit that detects a sound in the real space. The operation control unit controls an operation of a virtual object presented in a virtual space constructed in accordance with the real space on the basis of the environmental information.

Type: Grant

Filed: May 21, 2021

Date of Patent: May 27, 2025

Assignee: SONY GROUP CORPORATION

Inventor: Tomohiko Gotoh
Method and system for adjusting sound playback to account for speech detection

Patent number: 12314631

Abstract: A method performed by an audio system comprising a headset. The method sends a playback signal containing user-desired audio content to drive a speaker of the headset that is being worn by a user, receives a microphone signal from a microphone that is arranged to capture sounds within an ambient environment in which the user is located, performs a speech detection algorithm upon the microphone signal to detect speech contained therein, in response to a detection of speech, determines that the user intends to engage in a conversation with a person who is located within the ambient environment, and, in response to determining that the user intends to engage in the conversation, adjusts the playback signal based on the user-desired audio content.

Type: Grant

Filed: October 16, 2023

Date of Patent: May 27, 2025

Assignee: Apple Inc.

Inventors: Christopher T. Eubank, Devin W. Chalmers, Kirill Kalinichev, Rahul Nair, Thomas G. Salter
Systems and methods to obtain feedback in response to autonomous vehicle failure events

Patent number: 12315315

Abstract: The present disclosure provides systems and methods to obtain feedback descriptive of autonomous vehicle failures. In particular, the systems and methods of the present disclosure can detect that a vehicle failure event occurred at an autonomous vehicle and, in response, provide an interactive user interface that enables a human located within the autonomous vehicle to enter feedback that describes the vehicle failure event. Thus, the systems and methods of the present disclosure can actively prompt and/or enable entry of feedback in response to a particular instance of a vehicle failure event, thereby enabling improved and streamlined collection of information about autonomous vehicle failures.

Type: Grant

Filed: January 3, 2024

Date of Patent: May 27, 2025

Assignee: AURORA OPERATIONS, INC.

Inventors: Molly Castle Nix, Sean Chin, Dennis Zhao
Computer-implemented task completion platform for visually impaired students

Patent number: 12300115

Abstract: An ordered interaction task is initiated in a graphical user interface. A main region of the graphical user interface is segmented into a plurality of discrete sub-regions, each sub-region including content of the ordered interaction task. The user is then prompted to begin the ordered interaction task through a non-visual prompt that is provided concurrently with the graphical user interface. In response to a first user-initiated command received in the graphical user interface, a non-visual presentation of at least a portion of the content of at least one sub-region is provided concurrently with the graphical user interface.

Type: Grant

Filed: January 4, 2023

Date of Patent: May 13, 2025

Assignee: Educational Testing Service

Inventors: Shrirang Prakash Sahasrabudhe, Sindhura Jaladhanki, Markku Tapio Hakkinen, Thomas Florek, Dolores Marie Dyer
Access control to secured locations using relaxed biometrics

Patent number: 12277823

Abstract: Example implementations include a method, apparatus, and computer-readable medium for controlling access to a location, comprising receiving an access identifier and a biometric information sample from a user attempting to enter an access point. The implementations further include matching the access identifier to an authenticated cluster identifier corresponding to a biometric information cluster within a plurality of different biometric information clusters, wherein the biometric information cluster is associated with an authenticated user. Additionally, the implementations further include identifying, based on a machine learning model processing, a cluster identifier from the biometric information sample.

Type: Grant

Filed: July 26, 2023

Date of Patent: April 15, 2025

Assignee: Tyco Fire & Security GmbH

Inventors: Gregory A. Makowski, Kshitij Judah, Kris Duszak
Systems and methods for connected natural language models

Patent number: 12265788

Abstract: Methods, systems, apparatuses, and non-transitory computer-readable media are provided for providing answer data through multiple connected large language models. Operations may include receiving, through a graphical user interface associated with a local large language model having access to a first limited private dataset but not a second limited private dataset, an input from a user device, identifying, based on the input, an external large language model from among a plurality of external large language models, transmitting the input to the external large language model, receiving, from the external large language model, the answer data responsive to the input, generating, by the local large language model, response data based on the answer data, and outputting the response data at the user device.

Type: Grant

Filed: June 4, 2024

Date of Patent: April 1, 2025

Assignee: Curio XR

Inventor: Ethan Fieldman
Integrated production automation for real-time multi-frequency data processing and visualization in internet of things (IoT) systems

Patent number: 12253929

Abstract: A system comprising an interoperable digital architecture configured for real-time multi-frequency data collection, processing, and visualization of data from various IoT devices in real-time, an automated event detection module configured to detect events from the IoT devices in real-time and manage multi-frequency data, a modular AI module in communication with the automated event detection module, and configured to process the multi-frequency data in real-time through separate aggregated containers for increased computational efficiency, where the modular AI module is configured to parallelize AI processing, receive the multi-frequency data from the event detection module and processes it in parallel using a plurality of AI algorithms. A data sorting module configured to sort the processed multi-frequency data and send it to destinations across multiple cloud environments, and a relational database for segregation of raw multi-frequency data and processed multi-frequency data.

Type: Grant

Filed: September 5, 2024

Date of Patent: March 18, 2025

Assignee: Enovate AI Corporation

Inventors: Camilo Mejia, Daniel Martinez, Rebecca Nye
Automatic adaptation of multi-modal system components

Patent number: 12243519

Abstract: A component management server computer (“server”) and processing methods are disclosed. In some embodiments, the server is programmed to continuously receive input data regarding what is happening in the physical room from one or more input devices. The server is programmed to then detect an utterance of a spoken word from the input data and generate one or more sound metrics based on the input data. Based on the sound metrics as applied to certain criteria, the server is programmed to activate a component, such as an input device, variable, software system, or output device, and cause one or more output devices to execute an action that alerts a user of the activated component. The server can also be programmed to turn on, off, up, or down any of the components based on the activated component.

Type: Grant

Filed: November 3, 2021

Date of Patent: March 4, 2025

Assignee: Merlyn Mind, Inc.

Inventors: Mohammad Niknazar, Aditya Vempaty, Robert Smith, Amol Nayate, Javier Villafana, Ravindranath Kokku, Shom Ponoth, Sharad Sundararajan, Satya Nitta
Determining whether to automatically resume first automated assistant session upon cessation of interrupting second session

Patent number: 12243526

Abstract: Determining whether, upon cessation of a second automated assistant session that interrupted and supplanted a prior first automated assistant session: (1) to automatically resume the prior first automated assistant session, or (2) to transition to an alternative automated assistant state in which the prior first session is not automatically resumed. Implementations further relate to selectively causing, based on the determining and upon cessation of the second automated assistant session, either the automatic resumption of the prior first automated assistant session that was interrupted, or the transition to the state in which the first session is not automatically resumed.

Type: Grant

Filed: August 28, 2023

Date of Patent: March 4, 2025

Assignee: GRAY ICE HIGDON

Inventors: Andrea Terwisscha van Scheltinga, Nicolo D'Ercole, Zaheed Sabur, Bibo Xu, Megan Knight, Alvin Abdagic, Jan Lamecki, Bo Zhang
Language model biasing modulation

Patent number: 12230251

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for modulating language model biasing. In some implementations, context data is received. A likely context associated with a user is determined based on at least a portion of the context data. One or more language model biasing parameters based at least on the likely context associated with the user is selected. A context confidence score associated with the likely context based on at least a portion of the context data is determined. One or more language model biasing parameters based at least on the context confidence score is adjusted. A baseline language model based at least on the one or more of the adjusted language model biasing parameters is biased. The baseline language model is provided for use by an automated speech recognizer (ASR).

Type: Grant

Filed: December 12, 2022

Date of Patent: February 18, 2025

Assignee: Google LLC

Inventors: Pedro J. Moreno Mengibar, Petar Aleksic
Dynamic remediation of pluggable streaming devices

Patent number: 12225067

Abstract: The present disclosure describes a system and method for providing dynamic remediation of a pluggable streaming device issue, such as a customer premises equipment (CPE) device. Sometimes, various features of the CPE device to begin to fail. For example, synchronization of the audio and video streams may drift, media rental purchases may time out, or playback may throttle to low quality. Such failures can be caused by device or network issues. The present disclosure describes a CPE remediation system that operates to identify a failure associated with playing media streamed by the CPE device. The CPE remediation system may further determine a solution to remediate an observed CPE device-related failure. In some examples, the CPE remediation process may further provide or perform one or more actions included in the determined solution. In some examples, the solution may include a warm or a cold reboot.

Type: Grant

Filed: December 22, 2023

Date of Patent: February 11, 2025

Assignee: CenturyLink Intellectual Property LLC

Inventors: John R. B. Woodworth, Dean Ballew
Natural language understanding for visual tagging

Patent number: 12204869

Abstract: A tag characterizing a portion of a multi-view interactive digital media representation (MVIDMR) may be determined by applying a grammar to natural language data. The MVIDMR may include images of an object and may be navigable in one or more dimensions. An object model location for the tag identifying a location within a three-dimensional object model may be determined by applying the grammar to the natural language data. The tag may then be applied to the MVIDMR by associating it with two or more of the images at positions determined based on the object model location.

Type: Grant

Filed: April 28, 2020

Date of Patent: January 21, 2025

Assignee: Fyusion, Inc.

Inventors: Abhishek Kar, Martin Markus Hubert Wawro, Stefan Johannes Josef Holzer, Radu Bogdan Rusu
Selective inclusion of speech content in documents

Patent number: 12190886

Abstract: In an approach for enabling a user to visualize a transcript of a discussion via a head-mounted AR device and to selectively copy one or more parts of the transcript that is contextually relevant for inclusion in a new document file or a previously created document file, a processor captures audio of a spoken content of a first participant of a discussion via an AR device worn by a user. A processor analyzes the audio of the spoken content of the first participant. A processor converts the audio of the spoken content of the first participant to text to create a transcript. A processor creates a visualization of the transcript. A processor presents the visualization of the transcript to the user via the AR device. A processor enables the user to copy one or more parts of the transcript into a document file via a selection support.

Type: Grant

Filed: September 27, 2021

Date of Patent: January 7, 2025

Assignee: International Business Machines Corporation

Inventors: Sushain Pandit, Sarbajit K. Rakshit
Image output device

Patent number: 12179590

Abstract: The present invention relates to an image output device mounted on a vehicle to implement augmented reality. One or more of an autonomous driving vehicle, a user terminal, and a server of the present invention can be linked to an artificial intelligence module, a drone (unmanned aerial vehicle, UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G services, etc.

Type: Grant

Filed: January 30, 2020

Date of Patent: December 31, 2024

Assignee: LG Electronics Inc.

Inventors: Dukyung Jung, Kihyung Lee
System and method for improving named entity recognition

Patent number: 12170079

Abstract: A method includes training a set of teacher models. Training the set of teacher models includes, for each individual teacher model of the set of teacher models, training the individual teacher model to transcribe unlabeled audio samples and predict a pseudo labeled dataset having multiple labels. At least some of the unlabeled audio samples contain named entity (NE) audio data. At least some of the labels include transcribed NE labels corresponding to the NE audio data. The method also includes correcting at least some of the transcribed NE labels using user-specific NE textual data. The method further includes retraining the set of teacher models based on the pseudo labeled dataset from a selected one of the teacher models, where the selected one of the teacher models predicts the pseudo labeled dataset more accurately than other teacher models of the set of teacher models.

Type: Grant

Filed: August 3, 2021

Date of Patent: December 17, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Divya Neelagiri, Taeyeon Ki, Vijendra Raj Apsingekar
Intent detection via multi-hop unified syntactic graph

Patent number: 12153878

Abstract: A method for detecting business intent from a business intent corpus by employing an Intent Detection via Multi-hop Unified Syntactic Graph (IDMG) is presented. The method includes parsing each text sample representing a business need description to extract syntactic information including at least tokens and words, tokenizing the words of the syntactic information to generate sub-words for each of the words by employing a multi-lingual pre-trained language model, aligning the generated sub-words to the tokens of the syntactic information to match ground-truth intent actions and objects to the tokenized sub-words, generating a unified syntactic graph, encoding, via a multi-hop unified syntactic graph encoder, the unified syntactic graph to generate an output, and predicting an intent action and object from the output.

Type: Grant

Filed: April 12, 2022

Date of Patent: November 26, 2024

Assignee: NEC Corporation

Inventors: Xuchao Zhang, Yanchi Liu, Haifeng Chen
Detection of live speech

Patent number: 12142259

Abstract: A method of detecting live speech comprises: receiving a signal containing speech; obtaining a first component of the received signal in a first frequency band, wherein the first frequency band includes audio frequencies; and obtaining a second component of the received signal in a second frequency band higher than the first frequency band. Then, modulation of the first component of the received signal is detected; modulation of the second component of the received signal is detected; and the modulation of the first component of the received signal and the modulation of the second component of the received signal are compared. It may then be determined that the speech may not be live speech, if the modulation of the first component of the received signal differs from the modulation of the second component of the received signal.

Type: Grant

Filed: May 16, 2023

Date of Patent: November 12, 2024

Assignee: Cirrus Logic Inc.

Inventors: John Paul Lesso, Toru Ido
Method and system for sound monitoring over a network

Patent number: 12089011

Abstract: A mobile communication environment (100) can include a mobile device (160) to measure and send sound pressure level data. The mobile device (160) can initiate the collection of audio information responsive to detecting a trigger event. Mobile device (160) can measure or calculate the sound pressure level from the audio information. Metadata including time information and geographic location information can be captured with the collected audio information. Mobile device (160) can send the sound pressure level data and metadata through a wired or wireless communication path to a database (614).

Type: Grant

Filed: April 29, 2021

Date of Patent: September 10, 2024

Assignee: ST FamTech, LLC

Inventors: Steven Wayne Goldstein, Marc Boillot, Jason McIntosh, John P. Keady
Methods, apparatuses and computer program products for providing a conversational data-to-text system

Patent number: 12072874

Abstract: Methods, apparatuses and computer program products for providing a conversational data-to-text system are described herein. An example method may include receiving a first natural language query from a client device; generating a first analytic operation instruction associated with a multi-dimensional dataset based at least in part on the first natural language query; determining a first multi-dimensional data object based at least in part on the first analytic operation instruction and the multi-dimensional dataset; generating a first natural language response to the first natural language query based at least in part on the first multi-dimensional data object; and transmitting the first natural language response to the client device.

Type: Grant

Filed: August 31, 2021

Date of Patent: August 27, 2024

Assignee: Arria Data2Text Limited

Inventors: Kapila Anuruddha Ponnamperuma Arachchi, Rodrigo Gomes De Oliveira, John William Alexander, Daniel da Silva De Paiva, Neil Stuart Burnett
Method and system for programmatic analysis of consumer reviews

Patent number: 12073444

Abstract: Embodiments provide a computer-executable method, computer system and non-transitory computer-readable medium for programmatically analyzing a consumer review. The method includes programmatically accessing, via a network device, one or more consumer reviews for a commercial entity or a commercial object. The method also includes executing a consumer review processing engine to programmatically identify an attribute descriptor in the one or more consumer reviews, and executing the consumer review processing engine to programmatically generate a sentiment score associated with the one or more consumer reviews. The method further includes storing, on a non-transitory computer-readable storage device, the attribute descriptor and the sentiment score in association with the commercial entity or the commercial object.

Type: Grant

Filed: December 23, 2020

Date of Patent: August 27, 2024

Assignee: Bytedance Inc.

Inventors: Gaston L'Huillier, Francisco Jose Larrain, Hernan Enrique Arroyo Garcia, Juzheng Li, Daniel Langdon, Jonathan Esterhazy, Srinivasa Raghavan Vedanarayanan, Shawn Jeffery, Feras Karablieh, Bhupesh Bansal, Dor Levi, Amit Koren
Contextual workflow triggering on devices

Patent number: 12056413

Abstract: The unique attributes of handheld devices and how they are used—particularly multi-screen devices—are leveraged to define rules for automatically triggering workflows. By monitoring signals from various device sensors, the device can anticipate a user's intention to perform an action, such as capturing a quick thought. A workflow for performing the action (or actions) may be automatically triggered based on rules for evaluating the sensor signals. By anticipating the user's intentions, the device can automatically perform many of the underlying actions behind the scenes, thereby minimizing the actions performed by the user and improving the user experience. In this way, cumbersome, multi-step user inputs and interactions are avoided by anticipating user intentions and automatically triggering workflows.

Type: Grant

Filed: September 25, 2020

Date of Patent: August 6, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Klorida Miraj, Bernd Ingo Plontsch, Shrey Shah, Viktoryia Akulich
Efficient streaming non-recurrent on-device end-to-end model

Patent number: 12051404

Abstract: An ASR model includes a first encoder configured to receive a sequence of acoustic frames and generate a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The ASR model also includes a second encoder configured to receive the first higher order feature representation generated by the first encoder at each of the plurality of output steps and generate a second higher order feature representation for a corresponding first higher order feature frame. The ASR model also includes a decoder configured to receive the second higher order feature representation generated by the second encoder at each of the plurality of output steps and generate a first probability distribution over possible speech recognition hypothesis. The ASR model also includes a language model configured to receive the first probability distribution over possible speech hypothesis and generate a rescored probability distribution.

Type: Grant

Filed: June 16, 2023

Date of Patent: July 30, 2024

Assignee: Google LLC

Inventors: Tara Sainath, Arun Narayanan, Rami Botros, Yanzhang He, Ehsan Variani, Cyril Allauzen, David Rybach, Ruoming Pang, Trevor Strohman
System and method for implicit authentication

Patent number: 12047773

Abstract: A system for implicit authentication for a mobile device associated with a user, wherein the implicit authentication is behavioral, biometric and task-based and includes at least one authentication task selected so as to leverage the user's muscle memory. The mobile device comprises a touchscreen; a transaction authentication information unit; one or more sensors coupled to the transaction authentication information unit; and an anomaly detector coupled to the transaction authentication information unit. The sensors comprise one or more touchscreen sensors coupled to the touchscreen, an accelerometer, and a gyroscope, and are used to obtain and transmit one or more sets of data to the transaction authentication information unit. The sets of data are associated with one or more performances of the authentication task by the user. The anomaly detector generates an authentication model using the one or more data sets transmitted to the transaction authentication information unit.

Type: Grant

Filed: February 7, 2022

Date of Patent: July 23, 2024

Assignee: Zighra Inc.

Inventors: Deepak Chandra Dutt, Anil Buntwal Somayaji, Michael John Kendal Bingham
Time scaler, audio decoder, method and a computer program using a quality control

Patent number: 12020721

Abstract: A time scaler for providing a time scaled version of an input audio signal is configured to compute or estimate a quality of a time scaled version of the input audio signal obtainable by a time scaling of the input audio signal. The time scaler is configured to perform the time scaling of the input audio signal in dependence on the computation or estimation of the quality of the time scaled version of the input audio signal obtainable by the time scaling. An audio decoder has such a time scaler.

Type: Grant

Filed: April 9, 2021

Date of Patent: June 25, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Stefan Reuschl, Stefan Doehla, Jérémie Lecomte, Manuel Jander, Nikolaus Faerber
Contextual resource completion

Patent number: 12008308

Abstract: An example computer-implemented method for contextual prediction of content for a structured resource is provided. The example method includes providing, to an endpoint device, an input interface for inputting natural language content. The example method includes receiving, from the endpoint device via the input interface, an initial input of natural language content. The example method includes receiving context data associated with the endpoint device. The example method includes obtaining, using a machine-learned content generation model, and based on the initial input and the context data, a suggested portion of natural language content. The example method includes providing, to the endpoint device, the suggested portion.

Type: Grant

Filed: March 14, 2023

Date of Patent: June 11, 2024

Assignee: ROCKET RESUME, INC.

Inventor: Stephen William Zimmerman
Fraud detection using emotion-based deep learning model

Patent number: 12008579

Abstract: Techniques are described for determining a likelihood that a customer communication is fraudulent using one or more machine learning models. For example, a computing system includes a memory and one or more processors in communication with the memory. The one or more processors are configured to: receive a set of emotion factor values for communication data of a current communication associated with a customer, wherein each emotion factor value indicates a measure of a particular emotion factor in the current communication; classify, using an emotion variance model running on the one or more processors, the current communication into an emotional fraud category based on the set of emotion factor values for the current communication associated with the customer; and determine a risk score for the current communication indicative of a probability that the current communication is fraudulent based on at least the emotional fraud category for the current communication.

Type: Grant

Filed: August 9, 2021

Date of Patent: June 11, 2024

Assignee: Wells Fargo Bank, N.A.

Inventors: Abhishek Kumar, Dipanjan Deb, Julia A Kosheleva-Coates, Amit Agarwal, Naveen Gururaja Yeri
NLU training with user corrections to engine annotations

Patent number: 11995404

Abstract: Techniques for training a natural language understanding (NLU) engine may include generating a first annotation of free-form text documenting a healthcare patient encounter and a link between the first annotation and a corresponding portion of the text, using the NLU engine. A second annotation of the text and a link between the second annotation and a corresponding portion of the text may be received from a human user. The first annotation and its corresponding link may be merged with the second annotation and its corresponding link. Training data may be provided to the engine in the form of the text and the merged annotations and links.

Type: Grant

Filed: July 14, 2020

Date of Patent: May 28, 2024

Assignee: Microsoft Technology Licensing, LLC.

Inventors: Howard D'Souza, Regina Spitznagel, Debjani Sarkar
Enabling the use of multiple Picture Archiving Communication Systems by one or more facilities on a shared domain

Patent number: 11996180

Abstract: Methods, systems, and computer-storage media are provided for utilizing multiple Picture Archiving Communication Systems (PACS) to view one more medical images by storing one or more PACS at a database within the system. Requests are received from one or more users at one or more facilities to utilize one or more PACS to view one or more medical images. After accessing the database to determine one or more PACS authorized for each facility from which a request is received, one or more users are provided with one or more PACS to view medical images associated with radiological exams and provide the necessary assessments and reports for treatment.

Type: Grant

Filed: April 18, 2022

Date of Patent: May 28, 2024

Assignee: Cerner Innovation, Inc.

Inventors: Kiran Bhojaraja, Deepak Gupta, Vikram Nandwani, Premjit Adhikary, Tania Bhattacharyya, Bobbie Milne
Illicit route viewing system and method of operation

Patent number: 11991057

Abstract: A route viewing system includes a computing system that receives information associated with one or more routes through a network, and identifies the routes that are associated with at least one illicit user computer used by an illicit user. The computing system then obtains a source location of a source address of the routes and a destination location of a destination address of the routes, and displays the routes on a geographical display at the source location of the source address and the destination location of the destination address of each of the routes.

Type: Grant

Filed: May 18, 2023

Date of Patent: May 21, 2024

Assignee: Level 3 Communications, LLC

Inventors: Michael Benjamin, Skyler J. Bingham, John S. Reynolds
Method for generating a voice announcement as feedback to a handwritten user input, corresponding control device, and motor vehicle

Patent number: 11975729

Abstract: A method for generating a voice announcement as feedback to a handwritten user input is disclosed in which a user enters on a control device. A list of possible whole words which can be entered by the user input is provided together with a corresponding transcription and a predetermined word end, which comprises one or more characters of a whole word of the whole words, is removed from the end of said whole word in accordance with a predetermined shortening rule and corresponding to this, a transcription end corresponding to the word end is determined based on a predetermined assignment rule and is removed from the corresponding transcription of the whole word for generating a partial word and an associated partial transcription. The partial word and the partial transcription are added to another list.

Type: Grant

Filed: July 29, 2019

Date of Patent: May 7, 2024

Assignee: AUDI AG

Inventor: Jan Dusik
Cloud-based training and camera correction

Patent number: 11967117

Abstract: A method implemented by a server communicably coupled to at least two devices, each device including camera(s), the devices being present within same real-world environment. The method includes: receiving, from the devices(s), images captured by respective cameras of the devices; identifying one of the devices whose camera has camera parameter(s) better than camera parameter(s) of camera of another of the devices; training neural network using images captured by camera of one of the devices as ground truth material and using images captured by camera of another of the devices as training material; generating correction information to correct images captured by camera of another of the devices using trained neural network; and correcting the images captured by the camera of the another of the device(s) by utilising the correction information at the server, or sending correction information to another of the devices for correcting the images.

Type: Grant

Filed: March 22, 2022

Date of Patent: April 23, 2024

Assignee: Varjo Technologies Oy

Inventor: Mikko Ollila
Cross-assistant command processing

Patent number: 11955112

Abstract: A speech-processing system may provide access to one or more virtual assistants via a voice-controlled device. A user may leverage a first virtual assistant to translate a natural language command from a first language into a second language, which the device can forward to a second virtual assistant for processing. The device may receive a command from a user and send input data representing the command to a first speech-processing system representing the first virtual assistant. The device may receive a response in the form of a first natural language output from the first speech-processing system along with an indication that the first natural language output should be directed to a second speech-processing system representing the second virtual assistant. For example, the command may be in the first language, and the first natural language output may be in the second language, which is understandable by the second speech-processing system.

Type: Grant

Filed: February 5, 2021

Date of Patent: April 9, 2024

Assignee: Amazon Technologies, Inc.

Inventor: Robert John Mars
Regularizing machine learning models

Patent number: 11934956

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage medium, for training a neural network, wherein the neural network is configured to receive an input data item and to process the input data item to generate a respective score for each label in a predetermined set of multiple labels. The method includes actions of obtaining a set of training data that includes a plurality of training items, wherein each training item is associated with a respective label from the predetermined set of multiple labels; and modifying the training data to generate regularizing training data, comprising: for each training item, determining whether to modify the label associated with the training item, and changing the label associated with the training item to a different label from the predetermined set of labels, and training the neural network on the regularizing data.

Type: Grant

Filed: November 30, 2022

Date of Patent: March 19, 2024

Assignee: Google LLC

Inventor: Sergey Ioffe
Automatic speech recognition

Patent number: 11915690

Abstract: A multi-channel transformer acoustic model that processes a plurality of audio signals output by microphones of a microphone array and outputs probabilities for acoustic units of an utterance represented in the audio signals. The audio signals represent the individual microphones' respective capturing of the utterance. The multi-channel model may perform self-attention on embeddings of the audio signals and then cross-channel attention across the attended audio signals. The cross-channel attention may involve processing of signals relative to each other to model the relationships across channels within and across time frames. The multi-channel model may include a transducer to perform processing frame-by-frame.

Type: Grant

Filed: September 29, 2021

Date of Patent: February 27, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Brian King, Siegfried Kunzmann, Maurizio Omologo
Using a generative adversarial network to train a semantic parser of a dialog system

Patent number: 11908460

Abstract: Disclosed herein are techniques for using a generative adversarial network (GAN) to train a semantic parser of a dialog system. A method described herein involves accessing seed data that includes seed tuples. Each seed tuple includes a respective seed utterance and a respective seed logical form corresponding to the respective seed utterance. The method further includes training a semantic parser and a discriminator in a GAN. The semantic parser learns to map utterances to logical forms based on output from the discriminator, and the discriminator learns to recognize authentic logical forms based on output from the semantic parser. The semantic parser may then be integrated into a dialog system.

Type: Grant

Filed: August 13, 2020

Date of Patent: February 20, 2024

Assignee: Oracle International Corporation

Inventors: Thanh Long Duong, Mark Edward Johnson
Systems and methods to obtain feedback in response to autonomous vehicle failure events

Patent number: 11900738

Abstract: The present disclosure provides systems and methods to obtain feedback descriptive of autonomous vehicle failures. In particular, the systems and methods of the present disclosure can detect that a vehicle failure event occurred at an autonomous vehicle and, in response, provide an interactive user interface that enables a human located within the autonomous vehicle to enter feedback that describes the vehicle failure event. Thus, the systems and methods of the present disclosure can actively prompt and/or enable entry of feedback in response to a particular instance of a vehicle failure event, thereby enabling improved and streamlined collection of information about autonomous vehicle failures.

Type: Grant

Filed: January 13, 2023

Date of Patent: February 13, 2024

Assignee: UATC, LLC

Inventors: Molly Castle Nix, Sean Chin, Dennis Zhao
Dual-factor identification system and method with adaptive enrollment

Patent number: 11899765

Abstract: A multi-factor identification system is provided in which enrolled user authentication information is updated in the course of an authorization request based upon at least one of a confidence level of a match between a request first factor identifier, produced based upon first unique user identifying information received with the authentication request, and a respective matching enrolled first factor identifier and a confidence level of a match between a request second factor identifier, produced based upon second unique user identifying information received with the authentication request, and a respective matching enrolled second factor identifier.

Type: Grant

Filed: December 22, 2020

Date of Patent: February 13, 2024

Assignee: DTS Inc.

Inventors: Gadiel Seroussi, Michael M. Goodwin
Generation of text tags from game communication transcripts

Patent number: 11893357

Abstract: Some implementations relate to methods, systems, and computer-readable media to generate text tags for games. In some implementations, a computer-implemented method to generate one or more text tags includes obtaining a plurality of chat transcripts, each chat transcript associated with a respective gameplay session of a respective game of a plurality of games. Each chat transcript includes content provided by participants in the gameplay session. The method further includes programmatically analyzing the plurality of chat transcripts to determine one or more characteristics for each game of the plurality of games, and generating a text tag for at least one game of the plurality of games based on the one or more characteristics of the at least one game.

Type: Grant

Filed: May 7, 2021

Date of Patent: February 6, 2024

Assignee: Roblox Corporation

Inventors: Eric Holmdahl, Nikolaus Sonntag, Aswath Manoharan
Dynamic system response configuration

Patent number: 11887580

Abstract: A natural language processing system may select a synthesized speech quality using user profile data. The system may receive a natural language input and determine responsive output data. The system may, based at least in part on user profile data associated with the input, determine response configuration data corresponding to a quality of synthesized speech. The system may then determine further output data for presentation using the responsive output data and response configuration data.

Type: Grant

Filed: January 4, 2023

Date of Patent: January 30, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Anthony Bissell, Janet Slifka
Sentiment progression analysis

Patent number: 11886824

Abstract: Various embodiments of the present disclosure performing conversation sentiment monitoring for a conversation data object. In various embodiments, a text block that can be resized is identified within a conversation data object and successive regularized sentiment profile generation iterations are performed until a regularized sentiment score of the block exceeds a regularized sentiment score threshold. A current regularized sentiment profile generation iteration involves determining a regularized sentiment score for the block based on an initial sentiment score, a subjectivity probability value, and, optionally, a stage-wise penalty factor. A determination is then made as to whether the score exceeds the threshold. If so, then a regularized sentiment profile of the conversation data object is updated based on the regularized sentiment score. If not, then the text block is resized and a subsequent regularized sentiment profile generation iteration is performed based on the resized block.

Type: Grant

Filed: January 28, 2022

Date of Patent: January 30, 2024

Assignee: Optum Technology, Inc.

Inventors: Ninad D. Sathaye, Raghav Bali, Piyush Gupta, Krishnamohan Nandiraju
Intelligent commissioning of building automation controllers

Patent number: 11874011

Abstract: Systems/methods for intelligent commissioning of an HVAC system provide a control node and at least a first network node coupled to communicate with the control node, the first network node configured to retrieve via a user interface objects configured at the control node, configure at least a second network node using the retrieved objects, and report the configuration of the second network node at the control node. A user interface of a first network node can access the objects at the control node. The first network node can apply the accessed objects to configure a second network node using a commissioning tool. The commissioning tool can be activated specifically for certain authorized HVAC personas or roles. The first network node can report the configuring at the control node. The commissioning tool can be voice-enabled to allow a single user to configure the HVAC system via voice commands.

Type: Grant

Filed: January 18, 2019

Date of Patent: January 16, 2024

Assignee: Schneider Electric Buildings Americas, Inc.

Inventors: Babak Haghayeghi, Kevin Sweeney, Shawn Lambert, David Keefer, David Shike
Digital twin enabled equipment diagnostics based on acoustic modeling

Patent number: 11874200

Abstract: In an approach to digital twin enabled equipment diagnostics based on acoustic modeling, a real-time audio input of an asset is received from a mobile device. The real-time audio input is analyzed using one or more acoustic modeling algorithms to establish a deviation from a baseline, where the baseline is associated with a digital twin of the asset. Responsive to determining the deviation from the baseline exceeds a predetermined threshold, the user is iteratively directed to move the mobile device until a stopping criteria is met.

Type: Grant

Filed: September 8, 2020

Date of Patent: January 16, 2024

Assignee: International Business Machines Corporation

Inventors: John Kaufmann, Borja Canseco, Adriel Ricardo Estrada
Automated clinical documentation system and method

Patent number: 11853691

Abstract: A method, computer program product, and computing system for synchronizing machine vision and audio is executed on a computing device and includes obtaining encounter information of a patient encounter, wherein the encounter information includes machine vision encounter information and audio encounter information. The machine vision encounter information and the audio encounter information are temporally-aligned to produce a temporarily-aligned encounter recording.

Type: Grant

Filed: March 23, 2021

Date of Patent: December 26, 2023

Assignee: Nuance Communications, Inc.

Inventors: Donald E. Owen, Uwe Helmut Jost, Daniel Paulino Almendro Barreda, Dushyant Sharma
Dynamic remediation of pluggable streaming devices

Patent number: 11856040

Abstract: The present disclosure describes a system and method for providing dynamic remediation of a pluggable streaming device issue, such as a customer premises equipment (CPE) device. Sometimes, various features of the CPE device to begin to fail. For example, synchronization of the audio and video streams may drift, media rental purchases may time out, or playback may throttle to low quality. Such failures can be caused by device or network issues. The present disclosure describes a CPE remediation system that operates to identify a failure associated with playing media streamed by the CPE device. The CPE remediation system may further determine a solution to remediate an observed CPE device-related failure. In some examples, the CPE remediation process may further provide or perform one or more actions included in the determined solution. In some examples, the solution may include a warm or a cold reboot.

Type: Grant

Filed: June 6, 2023

Date of Patent: December 26, 2023

Assignee: CenturyLink Intellectual Property LLC

Inventors: John R. B. Woodworth, Dean Ballew
Virtual counseling system and counseling method using the same

Patent number: 11837251

Abstract: The present disclosure relates to a virtual counseling system in which a user can virtually receive counseling by inputting query information into a system. A virtual counseling system according to an embodiment of the present disclosure may include an input unit obtaining audio information from a user and generating audio data; a determination unit receiving the audio data through the input unit, determining a type of the audio data, and generating type information on the audio data; and a text data generation unit generating object data by receiving the type information from the determination unit, converting content of the audio data into first text data, and combining the object data and the first text data to generate second text data.

Type: Grant

Filed: March 25, 2021

Date of Patent: December 5, 2023

Assignee: SOLUGATE INC.

Inventor: Sung Tae Min
Voice recognition system and voice recognition method

Patent number: 11830498

Abstract: A voice recognition method includes the following steps. An audio and a correct result are received. The audio is recognized, and a text file corresponding to the audio is output. The word error rate is determined by comparing the text file to the correct result. The word error rate is adjusted according to the weight of at least one important word, in order to calculate a professional score that corresponds to the text file. A determination is made as to whether the professional score is higher than a score threshold. In response to the professional score is higher than the score threshold, the text file, the audio, or the correct result corresponding to the professional score is sent to an engine training module for training.

Type: Grant

Filed: August 11, 2021

Date of Patent: November 28, 2023

Assignee: Wistron Corp.

Inventor: Zheng-De Liu

1 2 3 4 5 … next