Wherein The Metadata Is A Transcript Of The Audio Data (epo) Patents (Class 707/E17.103)

Computing systems for rapidly collecting digital witness statements and efficiently correcting transcription errors

Patent number: 12230276

Abstract: Systems, methods, and devices disclosed herein can capture an audio recording of an utterance, generate a transcription based on the audio recording, and generate a score for a section of the transcription that reflects a level of confidence that at least one word in the section was correctly transcribed. Content of the section is rendered in a field on a display. Also, a timeline for the audio recording is rendered. If the score does not satisfy a condition, the fill scheme applied to a segment of the timeline that maps to the section may differ from a fill scheme that is applied to the remainder of the timeline. An additional audio recording is then captured and transcribed. An additional timeline is rendered for the additional audio recording alongside the timeline and is aligned with the section. The transcription of the additional audio recording is used to replace the section.
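As a rough illustration of the flow in this abstract, the sketch below scores transcript sections, gives low-scoring spans a different timeline fill, and replaces a section with a re-recorded transcription. The `Section` type, the 0-to-1 score scale, the 0.8 threshold, and the fill names are all assumptions made for the example; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class Section:
    start: float  # seconds into the recording
    end: float
    text: str
    score: float  # confidence in [0, 1]; hypothetical scale


THRESHOLD = 0.8  # assumed condition: a score below this is flagged


def timeline_fills(sections, threshold=THRESHOLD):
    """Return (start, end, fill) triples; low-confidence spans get a distinct fill."""
    return [
        (s.start, s.end, "normal" if s.score >= threshold else "flagged")
        for s in sections
    ]


def replace_section(sections, index, new_text, new_score):
    """Return a copy in which one section holds the re-recorded transcription."""
    old = sections[index]
    updated = list(sections)
    updated[index] = Section(old.start, old.end, new_text, new_score)
    return updated


sections = [
    Section(0.0, 2.5, "I saw a red car", 0.95),
    Section(2.5, 5.0, "turn lift on Main Street", 0.42),
]
fills = timeline_fills(sections)
corrected = replace_section(sections, 1, "turn left on Main Street", 0.93)
```

Here the second section falls below the assumed threshold, so its timeline segment is flagged until the replacement transcription is applied.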

Type: Grant

Filed: April 6, 2022

Date of Patent: February 18, 2025

Assignee: MOTOROLA SOLUTIONS, INC.

Inventors: Szymon Sikora, Jacek Doniec, Miroslaw Kawa, Artur Ziajko

Ingestion and segmentation of real-time event data

Patent number: 12210850

Abstract: In some implementations, the techniques may include receiving event data as a stream of event instances. Each received event instance is associated with an entity and a capacity change. In addition, the techniques may include identifying an event time for each received event instance. The techniques may include sorting the received event instances into sets of instances. Each set of instances can be associated with a respective entity. Moreover, the techniques may include segmenting each set of instances into subsets of instances based on the event time for each event instance of the set of instances. Each of the plurality of subsets of instances can correspond to a time period. Also, the techniques may include storing each segmented set of instances as stored event data. Further, the techniques may include performing one or more operations with respect to the stored event data.
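The sorting and segmenting steps described above can be sketched as follows. The dictionary keys (`entity`, `capacity_change`, `event_time`), the one-hour period, and the `net_change` operation are hypothetical choices for illustration, not details taken from the patent.

```python
from collections import defaultdict


def segment_events(events, period_seconds=3600):
    """Group events by entity, then bucket each group by time period.

    Each event is a dict with assumed keys: entity, capacity_change,
    and event_time (seconds since some epoch).
    """
    by_entity = defaultdict(lambda: defaultdict(list))
    for ev in sorted(events, key=lambda e: e["event_time"]):
        bucket = ev["event_time"] // period_seconds
        by_entity[ev["entity"]][bucket].append(ev)
    # Convert to plain dicts so the stored data has no default-creating behavior.
    return {entity: dict(buckets) for entity, buckets in by_entity.items()}


def net_change(stored, entity, bucket):
    """Example operation on stored data: total capacity change in one period."""
    return sum(ev["capacity_change"] for ev in stored.get(entity, {}).get(bucket, []))


events = [
    {"entity": "A", "capacity_change": 5, "event_time": 7200},
    {"entity": "A", "capacity_change": -2, "event_time": 100},
    {"entity": "B", "capacity_change": 1, "event_time": 3599},
    {"entity": "A", "capacity_change": 3, "event_time": 3500},
]
stored = segment_events(events)
```

Events arrive out of order in the example, so they are sorted by event time before being bucketed.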

Type: Grant

Filed: July 10, 2024

Date of Patent: January 28, 2025

Assignee: THE HUNTINGTON NATIONAL BANK

Inventors: Andrew Hopkins, Raghu Mundru, Steven Hittle

Methods and systems to provide a playlist for simultaneous presentation of a plurality of media assets

Patent number: 12184929

Abstract: Systems and methods are described herein for generating a playlist for a simultaneous presentation of a plurality of media assets. The system retrieves a user preference associated with a user profile and receives a selection of a first media asset and a second media asset from the plurality of media assets for presentation on a user device. The system parses the respective audio streams of the first media asset and the second media asset to identify one or more preferred audio segments based on the user preference and generates the playlist of the identified one or more preferred audio segments. Based on a generated audio playlist, the system generates, for presentation on the user device, the video stream for each of the first media asset and the second media asset and the playlist of the identified one or more preferred audio segments.

Type: Grant

Filed: September 17, 2021

Date of Patent: December 31, 2024

Assignee: Adeia Guides Inc.

Inventors: Harshavardhan Reddy Kalathuru, Padmassri Chandrashekar, Jayshil Parekh, Daina Emmanuel, Ramesh Arsam, Santhiya Krishnamoorthi, Vaibhav Gupta, Ashish Gupta, Senthil Kumar Karuppasamy, Anil Kumar, Reda Harb

Retroactive recording of a meeting

Patent number: 12112777

Abstract: Systems and methods for recording a meeting using a retroactive record feature. The present technology provides for improved systems and methods for providing a recording of a virtual meeting, where a selection to initiate the recording from the beginning or an earlier time in the meeting from a current time may be received after the virtual meeting has started. The system may process received meeting content streams to generate a plurality of data segments that may collectively form a meeting recording. Each data segment, for example, may include meeting content associated with a particular user/attendee and associated with a timestamp and/or time duration. In some examples, the plurality of data segments may be stored on a blockchain, which may provide an immutable meeting record that may be concatenated together and made available for playback based on a selection to record the meeting and consent given by the users/attendees.
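One way to picture the retroactive record feature is a buffer of timestamped per-attendee segments that is filtered and concatenated once recording is requested. The sketch below assumes a hypothetical `DataSegment` type and a simple consent set; it does not model the blockchain storage mentioned in the abstract.

```python
from dataclasses import dataclass


@dataclass
class DataSegment:
    attendee: str
    timestamp: float  # seconds since the meeting started
    duration: float
    content: bytes


def retroactive_recording(segments, record_from, consenting):
    """Concatenate buffered segments from `record_from` onward.

    Only segments from attendees in the `consenting` set are included.
    """
    chosen = [
        s for s in segments
        if s.timestamp >= record_from and s.attendee in consenting
    ]
    chosen.sort(key=lambda s: s.timestamp)
    return b"".join(s.content for s in chosen)


segments = [
    DataSegment("alice", 0.0, 5.0, b"A0"),
    DataSegment("bob", 5.0, 5.0, b"B1"),
    DataSegment("alice", 10.0, 5.0, b"A2"),
    DataSegment("carol", 12.0, 3.0, b"C3"),
]
recording = retroactive_recording(segments, 5.0, {"alice", "bob"})
```

Because segments are kept from the start of the meeting, the same call with `record_from=0.0` would produce a recording of the whole meeting.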

Type: Grant

Filed: June 2, 2023

Date of Patent: October 8, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventor: Dhirendra Kumar Bhupati

Multi-modal image search

Patent number: 12093310

Abstract: The present invention relates to methods for searching for two-dimensional or three-dimensional objects. More particularly, the present invention relates to searching for two-dimensional or three-dimensional objects in a collection by using a multi-modal query of image and/or tag data. Aspects and/or embodiments seek to provide a method of searching for digital objects using any combination of images, three-dimensional shapes and text by embedding the vector representations for these multiple modes in the same space. Aspects and/or embodiments can be easily extensible to any other type of modality, making it more general.
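A minimal sketch of searching in a shared embedding space is shown below. It assumes the image, shape, and text encoders already exist and produce vectors of the same dimension; the three-dimensional toy vectors, the averaging of query vectors, and the cosine ranking are illustrative choices, not the patent's stated method.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def combine(*vectors):
    """Average query embeddings from several modalities (an assumed fusion rule)."""
    n = len(vectors)
    return [sum(vals) / n for vals in zip(*vectors)]


def search(query_vec, collection, top_k=2):
    """Rank objects in the collection by similarity to the query vector."""
    ranked = sorted(
        collection.items(),
        key=lambda kv: cosine(query_vec, kv[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:top_k]]


# Toy embeddings; real ones would come from trained encoders.
collection = {
    "chair_3d": [0.9, 0.1, 0.0],
    "table_3d": [0.1, 0.9, 0.0],
    "lamp_2d": [0.0, 0.1, 0.9],
}
image_vec = [1.0, 0.0, 0.0]
tag_vec = [0.8, 0.2, 0.0]
results = search(combine(image_vec, tag_vec), collection)
```

Since every modality maps into the same space, adding another modality only requires another encoder that outputs vectors of the same dimension.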

Type: Grant

Filed: March 7, 2018

Date of Patent: September 17, 2024

Assignee: STREEM, LLC

Inventors: Flora Ponjou Tasse, Ghislain Fouodji Tasse

Audio classification system

Patent number: 11978473

Abstract: A system includes a computer including a processor and a memory. The memory includes instructions such that the processor is programmed to receive an audio input representing a percussion performed by a user and classify, at a trained neural network, the audio input as a particular musical type.

Type: Grant

Filed: January 18, 2022

Date of Patent: May 7, 2024

Assignee: Bace Technologies LLC

Inventors: Christopher Samuels, Ghazaleh Jowkar, Mohammadbagher Fotouhi, Anita Garic, Ivan Vican

Creating data shapes for pattern recognition systems

Patent number: 11967142

Abstract: Methods, apparatuses and systems directed to pattern identification and pattern recognition. In some particular implementations, the invention provides a flexible pattern recognition platform including pattern recognition engines that can be dynamically adjusted to implement specific pattern recognition configurations for individual pattern recognition applications. In some implementations, the present invention also provides for a partition configuration where knowledge elements can be grouped and pattern recognition operations can be individually configured and arranged to allow for multi-level pattern recognition schemes.

Type: Grant

Filed: August 13, 2020

Date of Patent: April 23, 2024

Assignee: DataShapes, Inc.

Inventor: Jeffrey Brian Adams

User access to meeting recordings

Patent number: 11874942

Abstract: A method including receiving a request to access a meeting record from a user is provided. The meeting record may indicate at least one meeting participant, an audio/video recording and a presentation from one of the participants in the meeting. The method includes verifying an access privilege of the user for the meeting record, providing the meeting record to the user, for playback of a selected portion, and providing, in the meeting record, a selecting tool to the user, for playing the selected portion, wherein the selecting tool is configured to playback the selected portion for one of multiple participants in the meeting.

Type: Grant

Filed: December 28, 2022

Date of Patent: January 16, 2024

Assignee: Fuze, Inc.

Inventors: Luke Surazski, Elias Sardonis, Jedidiah Brown

Multi-portion spoken command framework

Patent number: 11837225

Abstract: A framework for efficiently importing content into a speech-controlled system in a manner that makes the content easily accessible using voice commands. A speech-controlled system that can be controlled using a variety of commands, including a command to retrieve audio content, can be configured using a framework of content organization that allows new content to be ingested using the framework, thus making the new content accessible to users of the system without manually adjusting the system to recognize when incoming commands call for the new content. The framework can include configured content demarcations (such as information demarcations that divide content into articles, or other sized portions), labels for those demarcations (such as topic descriptors or the like), etc.

Type: Grant

Filed: February 1, 2021

Date of Patent: December 5, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Chase Brown, Christopher Wheeler, Kevin Bedell

Creating compact example sets for intent classification

Patent number: 11748393

Abstract: Embodiments for creating compact example subsets for intent classification in a conversational system are provided. A set of content used for training an intent classifier is received from a conversational corpus. Entries within the set of content are separated into a first subset and a second subset, and a cross-validation operation is performed on the first and second subsets to identify a correctly labeled portion and an incorrectly labeled portion of the set of content. A reduced content used for performing a final training of the intent classifier is formed by combining a first number of the entries from the correctly labeled portion and a second number of the entries from the incorrectly labeled portion of the set of content.

Type: Grant

Filed: November 28, 2018

Date of Patent: September 5, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Abhishek Shah, Tin Kam Ho

Interactive meeting recordings

Patent number: 11681819

Abstract: A method including receiving a request to access a meeting record from a user is provided. The meeting record includes an identification for at least one participant in a meeting, an audio recording for the at least one participant, a video recording for the at least one participant, and a presentation from one of the participants in the meeting. The method includes verifying an access privilege of the user for the meeting record, providing the meeting record to the user, for playback of a selected portion, and providing, in the meeting record, a selecting tool to the user, for playing the selected portion, wherein the selecting tool is configured to playback the selected portion for one of multiple participants in the meeting.

Type: Grant

Filed: August 1, 2019

Date of Patent: June 20, 2023

Assignee: 8x8, Inc.

Inventors: Luke Surazski, Elias Sardonis, Jedidiah Brown

Semantic content clustering based on user interactions

Patent number: 11599728

Abstract: Various embodiments of an apparatus, methods, systems and computer program products described herein are directed to a Topic Engine. The Topic Engine captures a plurality of content identifier sequences. Each respective sequence represents an order at which a corresponding user account accessed content. The Topic Engine generates a plurality of clusters. Each cluster is associated with respective content identifiers appearing within a proximity to each other across the plurality of content identifier sequences of different user accounts. The Topic Engine obtains one or more sample content identifiers from at least one cluster via sampling the cluster. The Topic Engine extracts keywords from content represented by the one or more sampled content identifiers. The Topic Engine identifies a topic for the cluster based on the one or more extracted keywords.

Type: Grant

Filed: March 7, 2022

Date of Patent: March 7, 2023

Assignee: Scribd, Inc.

Inventors: Matthew Allen Strong Ross, Monique Alves Cruz

Equalizer for equalization of music signals and methods for the same

Patent number: 11515853

Abstract: An equalizer and a method of controlling same are provided. The equalizer includes a memory storing an EQ value set for a plurality of music attributes and storing a general-purpose EQ value; and a processor configured to: obtain an input music signal; calculate a plurality of probability values for the plurality of music attributes by analyzing attributes of the input music signal based on a convolutional neural network; calculate a moderate index between the plurality of probability values; generate an EQ value based on the plurality of probability values and the moderate index; and perform equalizing by applying the generated EQ value to the input music signal.

Type: Grant

Filed: November 4, 2020

Date of Patent: November 29, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Kibeom Kim, Hoon Heo, Sangmo Son, Sunmin Kim, Jaeyoun Cho, Shukjae Choi

Image display apparatus and operation method of the same

Patent number: 11514107

Abstract: Method and apparatus for obtaining audio corresponding to a plurality of images, based on semantic information and the emotion information of the plurality of images.

Type: Grant

Filed: September 4, 2019

Date of Patent: November 29, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Anant Baijal, Daeeun Hyun, Mijeong Kwon

Information recording system and information recording method

Patent number: 11036775

Abstract: In an information recording system, a sound processing unit generates a conversion candidate word in a process of converting sound information into text information. A recording unit records the text information and the conversion candidate word on a recording medium such that the text information and the conversion candidate word are associated with each other. A search unit performs a search based on a keyword and extracts a word matching the keyword from words within the text information and the conversion candidate word. A reading unit reads the text information including the word matching the keyword from the recording medium. A display unit displays the text information such that a part corresponding to the word matching the keyword and a part other than the corresponding part are able to be distinguished.
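The search behavior described above, where a keyword can match either the recognized word or one of its stored conversion candidates, might look like the following sketch. The record layout (a list of `(word, candidates)` pairs) and the bracket highlighting are assumptions made for illustration.

```python
def search_records(records, keyword):
    """Return (record, matched_positions) for records that match the keyword.

    A position matches if the keyword equals the recognized word or
    appears among that word's conversion candidates.
    """
    hits = []
    for record in records:
        matched = [
            i for i, (word, candidates) in enumerate(record)
            if keyword == word or keyword in candidates
        ]
        if matched:
            hits.append((record, matched))
    return hits


def render(record, matched):
    """Mark matching positions with brackets so they stand out from the rest."""
    return " ".join(
        f"[{word}]" if i in matched else word
        for i, (word, _) in enumerate(record)
    )


# Each record is a list of (recognized word, conversion candidates) pairs.
records = [
    [("the", []), ("sell", ["cell", "cel"]), ("divided", [])],
    [("no", ["know"]), ("growth", [])],
]
hits = search_records(records, "cell")
display = [render(record, matched) for record, matched in hits]
```

Searching for "cell" finds the first record even though the recognizer chose "sell", because "cell" was kept as a candidate alongside the text.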

Type: Grant

Filed: July 12, 2019

Date of Patent: June 15, 2021

Assignee: OLYMPUS CORPORATION

Inventor: Seiji Tatsuta

Reproducing apparatus and method, information processing apparatus and method, recording medium, and program

Patent number: 8095527

Abstract: The present invention is intended to automatically construct a database of contents data which are distributed over plural reproducing apparatuses and search this database on the basis of user's fragmentary memory. A contents sharing management system practiced as one embodiment of the invention comprises an episode server installed at user's home and plural reproducing apparatuses including a component stereo set, portable player, portable wireless terminal, and MD player, which are interconnected in a wireless manner based on wireless communication technologies such as Bluetooth. The episode server wirelessly connects to the portable player for example to get the episode information stored therein and organizes the retrieved episode information into a database. The episode server also searches the database upon request from the portable player to identify a source apparatus in which desired contents data are stored and supplies the retrieved contents data to the requesting portable player.

Type: Grant

Filed: October 7, 2008

Date of Patent: January 10, 2012

Assignee: Sony Corporation

Inventors: Noriyuki Yamamoto, Kazunori Ohmura

MULTIMEDIA ACCESS

Publication number: 20100274667

Abstract: A computer-implemented method provides access to multimedia content, which includes units of content that include audio components. Meta data for the units of content is formed to an association of key phrases detected in the audio components and the units. In some examples, forming the meta data includes determining a candidate set of key phrases associated with the unit of multimedia and searching for the presence of the candidate key phrases in the audio components. Forming the meta data then includes forming data representing the presence of key phrases in the audio components.

Type: Application

Filed: April 24, 2009

Publication date: October 28, 2010

Applicant: Nexidia Inc.

Inventors: Drew Lanham, Marsal Gavalda, John Willcutts, Gordon Edwards

MUSIC SEARCH METHOD BASED ON QUERYING MUSICAL PIECE INFORMATION

Publication number: 20100125582

Abstract: A method for searching music based on music segment information inquiry comprises: a) analyzing certain music or song to obtain music rhythm and note information of any segment, and converting it to digital data as a basis for searching the music or the song after quantification; b) storing indexes of any segment of music rhythm and note information for the music or song in a database; c) taking the inquiry requirement as a basis for searching and comparing to find the required music or song. The advantage of the invention is that music can be searched via a segment of a melody or song without knowing text information such as the music name or singer, which greatly extends the flexibility of music searching, so that the subscriber's requirements for music searching are satisfied and fuzzy searching is achieved. When searching and comparing, the matching degree between the music rhythm and note information in a segment and in the index database may be configured to improve the search hit rate or search accuracy.
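Steps a) through c) can be sketched with a simple n-gram index over note sequences. This is only one possible reading: the example uses MIDI pitch numbers, ignores rhythm, and treats the configurable matching degree as the fraction of query n-grams found in a song, all of which are assumptions rather than details from the application.

```python
from collections import defaultdict

N = 3  # hypothetical n-gram length, in notes


def ngrams(notes, n=N):
    """Split a note sequence into overlapping n-note segments."""
    return [tuple(notes[i:i + n]) for i in range(len(notes) - n + 1)]


def build_index(songs):
    """Step b): map each n-note segment to the songs that contain it."""
    index = defaultdict(set)
    for title, notes in songs.items():
        for gram in ngrams(notes):
            index[gram].add(title)
    return index


def search(index, query_notes, min_match=0.5):
    """Step c): return songs whose share of matching query segments meets min_match."""
    grams = ngrams(query_notes)
    if not grams:
        return []
    counts = defaultdict(int)
    for gram in grams:
        for title in index.get(gram, ()):
            counts[title] += 1
    scored = [(count / len(grams), title) for title, count in counts.items()]
    return [title for score, title in sorted(scored, reverse=True) if score >= min_match]


# Step a) is assumed done: melodies are already quantified as MIDI pitches.
songs = {
    "Ode to Joy": [64, 64, 65, 67, 67, 65, 64, 62],
    "Twinkle": [60, 60, 67, 67, 69, 69, 67],
}
index = build_index(songs)
exact = search(index, [64, 65, 67, 67])
fuzzy = search(index, [64, 65, 67, 60])
```

Lowering `min_match` tolerates more wrong notes in the query, while raising it trades recall for accuracy.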

Type: Application

Filed: January 8, 2008

Publication date: May 20, 2010

Inventors: Wenqi Zhang, Di Fan, Weimin Cheng

Systems and methods for conversation enhancement

Publication number: 20090306981

Abstract: This invention description details systems and methods for improving human conversations by enhancing conversation participants' ability to:
- Distill out and record core ideas of conversations.
- Classify and prioritize these key concepts.
- Recollect commitments and issues and take appropriate action.
- Analyze and uncover new insight from the linkage of these ideas with those from other conversations.

Type: Application

Filed: April 22, 2009

Publication date: December 10, 2009

Inventors: Mark Cromack, Robert Dolan, Andreas Wittenstein, David Brahm

MUSIC ANALYSIS

Publication number: 20090306797

Abstract: There is disclosed an analyser (101) for building a transcription model (112; 500) using a training database (111) of music. The analyser (101) decomposes the training music (111) into sound events (201a-e) and, in one embodiment, allocates the sound events to leaf nodes (504a-h) of a tree (500). There is also disclosed a transcriber (102) for transcribing music (121) into a transcript (113). The transcript (113) is a sequence of symbols that represents the music (121), where each symbol is associated with a sound event in the music (121) being transcribed. In one embodiment, the transcriber (102) associates each of the sound events (201a-e) in the music (121) with a leaf node (504a-h) of a tree (500); in this embodiment the transcript (113) is a list of the leaf nodes (504a-h). The transcript (113) preserves information regarding the sequence of the sound events (201a-e) in the music (121) being transcribed.

Type: Application

Filed: September 8, 2006

Publication date: December 10, 2009

Inventors: Stephen Cox, Kris West

Remote Consultation System and Method

Publication number: 20090259492

Abstract: A computer-implemented method and system for confirming that a remote consultation between a professional and a client occurred, including monitoring a remote consultation call between a professional and a client using a remote consultation system, storing information of the consultation, and confirming that the consultation took place using the stored information. The consultant can be a healthcare professional such as a doctor, and the client can be a patient.

Type: Application

Filed: April 8, 2009

Publication date: October 15, 2009

Applicant: Strategic Medical, LLC

Inventor: Peter J. Cossman

APPARATUS AND METHOD FOR PREDICTING CUSTOMER BEHAVIOR

Publication number: 20090222313

Abstract: A predictive model generator that enhances customer experience, reduces the cost of servicing a customer, and prevents customer attrition by predicting the appropriate interaction channel through analysis of different types of data and filtering of irrelevant data. The model includes a customer interaction data engine for transforming data into a proper format for storage, data warehouse for receiving data from a variety of sources, and a predictive engine for analyzing the data and building models.

Type: Application

Filed: February 24, 2009

Publication date: September 3, 2009

Inventors: Pallipuram V. Kannan, Mohit Jain, Ravi Vijayaraghavan

System and Method for Using Content Features and Metadata of Digital Images to Find Related Audio Accompaniment

Publication number: 20080256100

Abstract: A system (300), apparatus (200) and method (100) are provided to automatically play/suggest at least one audio accompaniment while a sequence of at least one digital image is being displayed such that the audio accompaniment matches the content of the particular sequence of images and matches any provided and/or generated image metadata. Search terms are derived from the images themselves as well as any metadata provided by the user and these search terms are then used to find audio accompaniment that either (1) contains these search terms or synonyms thereof in the image or associated text (e.g., song text) or (2) represents the sound normally associated with the images, such as rushing water sound for an image of a fast flowing brook. The invention accepts user input, locates appropriate audio accompaniment as search results and presents these results to the user either by playing the audio accompaniment while displaying the images or by suggesting a playlist to the user compiled from these results.

Type: Application

Filed: November 15, 2006

Publication date: October 16, 2008

Applicant: KONINKLIJKE PHILIPS ELECTRONICS, N.V.

Inventors: Bartel Marinus van de Sluis, Wilhelmus Franciscus Johannes Fontijn, Mark Verberkt, Koen Hendrik Johan Vrielink, Albert M.A. Rijckaert