Abstract: Systems, methods, and computer program products are provided for generating visual representations for use in audio conversations. For example, a method comprises receiving user information associated with a first user; receiving visual representation information input by the first user, wherein the visual representation information comprises a first feature, wherein the visual representation information further comprises a second feature distinct from the first feature, wherein the first feature comprises a facial feature; generating a visual representation based on the visual representation information, wherein the visual representation is presented to a second user during an audio conversation between the first user and a second user, wherein at least one of the first feature or second feature changes form when the first user speaks during the audio conversation, and wherein both the first feature and the second feature remain static when the second user speaks during the audio conversation.
Type:
Grant
Filed:
August 26, 2020
Date of Patent:
August 24, 2021
Assignee:
Stereo App Limited
Inventors:
Artur Nugumanov, Sergey Frolovichev, Andrey Ogandzhanyants
Abstract: A transmission of a representation of an endpoint is disclosed. A performance of a source media is detected in a transmission of the representation of the endpoint. The detected performance of the source media is replaced with the source media in the image during transmission.
Type:
Grant
Filed:
July 26, 2016
Date of Patent:
August 17, 2021
Assignee:
Hewlett-Packard Development Company, L.P.
Abstract: An electronic device and an operation method of the electronic device according to various embodiments may include; a communication module comprising communication circuitry configured to transmit and/or receive data using a call channel established via a call connection to an external electronic device; and a processor, wherein the processor is configured to control the electronic device to: transmit a call connection request message to establish a call channel between an external electronic device and the electronic device; receive a response message from the external electronic device; based on capability information of the external electronic device related to the call connection and included in the response message, determine whether to pre-process a content transmitted via the call connection using a transmission filter configured to change a quality of the content; transmit, to the external electronic device, a call connection confirmation message indicating whether to pre-process the content; and cont
Type:
Grant
Filed:
September 28, 2020
Date of Patent:
August 17, 2021
Assignee:
Samsung Electronics Co., Ltd.
Inventors:
Hoonjae Lee, Yongtae Kim, Jeongyong Kim, Hyonmyong Cho, Taewon Do, Hyeyoung Jun
Abstract: A system or method of prioritizing participants in a virtual meeting via a network by scoring each participant on several criteria and weighting the criteria to determine a readiness score. The system of accesses, for each of a plurality of participants, network data, video data, audio data, processing data, and participation data. The system determines, for each of the plurality of participants, a signal strength score based on the corresponding network data, a background score based on the corresponding video data and audio data, a microphone proximity score based on the corresponding video data, a processing score for each of the plurality of participants based on the corresponding processing data, and an engagement score based on the corresponding video data and participation data.
Abstract: A method of playing music includes providing a playing pool, where the playing pool includes a plurality of playlists, each playlist includes at least one piece of music, and each piece of music includes at least one attribute; comparing all music in any two playlists, and if at least one attribute of any two pieces of music is determined to be identical, defining the music as pairing music, where the pairing music is not music the same playlist; and playing the music in one of the plurality of playlists, and when a playing sequence comes to the pairing music, playing the music according to the pairing music.
Abstract: A computer network for facilitating engagement between consumers present at a premises and agents is disclosed. The network comprises touchscreen computers with cameras and configured to communicate with an agent computer and send a continuous uplink video stream to the agent computer. Activation of a button sends a notification to the agent computer comprising a camera and configured to continuously and simultaneously display multiple uplink video streams from the touchscreen computers and receive the notification of activation. The agent computer displays a graphical indication of the notification of activation associated with the video stream received from that touchscreen computer and detection of a selection captures a video stream by the camera of the agent computer and establishes a video channel with the touchscreen computer to send the captured video stream as a downlink video stream from the agent computer to that selected touchscreen computer.
Abstract: A method for video calling comprises, at a server computing system, receiving a plurality of segmented participant video streams from a plurality of client computing devices, each segmented participant video stream depicting a different human participant participating in a video call. One or more priority parameters for each of the plurality of human participants are recognized. One or more human participants are ranked based on a cumulative participant priority for each of the plurality of human participants. The plurality of segmented participant video streams are composited into a virtual conference view that displays each of the ranked one or more human participants at a virtual position based on their cumulative participant priority, such that human participants having higher cumulative participant priorities are displayed more prominently than human participants having lower cumulative participant priorities. The virtual conference view is sent to the plurality of client computing devices.
Abstract: Systems, methods, and software to provide intelligent detection and automatic correction of erroneous audio settings in a video conference. Electronic conferences can often be the source of frustration and wasted resources as participants may be forced to contend with extraneous sounds, such as background/ambient noises, or conversations not intended for the conference, provided by an endpoint that should be muted. Similarly, participants may speak with the intention of providing their speech to the conference while their associated endpoint is muted. As a result, the conference may be awkward and lack a productive flow while endpoints are erroneously muted or non-muted. By intelligently processing at least the video portion of a video conference, endpoints/participants may be prompted to mute/unmute or automatically muted/unmuted.
Type:
Grant
Filed:
August 20, 2020
Date of Patent:
August 3, 2021
Assignee:
Avaya Management L.P.
Inventors:
David Chavez, Pushkar Yashavant Deole, Sandesh Chopdekar, Navin Daga
Abstract: Systems and methods are provided for generating and rendering an enhanced audiovisual recording of a user, which may be used for multiuser communication, e.g., in Virtual Reality. Such an enhanced recording may be generated by determining a face orientation of the user in the audiovisual recording, and generating orientation data specifying an orientation which represents said determined face orientation. During rendering, the audio data may be rendered based on the orientation data, namely by rendering the audio data as a spatial audio source having a spatial direction which is congruent with the face orientation of the user in a visual representation of the user. Accordingly, the spatial direction of the voice of the user may better match the user's face direction in the user's visual representation.
Type:
Grant
Filed:
December 19, 2018
Date of Patent:
August 3, 2021
Assignees:
KONINKLIJKE KPN N.V., NEDERLANDSE ORGANISATIE VOOR TOEGEPAST-NATUURWETENSCHAPPELIJK ONDERZOEK TNO
Inventors:
Martin Prins, Hans Maarten Stokking, Simon Norbert Bernhard Gunkel, Hendrikus Nathaniƫl Hindriks
Abstract: Systems and methods for altering communications captured by an incident recording device are provided. An incident recording may be captured by a recording device. The incident recording may comprise audio data. A communication activation signal may be detected by the recording device. The communication activation signal may be followed by communication audio data and the communication audio data may be captured in the audio data. Based on detecting the communication activation signal, the recording device may alter the audio data of the incident recording to at least partially alter the communication audio data captured in the audio data.
Abstract: Utilization of a state machine to determine participant framing. The states include empty room, group framing, any talker, conversation mode and unambiguous talker. In empty room state, the conference room is framed. In group framing state, any participants in the room are framed. In any talker state, the talking participant is framed. In conversation mode state, all talking participants are framed. In unambiguous talker state, the single talking participant is framed. Various framing conditions define transitions between the states. Conditions include, presence of participants, which and number of participants that are talking for how long, system mute and far site talking. The conversation states and conditions and framing decisions provide a fully automated framing mechanism to provide pleasant framing of the individuals in the near site or end for any of the conditions relating to number of talkers, participants and the like.
Type:
Grant
Filed:
October 13, 2020
Date of Patent:
July 27, 2021
Assignee:
PLANTRONICS, INC.
Inventors:
Stephen Paul Schaefer, Alain Elon Nimri, Rommel Gabriel Childress, Jr.
Abstract: Provided is an information processing apparatus that includes a first display unit, a second display unit that displays an image acquired from a space on a communication partner side, and a control unit that performs a display control of the first display unit and the second display unit, and control to display, on at least one of the first display unit or the second display unit, a shared object whose display at least extends to a work area on the communication partner side.
Abstract: Methods and apparatus for taking a break after seamless transition between network conferences. In an embodiment, a method for taking a break after a transition between network conferences includes operations of attending a first network conference using a first conference state and a conferencing application, and displaying Up-Next conference status about a second network conference. The method also includes operations of receiving a request to enter a break mode after joining the second network conference, joining the second network conference using the first conference state and the conferencing application, and transmitting a break mode icon to participants in the second network conference.
Abstract: The invention provides a Bluetooth speaker and an intelligent control method for playing audio, the Bluetooth speaker comprises a housing, a Bluetooth module inside the housing, an audio playing module, an audio control module connected to the audio playing module and a central processor inside the housing, the Bluetooth module and the audio control module are connected to the processor, the speaker is connected to an intelligent wearable device through the Bluetooth module, so that the speaker performs Bluetooth communication with the device, the processor acquires heart rate data transmitted by the device through the Bluetooth module, sends a preset control instruction to the audio control module according to the data, the audio control module controls rhythm, volume, pausing and powering-off when the audio playing module plays audio according to the instruction. The invention intelligently controls audio playing automatically thereby upgrading user's usage experience.
Abstract: A method of operating a robot includes detecting movement of a video call counterpart using a video call counterpart robot included in image data received from the video call counterpart robot; canceling movement of a user from detected movement of the video call counterpart; and determining motion corresponding to the canceled movement of the video call counterpart.
Abstract: A video conferencing and law enforcement corroboration system. A video feed is established between actors including a law enforcement officer and a perpetrator, thereby forming a remote, real time communication between the two. Greetings are remotely exchanged, which can be pre-scripted such that the perpetrator may remotely ask for a reason for a stop using impartial, prompted language, to thereby objectivize initial contact. The officer is prompted to invite participation by a corroborating third party, wherein a two-way, real-time communication of a video feed can generate conference information. The identity of the perpetrator, a driver document and a vehicle search can occur remotely via a real time communication, wherein an interaction between a law enforcement officer and a perpetrator is documented without physical conflict and without exposing a perpetrator and a law enforcement officer to physical harm and violence.
Abstract: A system for enhancing audio including a plurality of sensors, an output device, and a processor in communication with the plurality of sensors and the output device. The processor is configured to process data captured by the plurality of sensors, and based on that, modify an output of the output device. The processor also is configured to determine whether there are a plurality of users associated with a video conferencing session, determine which user of the plurality of users is speaking, and enhance the audio or video output of the speaking user on the output device.
Type:
Grant
Filed:
August 25, 2020
Date of Patent:
July 13, 2021
Assignee:
APPLE INC.
Inventors:
Aleksandar Pance, Brett Bilbrey, Darby E. Hadley, Martin E. Johnson, Ronald Nadim Isaac
Abstract: An encoder and an encoding method for a multi-channel signal, and a decoder and a decoding method for a multi-channel signal are disclosed. A multi-channel signal may be efficiently processed by consecutive downmixing or upmixing.
Type:
Grant
Filed:
February 10, 2020
Date of Patent:
July 6, 2021
Assignee:
Electronics and Telecommunications Research Institute
Inventors:
Seung Kwon Beack, Tae Jin Lee, Jong Mo Sung, Jeong Il Seo, Kyeong Ok Kang, Dae Young Jang, Jin Woong Kim