STREAMING DATA PROCESSING FOR HYBRID ONLINE MEETINGS
Techniques of streaming data processing for hybrid online meetings are disclosed herein. In one example, a method includes receiving, at the remote server, a video stream captured by a camera in the conference room. The video stream captures images of multiple local participants of an online meeting. The method also includes determining identities of the captured images of the multiple local participants in the received video stream using meeting information of the online meeting and generating a set of individual video streams each corresponding to one of the multiple local participants. The set of individual video streams can then be transmitted to the second computing device corresponding to a remote participant of the online meeting as if the multiple local participants are virtually joining the online meeting.
Online meeting generally refers to a computing platform that enables multiple users to interact with one another via video/audio streaming delivered real-time by a computer network such as the Internet. During a meeting session, computing devices can initially establish streaming connections with a remote server configured to relay video/audio streams for the session. Upon establishing the streaming connections, each computing device can capture and transmit to the remote server a video/audio stream of a participant. The remote server then can then relay the received video/audio streams to the other participants. As such, the participants of the online meeting can interact with and view videos of one another in real-time.
SUMMARYThis Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
Online meetings can sometimes involve a mixture of local participants and one or more online participants. Such online meetings are sometime referred to as hybrid online meetings. For example, multiple local participants can hold a meeting in a conference room while additional remote participants joining the meeting via video/audio streaming to a computing device in the conference room. Though a computing device in the conference room can capture and transmit an overview video stream of the conference room, user experience of the remote participants can be very different than that of the local participants. For instance, the video stream from the conference room may be captured using a wide-angle camera to provide a broad view of the conference room. As such, the video stream from the conference room may not adequately identify the local participants or who is currently speaking. Also, the general view of the conference room may lead the remote participants to feel as observers instead of participants of the online meeting. The different user experiences may cause the hybrid online meeting to be less productive and inclusive than desired.
Several embodiments of the disclosed technology can address at least some aspects of the foregoing shortcomings of hybrid online meetings by implementing streaming data processing at an online meeting platform. The streaming data processing can allow remote participants to view local participants as if the local participants are also joining the online meeting virtually. In certain embodiments, upon launching an online meeting, a video processor at the remote server can be configured to determine identities of the local participants by retrieving data of the scheduled meeting from a calendar server. The retrieved data can represent, for instance, names, email addresses, identification photos, organizational positions, office locations, and/or other suitable information of the local participants. In other embodiments, the retrieved data can also include or be used to retrieve audio samples of the local participants. Such audio samples may be previous recorded by the local participants with appropriate user consent. In further embodiments, at least one module of the video processor may be a part of a computing device in the conference room or other suitable components of the online meeting platform.
During operation, the video processor can receive a raw video stream of a general view of the conference room from, for instance, a camera or a computing device in the conference room. Upon receiving the raw video stream, in certain embodiments, the video processor can be configured to identify each local participant using identification photos of the local participants via facial recognition. For example, using the identification photos, the video processor can be configured to identify facial features of the local participants by extracting landmarks or features, such as a relative position, size, and/or shape of the eyes, nose, cheekbones, and jaw, from the identification photos of the local participants. The video processor can then be configured to identify each local participants in the raw video stream using the extracted landmarks or features in one or more frames of the raw video stream. In other embodiments, the video processor can also be configured to identify each local participants via voice matching using the retrieved audio samples in lieu of or in addition to the identification photos. In further embodiments, each local participants may be identified via user tagging or via other suitable techniques.
Upon identifying each local participant in the raw video stream, the video processor can be configured to generate multiple individual video streams corresponding to each local participant in the conference room from the raw video stream. In certain implementations, the raw video stream captured by the camera can be cropped or trimmed to eliminate other local participants to generate an individual video stream for one of the local participants. In one embodiment, each frame of the cropped individual video stream can have a preset fixed size (e.g., 1280 by 720 pixels) of an image and/or the identification photo of the local participant in at least approximately a center of the image/video. In another embodiment, the image/video of the local participant in each frame can be digitally zoomed to be a preset size. In further embodiments, the cropped individual video stream can also have a preselected or dynamically adjusted aspect ratio (e.g., 3:4) or have other suitable characteristics. Once the individual video streams are generated, the video processor or other components of the remote server can be configured to transmit the individual video streams to the remote participants instead of the raw video stream. In other implementations, the foregoing processing of the raw video stream can be performed at the computing device in the conference room before transmitting the individual video streams to the remote server for delivery to the remote participants.
In further implementation, the video processor can be configured to generate the individual video streams by modulating and focusing the camera located in the conference room. In one embodiment, upon detecting speech from a local participant in the conference room, the video processor can be configured to control movement of the camera (e.g., pivot, pan, etc.) toward the local participant providing the speech. The video processor can also be configured to cause the camera to focus on and optionally zoom in on the local participant. The video processor can then capture an individual video stream of the local participant via the camera while the local participant continues to speak. In other embodiments, the video processor can also be configured to filter out audio signals from other local participants while speech is detected from the speaking local participant. While the camera is focused on the speaking local participant, individual video streams of other local participants may be looped or frozen.
Several embodiments of the video processor can thus provide the individual video streams to the remote participants as if the local participants are attending the meeting virtually. With the individual video streams, the remote server can be configured to provide additional functionalities to the remote participants. For example, the remote server can be configured to identify each individual video streams with a corresponding name, email address, or other suitable identifier of one of the local participants. The remote server can also be configured to identify a local participant who is currently speaking, for instance, by providing a speech indicator on the individual video stream. In addition, the remote servers can be configured to allow the remote participants to mute one or more local participants in the conference room by activating suitable controls on the corresponding individual video streams. As such, the online meeting session with the individual video streams can be more productive and inclusive than with an overview video stream of the conference room.
Certain embodiments of systems, devices, components, modules, routines, data structures, and processes for streaming data processing in an online meeting platform are described below. In the following description, specific details of components are included to provide a thorough understanding of certain embodiments of the disclosed technology. A person skilled in the relevant art will also understand that the technology can have additional embodiments. The technology can also be practiced without several of the details of the embodiments described below with reference to
The disclosure herein is generally directed to streaming data processing in an online meeting platform. Online meetings can allow participants to interact with one another via video/audio streaming in real-time by a computer network such as the Internet. During a meeting session, computing devices can initially establish streaming connections with a remote server configured to relay video/audio streams for the session. Upon establishing the streaming connections, each computing device can capture and continuously transmit to the remote server a video/audio stream of a participant. The remote server then can then relay the received video/audio streams to the other participants. As such, the participants of the online meeting can interact with and view videos of one another in real-time.
Online meetings can sometimes be hybrid by having local participants and remote participants joining the online meeting virtually. Though a computing device in the conference room can capture and transmit an overview video stream of the conference room, user experience of the remote participants can be very different than that of the local participants. For instance, the video stream from the conference room may be captured using a wide-angle camera to provide a broad view of the conference room. As such, the video stream from the conference room may not adequately identify the local participants or who is currently speaking. Also, the broad view of the conference room may lead the remote participants to feel as observers instead of participants of the online meeting. The different user experiences may cause the hybrid online meeting to be less productive and inclusive than desired.
Several embodiments of the disclosed technology can address at least some aspects of the foregoing shortcomings by allowing remote participants to view local participants as if the local participants are also joining the online meeting virtually. In certain embodiments, upon launching an online meeting, the remote server can be configured to determine identities of the local participants based on identification photos via facial recognition. Facial recognition can involve identifying or verifying a person from a digital image or a video frame from a video source, for example, by comparing selected facial features from the digital image and facial data in a database. Example facial recognition techniques can identify facial features by extracting landmarks, or features, from the image of the person's face. For instance, a relative position, size, and/or shape of the eyes, nose, cheekbones, and jaw may be used as landmarks or features. These landmarks or features can then be used to search for other images with matching landmarks or features. Other example facial recognition techniques can also utilize three-dimensional facial recognition, skin texture analysis, or other suitable techniques. Based on the determined identity, a raw video stream of the local participants can be edited or otherwise modified into a set of individual video streams corresponding to each local participant. With the individual video streams, the remote server can be configured to provide additional functionalities to the remote participants to render the online meeting more productive and inclusive than with an overview video stream of the conference room, as described in more detail below with reference to
In certain implementations, the calendar items 116 can individually include a data structure with multiple data fields (not shown) containing data representing a meeting start/end date/times, names and/or email addresses of participants (e.g., remote and/or local participants 101a and 101b), a meeting location (e.g., conference room 105), a Universal Resource Locator (URL) of a link to an online version of the meeting, and/or other suitable information. In other implementations, the calendar items 116 can also include a description, an attachment, or other suitable information.
The contact list 110 can include data that represent an address book that contains users of a company, school, social network, or other suitable types of organization. This address book can be accessed over the computer network 104 using Lightweight Directory Access Protocol (“LDAP”), vCard Extensions to WebDAV (“CardDAV”), or another suitable protocol. Example components of the contact list 110 can include electronic and/or physical addresses, organizational position, telephone numbers, profile or identification photos, video samples, and/or other suitable information. The contact list 110 can be managed by an administrator (not shown) or at least partially managed by the individual remote/local participants 101a and 101b.
The client devices 102 can each include a computing device that facilitates corresponding remote participants 101a and local participant 101b accessing computing services such as online meeting services provided by the platform server 106 via the computer network 104. For example, in the illustrated embodiment, the client devices 102 individually include a desktop computer. In other embodiments, the client devices 102 can also include laptop computers, tablet computers, smartphones, or other suitable computing devices. Even though a certain number of remote participants 101a and local participants 101b in a conference room 105 are shown in
As shown in
The platform server 106 can be configured to facilitate online interactions of the remote/local participants 101a and 101b via real-time video streaming and other related functionalities. For example, the client devices 102 can initially establish a streaming connection with the platform server 106 via Real Time Streaming Protocol or other suitable protocols. Upon establishing the streaming connections, the cameras 103 can capture a video of the remote/local participants 101a and 101b, and the corresponding client devices 102 can continuously transmit the captured video streams 120 and 121 to the platform server 106. In turn, the platform server 106 can relay the received video streams 120 or 121 to the other remote/local participants 101a and 101b, as shown in
However, simply relaying the received video stream 121 from the client device 102 in the conference room 105 to the remote participants 101a can cause the user experience of the remote participants 101a to be very different than that of the local participants 101b. For instance, the video stream 120 from the conference room 105 may include a broad view of the conference room 105 with the local participants 101b. As such, the video stream 120 from the conference room 105 may not adequately identify the local participants 105 in the conference room 105 or who is currently speaking. Also, the broad view of the conference room 105 may lead the remote participants 101a to feel as observers instead of participants of the online meeting. The different user experiences may cause the online meeting to be less productive and inclusive than desired.
To address certain aspects of the foregoing drawbacks, the platform server 106 can be deploy a video processor 118 that is configured to generate a set of individual video streams 122 by editing the received video stream 121. As shown in
Upon receiving the video stream 121 from the client device 102 in the conference room 105, in certain embodiments, the video processor 118 can be configured to identify each local participant 101b using identification photos of the local participants 101b via facial recognition. For example, using the identification photos, the video processor 118 can be configured to identify facial features of the local participants 101b by extracting landmarks or features, such as a relative position, size, and/or shape of the eyes, nose, cheekbones, and jaw, from the identification photos of the local participants. The video processor 118 can then be configured to identify each local participants 118 in the video stream 121 using the extracted landmarks or features in one or more frames of the video stream 121. In other embodiments, the video processor can also be configured to identify each local participants via voice matching using the retrieved audio samples in lieu of or in addition to the identification photos. In further embodiments, each local participants may be identified via user tagging or via other suitable techniques.
Upon identifying each local participant 101b in the video stream 121, the video processor 118 can be configured to generate multiple individual video streams 122 corresponding to each local participant 101b in the conference room 105 from the video stream 121. In certain implementations, the video stream 121 captured by the camera 103 in the conference room 105 can be cropped or trimmed to eliminate other local participants 101b to generate an individual video stream 122 for one of the local participants 101b. In one embodiment, each frame of the cropped individual video stream 122 can have a preset fixed size (e.g., 1280 by 720 pixels) of an image and/or the identification photo of the local participant 101b in at least approximately a center of the image/video. In another embodiment, the image/video of the local participant 101 bin each frame can be digitally zoomed to be a preset size. In further embodiments, the cropped individual video stream 122 can also have a preselected or dynamically adjusted aspect ratio (e.g., 3:4) or have other suitable characteristics. Once the individual video streams 122 are generated, the video processor 118 or other components of the platform server 106 can be configured to transmit the individual video streams 122 to the remote participants 101a instead of the video stream 121 originally received from the client device 102 in the conference room 105. In other implementations, the video processor 118 can be deployed in the client device 102 in the conference room 105, and the foregoing processing of the video stream 121 can be performed at the computing device 102 in the conference room 105 before transmitting the individual video streams 122 to the platform server 105 for delivery to the remote participants 101a, as shown in
Also shown in
Several embodiments of the video processor 118 can thus provide the individual video streams 122 to the remote participants 101a as if the local participants 101b are attending the meeting virtually. With the individual video streams 122, the platform server 106 can be configured to provide additional functionalities to the remote participants 101a. For example, the platform server 106 can be configured to identify each individual video streams 122 with a corresponding name, email address, or other suitable identifier of one of the local participants 101b. The platform server 106 can also be configured to identify a local participant 101b who is currently speaking, for instance, by providing a speech indicator on the individual video stream 122. In addition, the platform server 106 can be configured to allow the remote participants to mute one or more local participants 101b in the conference room 105 by activating suitable controls on the corresponding individual video streams 122. As such, the online meeting session with the individual video streams 122 can be more productive and inclusive than with an overview video stream of the conference room 105.
Components within a system may take different forms within the system. As one example, a system comprising a first component, a second component and a third component can, without limitation, encompass a system that has the first component being a property in source code, the second component being a binary compiled library, and the third component being a thread created at runtime. The computer program, procedure, or process may be compiled into object, intermediate, or machine code and presented for execution by one or more processors of a personal computer, a network server, a laptop computer, a smartphone, and/or other suitable computing devices.
Equally, components may include hardware circuitry. A person of ordinary skill in the art would recognize that hardware may be considered fossilized software, and software may be considered liquefied hardware. As just one example, software instructions in a component may be burned to a Programmable Logic Array circuit or may be designed as a hardware circuit with appropriate integrated circuits. Equally, hardware may be emulated by software. Various implementations of source, intermediate, and/or object code and associated data may be stored in a computer memory that includes read-only memory, random-access memory, magnetic disk storage media, optical storage media, flash memory devices, and/or other suitable computer readable storage media excluding propagated signals.
As shown in
The identification component 134 can be configured to identify a local participant 101b (
In certain embodiments, the video editor 136 can be configured to crop or trim the video stream 121 to eliminate other local participants 101b to generate an individual video stream 122 (
Once the individual video stream 122 is generated, the video editor 136 can temporarily store the generated individual video stream 122 in, for instance, a buffer (not shown) and associate the individual video stream 122 with the identity of the local participant 101b. In one embodiment, the association is via modification of metadata of the generated video stream 122. In other embodiments, the association can be via other suitable data and/or metadata modification. Upon generation of the individual video stream 122, the video processor 118 or other components of the platform server 106 (
As shown in
Depending on the desired configuration, the processor 304 can be of any type including but not limited to a microprocessor (μP), a microcontroller (μC), a digital signal processor (DSP), or any combination thereof. The processor 304 can include one more level of caching, such as a level-one cache 310 and a level-two cache 312, a processor core 314, and registers 316. An example processor core 314 can include an arithmetic logic unit (ALU), a floating point unit (FPU), a digital signal processing core (DSP Core), or any combination thereof. An example memory controller 318 can also be used with processor 304, or in some implementations memory controller 318 can be an internal part of processor 304.
Depending on the desired configuration, the system memory 306 can be of any type including but not limited to volatile memory (such as RAM), non-volatile memory (such as ROM, flash memory, etc.) or any combination thereof. The system memory 306 can include an operating system 320, one or more applications 322, and program data 324. This described basic configuration 302 is illustrated in
The computing device 300 can have additional features or functionality, and additional interfaces to facilitate communications between basic configuration 302 and any other devices and interfaces. For example, a bus/interface controller 330 can be used to facilitate communications between the basic configuration 302 and one or more data storage devices 332 via a storage interface bus 334. The data storage devices 332 can be removable storage devices 336, non-removable storage devices 338, or a combination thereof. Examples of removable storage and non-removable storage devices include magnetic disk devices such as flexible disk drives and hard-disk drives (HDD), optical disk drives such as compact disk (CD) drives or digital versatile disk (DVD) drives, solid state drives (SSD), and tape drives to name a few. Example computer storage media can include volatile and nonvolatile, removable, and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. The term “computer readable storage media” or “computer readable storage device” excludes propagated signals and communication media.
The system memory 306, removable storage devices 336, and non-removable storage devices 338 are examples of computer readable storage media. Computer readable storage media include, but not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other media which can be used to store the desired information, and which can be accessed by computing device 300. Any such computer readable storage media can be a part of computing device 300. The term “computer readable storage medium” excludes propagated signals and communication media.
The computing device 300 can also include an interface bus 340 for facilitating communication from various interface devices (e.g., output devices 342, peripheral interfaces 344, and communication devices 346) to the basic configuration 302 via bus/interface controller 330. Example output devices 342 include a graphics processing unit 348 and an audio processing unit 350, which can be configured to communicate to various external devices such as a display or speakers via one or more A/V ports 352. Example peripheral interfaces 344 include a serial interface controller 354 or a parallel interface controller 356, which can be configured to communicate with external devices such as input devices (e.g., keyboard, mouse, pen, voice input device, touch input device, etc.) or other peripheral devices (e.g., printer, scanner, etc.) via one or more I/O ports 358. An example communication device 346 includes a network controller 360, which can be arranged to facilitate communications with one or more other computing devices 362 over a network communication link via one or more communication ports 364.
The network communication link can be one example of a communication media. Communication media can typically be embodied by computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and can include any information delivery media. A “modulated data signal” can be a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media can include wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, radio frequency (RF), microwave, infrared (IR) and other wireless media. The term computer readable media as used herein can include both storage media and communication media.
The computing device 300 can be implemented as a portion of a small-form factor portable (or mobile) electronic device such as a cell phone, a personal data assistant (PDA), a personal media player device, a wireless web-watch device, a personal headset device, an application specific device, or a hybrid device that include any of the above functions. The computing device 300 can also be implemented as a personal computer including both laptop computer and non-laptop computer configurations.
From the foregoing, it will be appreciated that specific embodiments of the disclosure have been described herein for purposes of illustration, but that various modifications may be made without deviating from the disclosure. In addition, many of the elements of one embodiment may be combined with other embodiments in addition to or in lieu of the elements of the other embodiments. Accordingly, the technology is not limited except as by the appended claims.
Claims
1. A method of streaming data processing in an online meeting platform having a remote server, a first computing device located in a conference room, and a second plurality of remote computing devices remote from the conference room interconnected via a computer network, the method comprising:
- receiving, at the remote server, a video stream captured by a camera of the first computing device located in the conference room, the video stream capturing images of multiple local participants of an online meeting that also has a plurality of remote participants joining the online meeting via the computer network using the second plurality of remote computing devices;
- receiving, at the remote server, a plurality of video streams captured by cameras of the second plurality of remote computing devices, the plurality of video streams including a first remote video stream associated with a first remote participant and a second remote video stream associated with a second remote participant;
- upon receiving the video stream captured by the camera of the first computing device, at the remote server, determining identities of the captured images of the multiple local participants in the received video stream using meeting information of the online meeting; and editing the received video stream from the first computing device to generate a set of individual video streams each corresponding to one of the multiple local participants and associating the generated set of individual video streams with the determined identities of the multiple local participants in the conference room;
- transmitting, to a first remote computing device associated with the first remote participant, the generated set of individual video streams and the second remote video stream; and
- transmitting, to a second remote computing device associated with the second remote participant, the generated set of individual video streams and the first remote video stream.
2. The method of claim 1, further comprising:
- retrieving, from a network storage, data representing the meeting information of the online meeting, the meeting information containing data representing names and identification photos of the multiple local participants; and
- wherein determining the identities of the captured images of the multiple local participants includes determining the identities of the multiple local participants using the identification photos via facial recognition.
3. The method of claim 1 wherein determining the identities of the multiple participants includes:
- determining that the first remote participant and the second remote participant are joining the online meeting remotely; and
- in response to determining that the first remote participant and the second remote participant are joining the online meeting remotely, skipping determining an identity of the first remote participant and the second remote participants who are joining the online meeting remotely.
4. The method of claim 1, further comprising:
- retrieving, from a network storage, data representing meeting information of the online meeting, the meeting information containing data representing identities of all participants; and
- wherein determining the identities of the multiple participants includes: determining that the first remote participant and the second remote participant are joining the online meeting remotely; and in response to determining that the first remote participant and the second remote participant are joining the online meeting remotely, retrieving identification photos of the first remote participant and the second remote participant from a contact list and using the retrieved identification photos to determine identities of the first remote participant and the second remote participant captured in the first remote video stream and the second remote video stream.
5. The method of claim 1, further comprising:
- retrieving, from a network storage, data representing meeting information of the online meeting, the meeting information containing data representing voice samples of all participants; and
- wherein determining the identities of the multiple participants includes: determining that the first remote participant and the second remote participant are joining the online meeting remotely; and in response to determining that the first remote participant and the second remote participant are joining the online meeting remotely, using the voice samples in the retrieved data representing the meeting information to determine identities of the first remote participant and the second remote participant captured in the first remote video stream and the second remote video stream via voice matching.
6. The method of claim 1 wherein editing the received video stream includes:
- for one of the multiple local participants, cropping the received video stream to remove all other local participants from the received video stream; and temporarily saving the cropped video stream as one of the individual video streams in the generated set.
7. The method of claim 1 wherein editing the received video stream includes:
- for one of the local participants, cropping the received video stream to remove all other local participants from the received video stream; digitally enlarging the cropped video stream into a preset size; and temporarily saving the digitally enlarged video stream as one of the individual video streams in the generated set.
8. (canceled)
9. The method of claim 1,
- wherein transmitting the generated set of individual video streams to the first remote computing device includes transmitting the generated set of individual video streams with the associated identities of the multiple local participants in the conference room to the first remote computing device instead of the received video stream received from the first computing device, and
- wherein transmitting the generated set of individual video streams to the second remote computing device includes transmitting the generated set of individual video streams with the associated identities of the multiple local participants in the conference room to the second remote computing device instead of the received video stream received from the first device.
10. A server in an online meeting platform having a first computing device located in a conference room and a second plurality of remote computing devices remote from the conference room interconnected to the server via a computer network, the server comprising:
- a processor; and
- a memory operatively coupled to the processor, the memory containing instructions executable by the processor to cause the computing device to: receive a video stream captured by a camera of the first computing device located in the conference room, the video stream capturing images of multiple local participants of an online meeting that also has a plurality of remote participants joining the online meeting via the computer network using the second plurality of remote computing devices; receive a plurality of video streams captured by cameras of the second plurality of remote computing devices, the plurality of video streams including a first remote video stream associated with a first remote participant and a second remote video stream associated with a second remote participant; upon receiving the video stream captured by the camera of the first computing device located in the conference room, determine identities of the captured images of the multiple local participants in the received video stream using meeting information of the online meeting; and generate, based on the received video stream from the first computing device, a set of individual video streams each corresponding to one of the multiple local participants and associate the generated set of individual video streams with the determined identities of the multiple local participants; transmit, to a first remote computing device associated with the first remote participant, the generated set of individual video streams and the second remote video stream; and transmit, to a second remote computing device associated with the second remote participant, the generated set of individual video streams and the first remote video stream.
11. The computing device of claim 10 wherein the memory contains additional instructions executable by the processor to cause the computing device to:
- retrieve, from a network storage, data representing the meeting information of the online meeting, the meeting information containing data representing names and identification photos of the multiple local participants; and
- wherein to determine the identities of the captured images of the multiple local participants includes to determine the identities of the multiple local participants using the identification photos via facial recognition.
12. The computing device of claim 10 wherein the memory contains additional instructions executable by the processor to cause the computing device to:
- retrieve, from a network storage, data representing meeting information of the online meeting, the meeting information containing data representing voice samples of all participants; and
- wherein to determine the identities of the multiple participants includes to determine the identities of the multiple local participants using the voice samples in the meeting information via voice matching.
13. The computing device of claim 10 wherein to generate the set of individual video streams includes to:
- for one of the multiple local participants, crop the received video stream to remove all other local participants from the received video stream; and temporarily save the cropped video stream as one of the individual video streams in the generated set.
14. The computing device of claim 10 wherein to generate the set of individual video streams includes to:
- for one of the local participants, crop the received video stream to remove all other local participants from the received video stream; digitally format the cropped video stream into a preset size; and temporarily save the digitally enlarged video stream as one of the individual video streams in the generated set.
15. (canceled)
16. (canceled)
17. (canceled)
18. (canceled)
19. (canceled)
20. (canceled)
21. A method of streaming data processing in an online meeting platform having a remote server, a first computing device located in a conference room, and a second plurality of remote computing devices remote from the conference room interconnected via a computer network, the method comprising:
- receiving, at the remote server, a video stream captured by a camera of the first computing device located in the conference room, the video stream selectively capturing images of a first local participant from a plurality of local participants based on detected speech from the local participant;
- receiving, at the remote server, a plurality of video streams captured by cameras of the second plurality of remote computing devices, the plurality of video streams including a first remote video stream associated with a first remote participant and a second remote video stream associated with a second remote participant;
- transmitting, to a first remote computing device associated with the first remote participant, the video stream captured by the camera of the first computing device and the second remote video stream; and
- transmitting, to a second remote computing device associated with the second remote participant, to a second remote computing device associated with the second remote participant, the generated set of individual video streams and the first remote video stream.
22. The method of claim 21, further comprising:
- retrieving, from a network storage, data representing the meeting information of the online meeting, the meeting information containing data representing the identity of the first local participant; and
- wherein associating the captured video stream with the identity of the first local participant includes associating the captured video stream with the identity of the local participant in the retrieved data representing the meeting information of the online meeting.
23. The method of claim 21, further comprising:
- retrieving, from a network storage, data representing the meeting information of the online meeting, the meeting information containing data representing an identity and an identification photo of the first local participant; and
- wherein associating the captured video stream with the identity of the local participant includes: identifying the first local participant using the identification photo via facial recognition; and associating the captured video stream with the identity of the local participant in the retrieved data representing the meeting information of the online meeting.
24. The method of claim 21, further comprising:
- retrieving, from a network storage, data representing meeting information of the online meeting, the meeting information containing data representing a voice sample of the first local participant; and
- wherein associating the captured video stream with the identity of the local participant includes: identifying the first local participant using the voice sample via voice matching; and associating the captured video stream with the identity of the first local participant in the retrieved data representing the meeting information of the online meeting.
Type: Application
Filed: Aug 25, 2021
Publication Date: Mar 2, 2023
Inventors: Shai Ifrach (Yavne), Haim Somech (Ramat Gan)
Application Number: 17/411,905