Patents by Inventor Li Wan

Li Wan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11646011
    Abstract: Methods and systems for training and/or using a language selection model for use in determining a particular language of a spoken utterance captured in audio data. Features of the audio data can be processed using the trained language selection model to generate a predicted probability for each of N different languages, and a particular language selected based on the generated probabilities. Speech recognition results for the particular language can be utilized responsive to selecting the particular language of the spoken utterance. Many implementations are directed to training the language selection model utilizing tuple losses in lieu of traditional cross-entropy losses. Training the language selection model utilizing the tuple losses can result in more efficient training and/or can result in a more accurate and/or robust model—thereby mitigating erroneous language selections for spoken utterances.
    Type: Grant
    Filed: June 22, 2022
    Date of Patent: May 9, 2023
    Assignee: GOOGLE LLC
    Inventors: Li Wan, Yang Yu, Prashant Sridhar, Ignacio Lopez Moreno, Quan Wang
  • Publication number: 20230074247
    Abstract: A method for monitoring defects of a product implemented in an electronic device obtains product data in real time and determines whether a product is defective based on the product data; when the product is defective, outputting first warning information based on the number of defects of the product which satisfy a first preset condition; obtaining a rate of defects of the product every first preset time period, and outputting second warning information based on the rate of defects of the product when the rate of defects satisfies at least one of a second, third, and fourth preset conditions; when any warning information is output, analyzing distribution of the defects of the product; and predicting at least one cause of each defect of the product according to historical maintenance data of the product and a self-learning record of the electronic device.
    Type: Application
    Filed: August 29, 2022
    Publication date: March 9, 2023
    Inventors: ZI-QING XIA, LI WAN, CHENG-YONG ZHENG, LI HUANG, DONG CHEN, ZHEN-XIN DENG, XIN ZHOU, LIN-KUAN LU, MENG WANG, XIA LUO, XIAO-MEI MA
  • Patent number: 11594230
    Abstract: Methods, systems, apparatus, including computer programs encoded on computer storage medium, to facilitate language independent-speaker verification. In one aspect, a method includes actions of receiving, by a user device, audio data representing an utterance of a user. Other actions may include providing, to a neural network stored on the user device, input data derived from the audio data and a language identifier. The neural network may be trained using speech data representing speech in different languages or dialects. The method may include additional actions of generating, based on output of the neural network, a speaker representation and determining, based on the speaker representation and a second representation, that the utterance is an utterance of the user. The method may provide the user with access to the user device based on determining that the utterance is an utterance of the user.
    Type: Grant
    Filed: May 4, 2021
    Date of Patent: February 28, 2023
    Assignee: Google LLC
    Inventors: Ignacio Lopez Moreno, Li Wan, Quan Wang
  • Patent number: 11545157
    Abstract: Techniques are described for training and/or utilizing an end-to-end speaker diarization model. In various implementations, the model is a recurrent neural network (RNN) model, such as an RNN model that includes at least one memory layer, such as a long short-term memory (LSTM) layer. Audio features of audio data can be applied as input to an end-to-end speaker diarization model trained according to implementations disclosed herein, and the model utilized to process the audio features to generate, as direct output over the model, speaker diarization results. Further, the end-to-end speaker diarization model can be a sequence-to-sequence model, where the sequence can have variable length. Accordingly, the model can be utilized to generate speaker diarization results for any of various length audio segments.
    Type: Grant
    Filed: April 15, 2019
    Date of Patent: January 3, 2023
    Assignee: GOOGLE LLC
    Inventors: Quan Wang, Yash Sheth, Ignacio Lopez Moreno, Li Wan
  • Patent number: 11508862
    Abstract: The invention discloses a conductive paste for forming the electrode on the surface of solar cell, which contains conductive powder, mixed glass and organic phase; wherein, the mixed glass comprises the following two types of glass components: the first type of glass is at least one selected from the tellurium glass which does not contain lead substantially and having tellurium, bismuth, lithium as the essential component; The second type of glass is at least one kind of lead silicate glass, which having lead and silicon as essential components and does not contain tellurium substantially. The invention also provides a solar cell prepared by printing the conductive paste as a surface electrode and a manufacturing method of the solar cell.
    Type: Grant
    Filed: March 25, 2020
    Date of Patent: November 22, 2022
    Assignee: Changzhou Fusion New Material Co., Ltd.
    Inventors: Kuninori Okamoto, Haidong Liu, Yichao Ren, Changjun Xiong, Zhikai Xiong, Ansong Jiang, Li Wan, Tingting Mu
  • Publication number: 20220366934
    Abstract: A method comprises forming a single-crystal-like metal layer on a metal seed layer, the metal seed layer formed on a carrier wafer. The method comprises forming a first bonding layer on the single-crystal-like metal layer. The method also comprises forming a second bonding layer on a dielectric layer of a target substrate, the target substrate comprising one or more recording head subassemblies. The bonding layers may include diffusion layers or dielectric bonding layers. The method further comprises flipping and joining the carrier wafer with the target substrate such that the first and second diffusion layers are bonded and the single-crystal-like metal layer is integrated with the recording head as a near-field transducer.
    Type: Application
    Filed: July 29, 2022
    Publication date: November 17, 2022
    Inventors: Michael Christopher Kautzky, Tong Zhao, Li Wan, Xiaolu Kou
  • Publication number: 20220328035
    Abstract: Methods and systems for training and/or using a language selection model for use in determining a particular language of a spoken utterance captured in audio data. Features of the audio data can be processed using the trained language selection model to generate a predicted probability for each of N different languages, and a particular language selected based on the generated probabilities. Speech recognition results for the particular language can be utilized responsive to selecting the particular language of the spoken utterance. Many implementations are directed to training the language selection model utilizing tuple losses in lieu of traditional cross-entropy losses. Training the language selection model utilizing the tuple losses can result in more efficient training and/or can result in a more accurate and/or robust model—thereby mitigating erroneous language selections for spoken utterances.
    Type: Application
    Filed: June 22, 2022
    Publication date: October 13, 2022
    Inventors: Li Wan, Yang Yu, Prashant Sridhar, Ignacio Lopez Moreno, Quan Wang
  • Publication number: 20220301573
    Abstract: Processing of acoustic features of audio data to generate one or more revised versions of the acoustic features, where each of the revised versions of the acoustic features isolates one or more utterances of a single respective human speaker. Various implementations generate the acoustic features by processing audio data using portion(s) of an automatic speech recognition system. Various implementations generate the revised acoustic features by processing the acoustic features using a mask generated by processing the acoustic features and a speaker embedding for the single human speaker using a trained voice filter model. Output generated over the trained voice filter model is processed using the automatic speech recognition system to generate a predicted text representation of the utterance(s) of the single human speaker without reconstructing the audio data.
    Type: Application
    Filed: October 10, 2019
    Publication date: September 22, 2022
    Inventors: Quan Wang, Ignacio Lopez Moreno, Li Wan
  • Patent number: 11423928
    Abstract: A method includes forming a single-crystal-like metal layer on a metal seed layer, the metal seed layer formed on a sacrificial wafer. An anchor layer is formed on the single-crystal-like metal layer. The single-crystal-like metal layer is separated from the sacrificial wafer via the anchor layer. The single-crystal-like metal layer is transported via the anchor layer to a target substrate having one or more recording head subassemblies. The single-crystal-like metal layer is joined with the recording head, the single-crystal-like metal layer being integrated with the recording head as a near-field transducer.
    Type: Grant
    Filed: January 18, 2019
    Date of Patent: August 23, 2022
    Assignee: Seagate Technology LLC
    Inventors: Michael Christopher Kautzky, Tong Zhao, Li Wan, Xiaolu Kou
  • Patent number: 11410641
    Abstract: Methods and systems for training and/or using a language selection model for use in determining a particular language of a spoken utterance captured in audio data. Features of the audio data can be processed using the trained language selection model to generate a predicted probability for each of N different languages, and a particular language selected based on the generated probabilities. Speech recognition results for the particular language can be utilized responsive to selecting the particular language of the spoken utterance. Many implementations are directed to training the language selection model utilizing tuple losses in lieu of traditional cross-entropy losses. Training the language selection model utilizing the tuple losses can result in more efficient training and/or can result in a more accurate and/or robust model—thereby mitigating erroneous language selections for spoken utterances.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: August 9, 2022
    Assignee: GOOGLE LLC
    Inventors: Li Wan, Yang Yu, Prashant Sridhar, Ignacio Lopez Moreno, Quan Wang
  • Publication number: 20220157298
    Abstract: Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.
    Type: Application
    Filed: January 28, 2022
    Publication date: May 19, 2022
    Inventors: Ignacio Lopez Moreno, Quan Wang, Jason Pelecanos, Li Wan, Alexander Gruenstein, Hakan Erdogan
  • Patent number: 11308948
    Abstract: The present disclosure provides an intelligent interaction processing method and apparatus, a device and a computer storage medium. The method comprises: performing intention recognition for a preceding feedback item already returned to the user; continuing to return a subsequent feedback item to the user based on the intention of the preceding feedback item. According to the present disclosure, it is possible to guess the user's subsequent intention based on the preceding feedback item, and continue to return the desired subsequent feedback item to the user without the user's operations, so that the present disclosure is more intelligentized and richer and simplifies the user's operations.
    Type: Grant
    Filed: October 29, 2018
    Date of Patent: April 19, 2022
    Assignees: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., SHANGHAI XIAODU TECHNOLOGY CO. LTD.
    Inventors: Mengmeng Zhang, Gang Zhang, Li Wan, Jia Liu, Xiangtao Jiang, Ran Xu
  • Publication number: 20220089796
    Abstract: A hydrate kinetic inhibitor, which is prepared by a polymerization of mercaptoethanol and N-vinylcaprolactam, is hydroxyl terminated poly(N-vinylcaprolactam) having a structure of formula (I) below, wherein n=10 to 1000. The inhibitor is a novel hydrate kinetic inhibitor, which has low effective concentration and high cloud point, and is effective when the degree of supercooling is relatively high.
    Type: Application
    Filed: December 27, 2018
    Publication date: March 24, 2022
    Applicant: GUANGZHOU INSTITUTE OF ENERGY CONVERSION, CHINESE ACADEMY OF SCIENCES
    Inventors: Deqing LIANG, Li WAN
  • Publication number: 20220051694
    Abstract: A method of forming a thin film structure involves performing one or more repetitions to form a template on a wafer. The repetitions include: depositing a layer of a template material to a first thickness T1; and ion beam milling the layer of the template material to remove thickness T2, where T2<T1, resulting in a layer of the template material with thickness T1?T2. The ion beam milling is performed at a channeling angle relative to a deposition plane of the wafer, the channeling angle defined relative to a channeling direction of a crystalline microstructure of the template material. After the repetitions, additional material is deposited on the template to form a final structure. The additional material has a same crystalline microstructure as the template material.
    Type: Application
    Filed: October 27, 2021
    Publication date: February 17, 2022
    Inventors: Tong Zhao, Li Wan, Michael Christopher Kautzky
  • Patent number: 11238847
    Abstract: Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.
    Type: Grant
    Filed: December 4, 2019
    Date of Patent: February 1, 2022
    Assignee: Google LLC
    Inventors: Ignacio Lopez Moreno, Quan Wang, Jason Pelecanos, Li Wan, Alexander Gruenstein, Hakan Erdogan
  • Publication number: 20210363477
    Abstract: A drug screening device is provided. A method of determining optimal drug concentrations and efficacy in a patient using the device are provided. A method of determining effective chemotherapeutic drugs and effective concentrations thereof using the device is provided. Also, a method of determining safety and efficacy of drugs using the device is provided.
    Type: Application
    Filed: May 20, 2021
    Publication date: November 25, 2021
    Inventors: Philip LeDuc, Li Wan, Carola Neumann, John Skoko, Jun Yin, Mei Zhang
  • Patent number: 11183215
    Abstract: A thin film structure (e.g., a near-field transducer), includes a first surface parallel to a substrate on which the thin film structure is deposited and two other surfaces orthogonal to the first surface. The first surface and the two other surfaces have respective first, second, and third selected plane orientations with respective first, second, and third atomic packing factors. The first, second, and third selected plane orientations are selected to maximize an average of the first, second, and third atomic packing factors.
    Type: Grant
    Filed: July 29, 2019
    Date of Patent: November 23, 2021
    Assignee: Seagate Technology LLC
    Inventors: Tong Zhao, Li Wan, Michael Christopher Kautzky
  • Publication number: 20210357333
    Abstract: A clustered storage system includes a plurality of storage devices, each of which contributes a portion of its memory to form a global cache of the clustered storage system that is accessible by the plurality of storage devices. Cache metadata for accessing the global cache may be organized in a multi-layered structure. In one embodiment, multi-layered structure has a first layer first including a first address array, and the first address array include addresses pointing to a plurality of second address arrays in a second layer. Each second address array in the second layer includes addresses, each of which points to data that has been cached in the global cache.
    Type: Application
    Filed: July 30, 2021
    Publication date: November 18, 2021
    Inventors: Li Wan, Lili Chen, Hongliang Tang, Ning Wu
  • Publication number: 20210312907
    Abstract: Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.
    Type: Application
    Filed: December 4, 2019
    Publication date: October 7, 2021
    Inventors: Ignacio Lopez Moreno, Quan Wang, Jason Pelecanos, Li Wan, Alexander Gruenstein, Hakan Erdogan
  • Publication number: 20210311809
    Abstract: An interconnected computer system includes a Peripheral Component Interconnect Express (PCIe) fabric, a first computer system communicatively coupled to the PCIe fabric, a second computer system communicatively coupled to the PCIe fabric, and a shared single-access hardware resource coupled to the PCIe fabric. The first computer system includes a first processor and first memory coupled to the first processor configured to store a first flag indicating a desire of the first computer system to access the shared single-access hardware resource and a turn variable indicating which of the first computer system and the second computer system has access to the shared single-access hardware resource. The second computer system includes a second processor and second memory coupled to the second processor configured to store a second flag indicating a desire of the second computer system to access the shared single-access hardware resource.
    Type: Application
    Filed: June 22, 2021
    Publication date: October 7, 2021
    Inventors: Hongliang Tang, Li Wan, Lili Chen, Zhihao Tang