Patents Assigned to John Snow Labs Inc.
-
Patent number: 11940986Abstract: Techniques are described for performing automated operations related to identifying and using repair and maintenance status information, such as summarizing and encoding such information for one or more repair areas or other target domains, identifying specific repair or maintenance status information in response to natural language queries, and using the identified repair status information in further automated manners in some situations (e.g., to automatically initiate repair or maintenance actions on a particular target computing device). Identifying of specific repair status information in response to a particular natural language query may include initially identifying one or more candidate data groupings that match an encoded version of the natural language query (e.g., extracting encoded data groupings that match a generated version of the query), and optionally further analyzing one or more matching candidate data groupings as part of determining the actual response.Type: GrantFiled: August 23, 2022Date of Patent: March 26, 2024Assignee: John Snow Labs, Inc.Inventors: Veysel Kocaman, Julio Bonis Sanz, David Talby
-
Patent number: 11836969Abstract: A text extraction computing method that comprises calculating an estimated character pixel height of text from a digital image. The method may scale the digital image using the estimated character pixel height and a preferred character pixel height. The method may binarizes the digital image. The method may remove distortions using a neural network trained by a cycle GAN on a set of source text images and a set of clean text images. The set of source text images and clean text images are unpaired. The source text images may be distorted images of text. Calculating the estimated character pixel height may include summarizing the rows of pixels into a horizontal projection, and determining a line-repetition period from the projection, and quantifying the portion of the line-repetition period that corresponds to the text as the estimated character pixel height. The method may extract characters from the digital image using OCR.Type: GrantFiled: September 24, 2021Date of Patent: December 5, 2023Assignee: John Snow Labs Inc.Inventors: Jose Alberto Pablo Andreotti, David Talby
-
Patent number: 11783130Abstract: A computer process for entity resolution of natural language records including training a semantic embedding function on a corpus of unlabeled training materials. The semantic embedding function can take a word and represent it as a vector, where the vector represents the word as it relates to the semantic information of the corpus of unlabeled training materials. The process may transform a list of normalized descriptions using the semantic embedding function into a list of vector representations of the descriptions. The process may transform words from a natural language record to a vector representation of the natural language record using the semantic embedding function, and may use a named entity recognizer. The process may find a best match description from the list of normalized descriptions using the list of vector representations of the descriptions and the vector representation of the natural language record, and may include using word mover distance.Type: GrantFiled: May 6, 2019Date of Patent: October 10, 2023Assignee: John Snow Labs Inc.Inventors: Jose Pablo Andreotti, Saif Addin Ellafi, David Talby
-
Publication number: 20220012522Abstract: A text extraction computing method that comprises calculating an estimated character pixel height of text from a digital image. The method may scale the digital image using the estimated character pixel height and a preferred character pixel height. The method may binarizes the digital image. The method may remove distortions using a neural network trained by a cycle GAN on a set of source text images and a set of clean text images. The set of source text images and clean text images are unpaired. The source text images may be distorted images of text. Calculating the estimated character pixel height may include summarizing the rows of pixels into a horizontal projection, and determining a line-repetition period from the projection, and quantifying the portion of the line-repetition period that corresponds to the text as the estimated character pixel height. The method may extract characters from the digital image using OCR.Type: ApplicationFiled: September 24, 2021Publication date: January 13, 2022Applicant: John Snow Labs Inc.Inventors: Jose Alberto Pablo Andreotti, David Talby
-
Patent number: 11176410Abstract: A text extraction computing method that comprises calculating an estimated character pixel height of text from a digital image. The method may scale the digital image using the estimated character pixel height and a preferred character pixel height. The method may binarizes the digital image. The method may remove distortions using a neural network trained by a cycle GAN on a set of source text images and a set of clean text images. The set of source text images and clean text images are unpaired. The source text images may be distorted images of text. Calculating the estimated character pixel height may include summarizing the rows of pixels into a horizontal projection, and determining a line-repetition period from the projection, and quantifying the portion of the line-repetition period that corresponds to the text as the estimated character pixel height. The method may extract characters from the digital image using OCR.Type: GrantFiled: October 27, 2019Date of Patent: November 16, 2021Assignee: John Snow Labs Inc.Inventors: Jose Alberto Pablo Andreotti, David Talby
-
Publication number: 20210124979Abstract: A text extraction computing method that comprises calculating an estimated character pixel height of text from a digital image. The method may scale the digital image using the estimated character pixel height and a preferred character pixel height. The method may binarizes the digital image. The method may remove distortions using a neural network trained by a cycle GAN on a set of source text images and a set of clean text images. The set of source text images and clean text images are unpaired. The source text images may be distorted images of text. Calculating the estimated character pixel height may include summarizing the rows of pixels into a horizontal projection, and determining a line-repetition period from the projection, and quantifying the portion of the line-repetition period that corresponds to the text as the estimated character pixel height. The method may extract characters from the digital image using OCR.Type: ApplicationFiled: October 27, 2019Publication date: April 29, 2021Applicant: John Snow Labs Inc.Inventors: Jose Alberto Pablo Andreotti, David Talby
-
Publication number: 20200356627Abstract: A computer process for entity resolution of natural language records including training a semantic embedding function on a corpus of unlabeled training materials. The semantic embedding function can take a word and represent it as a vector, where the vector represents the word as it relates to the semantic information of the corpus of unlabeled training materials. The process may transform a list of normalized descriptions using the semantic embedding function into a list of vector representations of the descriptions. The process may transform words from a natural language record to a vector representation of the natural language record using the semantic embedding function, and may use a named entity recognizer. The process may find a best match description from the list of normalized descriptions using the list of vector representations of the descriptions and the vector representation of the natural language record, and may include using word mover distance.Type: ApplicationFiled: May 6, 2019Publication date: November 12, 2020Applicant: John Snow Labs Inc.Inventors: Jose Alberto Pablo, Saif Addin, David Talby