Patents Assigned to I.R.I.S.
-
Patent number: 11823497
Abstract: An image processing system and an image processing method for localising recognised characters in an image. An estimation unit is configured to estimate a first location of a recognised character that has been obtained by performing character recognition of the image. A determination unit is configured to determine second locations of a plurality of connected components in the image. A comparison unit is configured to compare the first location and the second locations, to identify a connected component associated with the recognised character. An association unit is configured to associate the recognised character, the identified connected component, and the second location of the identified connected component.
Type: Grant
Filed: June 10, 2022
Date of Patent: November 21, 2023
Assignee: I.R.I.S
Inventors: Frédéric Collet, Jordi Hautot, Michel Dauw
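The comparison step in this abstract can be illustrated with a small sketch. It is not the patented method: it assumes locations are axis-aligned bounding boxes and that "compare" means picking the connected component with the largest overlap, and the names `Box`, `overlap_area`, and `associate` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box in pixel coordinates (assumed representation)."""
    left: int
    top: int
    right: int
    bottom: int


def overlap_area(a: Box, b: Box) -> int:
    """Area of the intersection of two boxes, or 0 if they do not intersect."""
    width = min(a.right, b.right) - max(a.left, b.left)
    height = min(a.bottom, b.bottom) - max(a.top, b.top)
    return max(width, 0) * max(height, 0)


def associate(
    char: str, estimated: Box, components: Sequence[Box]
) -> Optional[Tuple[str, Box]]:
    """Pair a recognised character with the component that best overlaps
    its estimated location. Returns None when nothing overlaps."""
    best = max(components, key=lambda c: overlap_area(estimated, c), default=None)
    if best is None or overlap_area(estimated, best) == 0:
        return None
    # The component's own box is the "second location" kept with the character.
    return (char, best)
```

Largest overlap is only one plausible comparison rule; a distance between box centres would fit the abstract's wording equally well.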
-
Patent number: 11551461
Abstract: A text classifying apparatus (100), an optical character recognition unit (1), a text classifying method (S220) and a program are provided for performing the classification of text. A segmentation unit (110) segments an image into a plurality of lines of text (401-412; 451-457; 501-504; 701-705) (S221). A selection unit (120) selects a line of text from the plurality of lines of text (S222-S223). An identification unit (130) identifies a sequence of classes corresponding to the selected line of text (S224). A recording unit (140) records, for the selected line of text, a global class corresponding to a class of the sequence of classes (S225-S226). A classification unit (150) classifies the image according to the global class, based on a confidence level of the global class (S227-S228).
Type: Grant
Filed: April 10, 2020
Date of Patent: January 10, 2023
Assignee: I.R.I.S.
Inventors: Frédéric Collet, Vandana Roy
-
Patent number: 11386682
Abstract: An image processing system and an image processing method for localising recognised characters in an image. An estimation unit is configured to estimate a first location of a recognised character that has been obtained by performing character recognition of the image. A determination unit is configured to determine second locations of a plurality of connected components in the image. A comparison unit is configured to compare the first location and the second locations, to identify a connected component associated with the recognised character. An association unit is configured to associate the recognised character, the identified connected component, and the second location of the identified connected component.
Type: Grant
Filed: March 14, 2019
Date of Patent: July 12, 2022
Assignee: I.R.I.S
Inventors: Frédéric Collet, Jordi Hautot, Michel Dauw
-
Patent number: 11188779
Abstract: Processing a dithered image comprising a grid of pixels including defining an array of pixels corresponding to a sub-region of the image; performing edge detection along the rows and the columns of the array; counting the number of edges detected along the rows of the array to determine the number of horizontal edges in the array; counting the number of edges detected along the columns of the array to determine the number of vertical edges in the array; identifying whether the sub-region is dithered based on the number of horizontal and vertical edges in the array; and selectively processing the corresponding sub-region of the image based on whether or not the sub-region is identified to be dithered. The identification step may also be based on the lengths of segments of similar pixels in the lines of the array.
Type: Grant
Filed: March 4, 2019
Date of Patent: November 30, 2021
Assignee: I.R.I.S.
Inventors: Brecht Milis, Michel Dauw, Frédéric Collet
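The edge-counting idea in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the sub-region is taken to be a binary array of at least 2x2 pixels, an "edge" is any change of value between neighbouring pixels, and the decision rule (an edge ratio threshold, `min_edge_ratio`) is a hypothetical choice. The segment-length criterion mentioned in the last sentence of the abstract is not modelled.

```python
from typing import Sequence


def count_edges(line: Sequence[int]) -> int:
    """Count value changes between neighbouring pixels in one line."""
    return sum(1 for a, b in zip(line, line[1:]) if a != b)


def looks_dithered(block: Sequence[Sequence[int]], min_edge_ratio: float = 0.5) -> bool:
    """Flag a binary sub-region as dithered when edges are dense in both directions.

    Following the abstract's naming, edges found along rows are counted as
    horizontal edges and edges found along columns as vertical edges.
    """
    rows, cols = len(block), len(block[0])
    horizontal = sum(count_edges(row) for row in block)
    vertical = sum(count_edges(col) for col in zip(*block))
    # Largest possible counts, used to turn raw counts into ratios.
    max_horizontal = rows * (cols - 1)
    max_vertical = cols * (rows - 1)
    return (
        horizontal >= min_edge_ratio * max_horizontal
        and vertical >= min_edge_ratio * max_vertical
    )
```

A checkerboard block has an edge between every pair of neighbours and is flagged, while a block split into two solid halves has few edges and is not.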
-
Patent number: 11170265
Abstract: An image processing method for recognising characters included in an image. A first character recognition unit performs recognition of a first group of characters corresponding to a first region of the image. A measuring unit calculates a confidence measure of the first group of characters. A determination unit determines whether further recognition is to be performed based on the confidence measure. A selection unit selects a second region of the image that includes the first region, if it is determined that further recognition is to be performed. A second character recognition unit performs further recognition of a second group of characters corresponding to the second region of the image.
Type: Grant
Filed: February 26, 2019
Date of Patent: November 9, 2021
Assignee: I.R.I.S.
Inventors: Frédéric Collet, Jordi Hautot, Michel Dauw
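The control flow described in this abstract can be sketched as below. Everything concrete here is an assumption for illustration: regions are `(left, top, right, bottom)` tuples, the two recognisers are caller-supplied functions returning `(text, confidence)`, the second region is obtained by growing the first by a fixed `margin`, and the decision is a simple threshold `min_confidence`. Clipping the grown region to the image bounds is left out.

```python
from typing import Any, Callable, Tuple

Region = Tuple[int, int, int, int]  # (left, top, right, bottom), assumed layout
Recogniser = Callable[[Any, Region], Tuple[str, float]]


def recognise_with_fallback(
    image: Any,
    first_region: Region,
    recognise_first: Recogniser,
    recognise_second: Recogniser,
    min_confidence: float,
    margin: int,
) -> Tuple[str, Region]:
    """Run the first recogniser; if its confidence is too low, run the
    second recogniser on a larger region that contains the first one."""
    text, confidence = recognise_first(image, first_region)
    if confidence >= min_confidence:
        return text, first_region
    left, top, right, bottom = first_region
    # Second region: the first region grown on every side, so it includes it.
    second_region = (left - margin, top - margin, right + margin, bottom + margin)
    text, _ = recognise_second(image, second_region)
    return text, second_region
```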
-
Patent number: 10395393
Abstract: A method comprising: processing the image to a text image with a number of text blobs; classifying the text blobs based on a calculation as to whether they will belong to a foreground layer or to a background layer in OCR processing; and generating a quality value of the image based on the classified text blobs. By generating the quality value based on the classified text blobs, pictures in the image, which are not relevant for OCR, are not taken into account for assessing the quality of the image. The amount of data to be processed is thereby decreased, resulting in a method which can be executed in real-time. Furthermore, as the quality assessment criterion is based on the division of blobs into a foreground and a background layer, i.e. on prior knowledge of the OCR system, it provides a good indication for OCR accuracy.
Type: Grant
Filed: December 22, 2016
Date of Patent: August 27, 2019
Assignee: I.R.I.S.
Inventors: Jianglin Ma, Michel Dauw
-
Publication number: 20180260652
Abstract: A computer implemented method for optical character recognition (OCR) of a character string in a text image. The method efficiently combines two different OCR engines with the computation that needs to be done by the second OCR engine depending on the results found by the first OCR engine. This method provides, in particular, high speed and accurate results when the first OCR engine is fast and the second OCR engine is accurate. The combination is possible because the second OCR engine identifies each segment to be processed by the second OCR engine without needing to process all segments.
Type: Application
Filed: February 27, 2018
Publication date: September 13, 2018
Applicant: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw
-
Publication number: 20180182086
Abstract: A method comprising: processing the image to a text image with a number of text blobs; classifying the text blobs based on a calculation as to whether they will belong to a foreground layer or to a background layer in OCR processing; and generating a quality value of the image based on the classified text blobs. By generating the quality value based on the classified text blobs, pictures in the image, which are not relevant for OCR, are not taken into account for assessing the quality of the image. The amount of data to be processed is thereby decreased, resulting in a method which can be executed in real-time. Furthermore, as the quality assessment criterion is based on the division of blobs into a foreground and a background layer, i.e. on prior knowledge of the OCR system, it provides a good indication for OCR accuracy.
Type: Application
Filed: December 22, 2016
Publication date: June 28, 2018
Applicant: I.R.I.S.
Inventors: Jianglin Ma, Michel Dauw
-
Patent number: 9836646
Abstract: The invention relates to a method for combining a first Optical Character Recognition (OCR) and a second OCR. The first OCR is run first on an image of string of characters. Its output (first identified characters, positions of the characters and likelihood parameters of the characters) is used to generate a first graph. Segmentation points related to the positions of the first identified characters are used as input by the second OCR performing a combined segmentation and classification on the image of string of characters. The output (second identified characters, positions of the characters and likelihood parameters of the characters) of the second OCR is used to update the first graph to generate a second graph that combines the output of the first OCR with the output of the second OCR. Decision models are then used to modify the weights of paths in the second graph to generate a third graph.
Type: Grant
Filed: October 15, 2015
Date of Patent: December 5, 2017
Assignee: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere
-
Patent number: 9798943
Abstract: The optical character recognition method applies a first OCR engine to provide an identification of characters of at least a first type of characters and zones of at least a second type of characters in the character string image. A second OCR engine is applied on the zones of the at least second type of characters to provide an identification of characters of a second type of characters. The characters identified by the first OCR engine and by the second OCR engine are in a further step combined to obtain the identification of the characters of the character string image.
Type: Grant
Filed: June 9, 2014
Date of Patent: October 24, 2017
Assignee: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
-
Patent number: 9692929
Abstract: A method for correcting an image acquired by a hand-held scanning device. A binarized image of an acquired image is cropped by removing columns on the left end and on the right end of only first components. A work image is created from the cropped image by replacing in each row of components series of first components smaller than a predetermined distance with series of second components. In the work image, a central line is identified. The identified central line in the work image is used to identify the corresponding central line in the cropped image forming a central line image, and in the central line image, the central text line is straightened.
Type: Grant
Filed: August 31, 2015
Date of Patent: June 27, 2017
Assignee: I.R.I.S.
Inventors: Pierre De Muelenaere, Michel Dauw, Patrick Verleysen, Rene Barbier
-
Patent number: 9679179
Abstract: A method implemented on a mobile device for selecting information on a paper document using a hand-held scanning device. The method provides an interface to the user, receives information from the hand-held pen scanning device connected to the mobile device and determines if the received information is valid information for the type of information to be selected. The invention further relates to using a remote computer for performing image processing and data extraction when more powerful resources are needed.
Type: Grant
Filed: August 31, 2015
Date of Patent: June 13, 2017
Assignee: I.R.I.S.
Inventors: Pierre De Muelenaere, Michel Dauw, Patrick Verleysen, Rene Barbier
-
Publication number: 20170064126
Abstract: A method for correcting an image acquired by a hand-held scanning device. A binarized image of an acquired image is cropped by removing columns on the left end and on the right end of only first components. A work image is created from the cropped image by replacing in each row of components series of first components smaller than a predetermined distance with series of second components. In the work image, a central line is identified. The identified central line in the work image is used to identify the corresponding central line in the cropped image forming a central line image, and in the central line image, the central text line is straightened.
Type: Application
Filed: August 31, 2015
Publication date: March 2, 2017
Applicant: I.R.I.S
Inventors: Pierre De Muelenaere, Michel Dauw, Patrick Verleysen, Rene Barbier
-
Publication number: 20170061182
Abstract: A method implemented on a mobile device for selecting information on a paper document using a hand-held scanning device. The method provides an interface to the user, receives information from the hand-held pen scanning device connected to the mobile device and determines if the received information is valid information for the type of information to be selected. The invention further relates to using a remote computer for performing image processing and data extraction when more powerful resources are needed.
Type: Application
Filed: August 31, 2015
Publication date: March 2, 2017
Applicant: I.R.I.S.
Inventors: Pierre De Muelenaere, Michel Dauw, Patrick Verleysen, Rene Barbier
-
Patent number: 9311558
Abstract: A method for identifying a pattern in an image. In a first step the image is normalized to a binary matrix. A binary vector is subsequently generated from the binary matrix. The binary vector is filtered with a sparse matrix to a feature vector using a matrix vector multiplication wherein the matrix vector multiplication determines the values of the feature vector by applying program steps which are the result of transforming the sparse matrix in program steps including conditions on the values of the binary vector. Lastly, from the feature vector, a density of probability for a predetermined list of models is generated to identify the pattern in the image.
Type: Grant
Filed: April 16, 2014
Date of Patent: April 12, 2016
Assignee: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
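One way to read "conditions on the values of the binary vector" is that a product of a sparse matrix with a 0/1 vector needs no multiplications: each stored matrix value is simply added when the matching bit is set. The sketch below shows that reading only. The abstract describes transforming the sparse matrix into program steps ahead of time; here the same conditions are evaluated in an ordinary loop, and the column-list storage and the name `sparse_times_binary` are assumptions.

```python
from typing import List, Sequence, Tuple


def sparse_times_binary(
    columns: Sequence[Sequence[Tuple[int, float]]],
    bits: Sequence[int],
    n_features: int,
) -> List[float]:
    """Multiply a sparse matrix by a binary vector using only additions.

    columns[j] lists the non-zero entries of matrix column j as
    (row, value) pairs; bits[j] is the j-th element of the binary vector.
    """
    features = [0.0] * n_features
    for j, bit in enumerate(bits):
        if bit:  # the condition on the binary vector replaces a multiplication
            for row, value in columns[j]:
                features[row] += value
    return features
```

Columns whose bit is 0 are skipped entirely, so the work done grows with the number of set bits times the non-zeros per column rather than with the full matrix size.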
-
Publication number: 20150356365
Abstract: The optical character recognition method applies a first OCR engine to provide an identification of characters of at least a first type of characters and zones of at least a second type of characters in the character string image. A second OCR engine is applied on the zones of the at least second type of characters to provide an identification of characters of a second type of characters. The characters identified by the first OCR engine and by the second OCR engine are in a further step combined to obtain the identification of the characters of the character string image.
Type: Application
Filed: June 9, 2014
Publication date: December 10, 2015
Applicant: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
-
Patent number: 9183636
Abstract: A line segmentation method which starts with determining a first starting point coordinate and generating a list of potential character widths dependent on a maximum character width stored in a database and on characteristics of the portion of the line of text corresponding to the maximum character width. The method determines a second portion of the line of text corresponding to the first starting point coordinate and the first width on the list of potential character widths. On the second portion, a classification method is applied providing a likelihood of error for the first width and a candidate character. The likelihood of error is compared with a first threshold determined by a trade-off between speed and accuracy, and if the likelihood of error corresponding to the first width is lower than the threshold value, the candidate character is selected as the character meaning that a segment is known.
Type: Grant
Filed: April 16, 2014
Date of Patent: November 10, 2015
Assignee: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
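The accept-or-try-next-width logic in this abstract might be sketched as follows. Several parts are assumptions that go beyond the abstract: the list of candidate widths is passed in ready-made rather than derived from a stored maximum width, the classifier is a caller-supplied function `classify(start, width)` returning `(character, likelihood_of_error)`, the next starting point is taken to be the end of the accepted segment, and the loop simply stops when no width is accepted, since the abstract does not say what happens in that case.

```python
from typing import Callable, List, Sequence, Tuple

Classifier = Callable[[int, int], Tuple[str, float]]


def segment_line(
    line_width: int,
    candidate_widths: Sequence[int],
    classify: Classifier,
    threshold: float,
) -> List[Tuple[int, int, str]]:
    """Split a text line into (start, width, character) segments.

    At each starting point the candidate widths are tried in list order and
    the first one whose likelihood of error is below the threshold is kept.
    """
    start = 0
    segments: List[Tuple[int, int, str]] = []
    while start < line_width:
        for width in candidate_widths:
            char, error = classify(start, width)
            if error < threshold:
                segments.append((start, width, char))
                start += width  # assumed: next character starts where this one ends
                break
        else:
            break  # no candidate accepted; fallback behaviour is not modelled
    return segments
```

A lower threshold makes the loop reject more first guesses and try more widths, which is the speed versus accuracy trade-off the abstract mentions.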
-
Publication number: 20150302268
Abstract: A method for identifying a pattern in an image. In a first step the image is normalized to a binary matrix. A binary vector is subsequently generated from the binary matrix. The binary vector is filtered with a sparse matrix to a feature vector using a matrix vector multiplication wherein the matrix vector multiplication determines the values of the feature vector by applying program steps which are the result of transforming the sparse matrix in program steps including conditions on the values of the binary vector.
Type: Application
Filed: April 16, 2014
Publication date: October 22, 2015
Applicant: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
-
Publication number: 20150302598
Abstract: A line segmentation method which starts with determining a first starting point coordinate and generating a list of potential character widths dependent on a maximum character width stored in a database and on characteristics of the portion of the line of text corresponding to the maximum character width. The method determines a second portion of the line of text corresponding to the first starting point coordinate and the first width on the list of potential character widths. On the second portion, a classification method is applied providing a likelihood of error for the first width and a candidate character. The likelihood of error is compared with a first threshold determined by a trade-off between speed and accuracy, and if the likelihood of error corresponding to the first width is lower than the threshold value, the candidate character is selected as the character meaning that a segment is known.
Type: Application
Filed: April 16, 2014
Publication date: October 22, 2015
Applicant: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
-
Patent number: 9058517
Abstract: A pattern recognition system and method which generates a feature vector by multiplying an image vector with a sparse matrix. The sparse matrix is generated from a Gabor function which is a sinusoidal wave multiplied by a Gaussian function. The Gabor function is a function of a set of parameters including a parameter related to the direction of the sinusoidal wave, a parameter related to a center of the Gabor function, and a parameter related to a wavelength of the sinusoidal wave. The wavelength takes at least two values, with a first wavelength value lower than or substantially equal to the distance between two adjacent centers of the Gabor function, and the first wavelength value is lower than a second wavelength value and higher than or substantially equal to half the second wavelength value.
Type: Grant
Filed: April 16, 2014
Date of Patent: June 16, 2015
Assignee: I.R.I.S.
Inventors: Frederic Collet, Jordi Hautot, Michel Dauw, Pierre De Muelenaere, Olivier Dupont, Gunter Hensges
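A Gabor function as described here, a sinusoidal wave multiplied by a Gaussian, can be written down in a few lines. The sketch below is a generic textbook form, not the patent's exact parameterisation: it uses a cosine wave with zero phase and an isotropic Gaussian whose width `sigma` is an extra assumed parameter. `wavelengths_ok` encodes the two wavelength constraints from the abstract as exact inequalities, ignoring the "substantially equal" tolerance.

```python
import math


def gabor(x: float, y: float, cx: float, cy: float,
          theta: float, wavelength: float, sigma: float) -> float:
    """Value at (x, y) of a Gabor function centred at (cx, cy).

    theta is the direction of the sinusoidal wave in radians; sigma is the
    standard deviation of the Gaussian envelope (assumed, not in the abstract).
    """
    dx, dy = x - cx, y - cy
    # Distance travelled along the wave direction.
    along = dx * math.cos(theta) + dy * math.sin(theta)
    envelope = math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
    return envelope * math.cos(2.0 * math.pi * along / wavelength)


def wavelengths_ok(first: float, second: float, center_spacing: float) -> bool:
    """Check: first <= spacing between adjacent centres, and second/2 <= first < second."""
    return first <= center_spacing and second / 2.0 <= first < second
```

Sampling `gabor` on the pixel grid for each centre, direction, and wavelength, and dropping the near-zero values far from the centre, is one plausible way such a matrix would end up sparse.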