Patents by Inventor Marc A. Gollub

Marc A. Gollub has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20110029807
    Abstract: A method and circuit for implementing enhanced memory reliability using memory scrub operations to determine a frequency of intermittent correctable errors, and a design structure on which the subject circuit resides are provided. A memory scrub for intermittent performs at least two reads before moving to a next memory scrub address. A number of intermittent errors is tracked where an intermittent error is identified, responsive to identifying one failing read and one passing read of the at least two reads.
    Type: Application
    Filed: July 28, 2009
    Publication date: February 3, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Richard E. Fry, Marc A. Gollub, Eric E. Retter, Kenneth L. Wright
  • Patent number: 7870441
    Abstract: Determining an underlying cause for errors that are detected in the data processing system is performed. An occurrence of at least one error in the data processing system is detected, thereby forming a detected error. Responsive to detecting the detected error, a determination is made as to whether there is at least one previously recorded error in an error history data structure that is the underlying cause for the detected error. The at least one previously recorded error is related to the detected error and the at least one previously recorded error is of a different type from the detected error. Responsive to identifying the at least one previously recorded error, the at least one previously recorded error is reported to a user.
    Type: Grant
    Filed: March 18, 2008
    Date of Patent: January 11, 2011
    Assignee: International Business Machines Corporation
    Inventors: Marc A. Gollub, Chuon W. Liu
  • Publication number: 20100293437
    Abstract: A system to improve memory failure management may include memory, and an error control decoder to determine failures in the memory. The system may also include an agent that may monitor failures in the memory. The system may further include a table where the error control decoder may record the failures, and where the agent can read and write to.
    Type: Application
    Filed: January 31, 2008
    Publication date: November 18, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Marc A. Gollub, Luis A. Lastras-Montano, Piyush C. Patel, Eric E. Retter, Barry M. Trager, Shmuel Winograd, Kenneth L. Wright
  • Publication number: 20090300425
    Abstract: Embodiments of the invention provide an interrupt handler configured to distinguish between critical and non-critical unrecoverable memory errors, yielding different actions for each. Doing so may allow a system to recover from certain memory errors without having to terminate a running process. In addition, when an operating system critical task experiences an unrecoverable error, such a task may be acting on behalf of a non-critical process (e.g., when swapping out a virtual memory page). When this occurs, an interrupt handler may respond to a memory error with the same response that would result had the process itself performed the memory operation. Further, firmware may be configured to perform diagnostics to identify potential memory errors and alert the operating system before a memory region state change occurs, such that the memory error would become critical.
    Type: Application
    Filed: June 3, 2008
    Publication date: December 3, 2009
    Inventors: Marc A. Gollub, Zane C. Shelley, Alwood P. Williams, III
  • Publication number: 20090300434
    Abstract: Embodiments of the invention provide an interrupt handler configured to distinguish between critical and non-critical unrecoverable memory errors, yielding different actions for each. Doing so may allow a system to recover from certain memory errors without having to terminate a running process. In addition, when an operating system critical task experiences an unrecoverable error, such a task may be acting on behalf of a non-critical process (e.g., when swapping out a virtual memory page). When this occurs, an interrupt handler may respond to a memory error with the same response that would result had the process itself performed the memory operation. Further, firmware may be configured to perform diagnostics to identify potential memory errors and alert the operating system before a memory region state change occurs, such that the memory error would become critical.
    Type: Application
    Filed: June 3, 2008
    Publication date: December 3, 2009
    Inventors: Marc A. Gollub, Zane C. Shelley, Alwood P. Williams, III
  • Publication number: 20090300290
    Abstract: Embodiments of the invention provide an interrupt handler configured to distinguish between critical and non-critical unrecoverable memory errors, yielding different actions for each. Doing so may allow a system to recover from certain memory errors without having to terminate a running process. In addition, when an operating system critical task experiences an unrecoverable error, such a task may be acting on behalf of a non-critical process (e.g., when swapping out a virtual memory page). When this occurs, an interrupt handler may respond to a memory error with the same response that would result had the process itself performed the memory operation. Further, firmware may be configured to perform diagnostics to identify potential memory errors and alert the operating system before a memory region state change occurs, such that the memory error would become critical.
    Type: Application
    Filed: June 3, 2008
    Publication date: December 3, 2009
    Inventors: Marc A. Gollub, Zane C. Shelley, Alwood P. Williams, III
  • Publication number: 20090240990
    Abstract: Determining an underlying cause for errors that are detected in the data processing system is performed. An occurrence of at least one error in the data processing system is detected, thereby forming a detected error. Responsive to detecting the detected error, a determination is made as to whether there is at least one previously recorded error in an error history data structure that is the underlying cause for the detected error. The at least one previously recorded error is related to the detected error and the at least one previously recorded error is of a different type from the detected error. Responsive to identifying the at least one previously recorded error, the at least one previously recorded error is reported to a user.
    Type: Application
    Filed: March 18, 2008
    Publication date: September 24, 2009
    Applicant: International Business Machines Corporation
    Inventors: Marc A. Gollub, Chuon W. Liu
  • Publication number: 20080201620
    Abstract: A system, method and program product for utilizing error correction code (ECC) logic to detect multi-bit errors. In one embodiment, a first test pattern and a second test pattern are applied to a set of hardware bit positions. The first and second patterns are multiple logic level patterns and the second test pattern is the logical complement of the first test pattern. The first and second test patterns are utilized by the ECC logic to detect correctable errors having n or fewer bits. One or more bit positions of a first correctable error occurring responsive to applying the first test pattern are determined and one or more bit positions of a second correctable error occurring responsive to applying the second test pattern are determined. The determined bit positions of the first and second correctable errors are processed to identify a multiple-bit error within the set of hardware bit positions.
    Type: Application
    Filed: February 21, 2007
    Publication date: August 21, 2008
    Inventor: Marc A Gollub
  • Publication number: 20070220335
    Abstract: A computer implemented method, apparatus, and computer usable program code for performing a diagnostic in a hardware component established in a data processing system. The method includes starting a first slow mode initial program loading in the hardware component, wherein the hardware component contains a first diagnostic routine. A determination is made whether the first diagnostic routine fails during the slow mode initial program loading. Responsive to a determination that the first diagnostic routine fails, the first diagnostic routine is isolated.
    Type: Application
    Filed: February 28, 2006
    Publication date: September 20, 2007
    Inventors: Marc Gollub, Chuon Liu
  • Publication number: 20060048005
    Abstract: A method, apparatus, and computer program product are disclosed for enhancing the error reporting that is done utilizing fault isolation registers (FIRs) after executing a diagnostic test. A diagnostic test is generated that will test for a particular type of error. A particular FIR is designated to be used to report the particular type of error. The particular FIR is designated in the design of the particular diagnostic test and the design of the particular FIR. Another FIR that is not designed to be used to report the particular type of error is also selected. This other FIR is unrelated to the diagnostic test. The result of the execution of the diagnostic test is reported utilizing the particular FIR and the other, selected FIR.
    Type: Application
    Filed: August 26, 2004
    Publication date: March 2, 2006
    Applicant: International Business Machines Corporation
    Inventors: Marc Gollub, Chuon Liu