Methods And Apparatus For Sharing Encrypted Data
The present invention is directed to methods and systems in which data are stored as encrypted records on a computer usable medium, and search requests are processed based on user identities to retrieve the data without decrypting all the stored records. Individual decryption keys are associated with identities of respective owners, without being revealed to the owners and are kept internal inside a computer. Finally, all keys are overwritten from the computer usable medium upon completion of each search request to take access away from a superuser or any unauthorized access to the system.
Not Applicable
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENTNot Applicable
REFERENCE TO SEQUENCE LISTING, A TABLE, OR A COMPUTER PROGRAM LISTING COMPACT DISK APPENDIXNot Applicable
FIELD OF THE INVENTIONThe field of the invention relates to software for confidential retrieval of encrypted data.
BACKGROUND OF THE INVENTIONWith the explosion of Big Data, complex search queries can be slow when running against SQL database. The performance issue roots from the fact that very simple wildcard-based text search required full table scans which results in degradation of the site's overall performance.
Apache Lucene is a free open source information retrieval software library. Along with the Apache Solr which is the open source enterprise search platform were introduced to address this issue. Apache Lucene/Solr are made for any application which requires full text indexing and searching capability. They are widely recognized for its utility in implementation of Internet search engine and local, single-site searching. Solr is able to achieve fast search responses by searching on indexes rather than on texts.
Apache Solr core manages a single index. An index is a set of all data used to store information about document to be searched. Only one core is loaded at a time. A single Solr instance is capable of managing multiple indexes hence the name Multi-Core.
By default, users that have access to one of the cores may also be able to access other cores. Such lack of access control may not be desirable as it allows users to perform searches on unauthorized data that may not belong to them.
The Trust No One design philosophy requires that a owner of encrypted data should always remain in control of decrypting the data, and no third party can access the decryption without obtaining authorization from the owner.
To apply the Trust-No-One philosophy in designing a data store, a security model is needed to offer security from 3 different dimensions, namely, data-at-rest trust, superuser trust, and owner trust.
Data-at-rest trust: When data is at rest on a computer readable medium, it is subject to theft and unauthorized physical access to the computer. A conventional solution is to encrypt all data stored in the medium. One drawback of this approach is the need to decrypt all data each time a search request is processed, resulting in performance degradation that only gets worse with increasing data volume and amount of search requests.
Superuser trust: When data is encrypted in the above fashion, it is typically done by means of a superuser key, or the equivalence of a root access. The resulting trustworthiness of data-at-rest hinges on the system key being trusted fully. In other words, data-at-rest is deemed untrustworthy as soon as the system key is compromised. The first drawback is the violation of trust, as the control of decryption falls into the hands of the superuser, instead of owners of the encrypted data. One alternative solution is to encrypt data by means of owner-keys instead. This approach presents a different drawback in the difficulty to process search requests, which would require decrypting all data by means of obtaining all the respective owner-keys, which is also a violation of trust among users.
Owner trust: yet another trust dimension is to restrict access to encrypted data by ownership. A search request is allowed to be processed only after its associated identity is successfully authenticated to be trustworthy, that the request is trusted to be originated by the data owner or a delegate with equivalent assigned privilege. One drawback is the tight coupling between the identity of an owner and the search request. Using a password authentication as an example, which is a commonly used challenge-response type of technique to authenticate the identity of a user. All it takes is a valid password to process search requests to access encrypted data under the privilege of the owner. One drawback of this approach is the lack of support for sharing access to data without sharing the password, and the sharing of a password results in compromising any trust that may have been associated with an identity.
With the increasing demand for a secure long-term massive data store, it is highly desirable for an invention that can provide a data store with the Trust No One security model, by addressing all of the aforementioned drawbacks.
SUMMARY OF THE INVENTIONThe present invention is directed to methods and systems in which data are stored as encrypted records on a computer usable medium, and search requests are processed based on user identities to retrieve the data without decrypting all the stored records. Individual decryption keys are associated with identities of respective owners, without being revealed to the owners and are kept internal inside a computer. Finally, all keys are overwritten from the computer usable medium upon completion of each search request to take access away from a superuser or any unauthorized access to the system.
The Index Process 7002 uses the secret, a first key associated with the secret, an encrypted index, an index, a search expression, and a location during the processing of the search request. Upon completion of processing, the secret, the first key associated with the secret, the index, the search expression, and the location are overwritten from all computer usable medium. Only the encrypted index remains on the medium.
The Core Process 7003 uses the secret, the location, the core, and the encrypted record during the processing of the search request. Upon completion of processing, the secret and the location are overwritten from all computer usable medium. Only the core and the encrypted record remain on the medium.
The Record Process 7004 uses the encrypted record, the secret, a second key associated with the secret, and a plaintext during the processing of a search request. Upon completion of processing, the secret, the second key associated with the secret, and the plaintext are overwritten from all computer usable medium. Only the encrypted record remains on the medium.
In the preferred embodiments, all the above non-encrypted data and keys are all overwritten upon completion of a search request. It has been contemplated in other embodiments to overwrite immediately upon completion of each individual steps. Even further in other embodiments, it has also been contemplated to delegate the task of overwriting and reclaiming memory occupied to some automatic memory management processes.
Claims
1. (canceled)
2. (canceled)
3. (canceled)
4. A data search method performed by a device comprising a computer usable medium that physically embodies a secret, a core, a plaintext, an encrypted record, a search expression for the plaintext, an index, an encrypted index, a first key, and a second key, wherein the method overwrites the secret, the plaintext, the expression, the index, the first key, and the second key from the medium, after performing the steps of:
- processing the plaintext to obtain the core and the index, wherein the index is coupled with a location within the core;
- encrypting the index;
- encrypting the plaintext to obtain the encrypted record;
- registering the encrypted record into the core at the location;
- associating the first key with decrypting the encrypted record and the second key with decrypting the encrypted index;
- associating the secret with the first key, the second key, the encrypted index, and the core;
- obtaining the core associated with the secret;
- obtaining the encrypted index associated with the secret;
- decrypting the encrypted index by using the first key;
- decrypting the encrypted record by using the second key;
- processing the expression against the index to obtain the location;
- obtaining the encrypted record at the location from the core; and
- decrypting the encrypted record by using the key associated with the secret to obtain the plaintext.
5. (canceled)
6. (canceled)
7. A data search method performed by a device comprising a computer usable medium that physically embodies a secret, an identity, and an encrypted secret, wherein the method overwrites the identity and the secret from the medium after performing the steps of:
- processing a plaintext to obtain a core and an index, wherein the index is coupled with a location within the core;
- encrypting the index;
- encrypting the plaintext to obtain an encrypted record;
- registering the encrypted record into the core at the location;
- associating a key with decrypting the encrypted record and decrypting the encrypted index;
- associating the secret with the key, the encrypted index, and the core;
- encrypting the secret;
- associating a first user-key with decryption of the encrypted secret;
- associating the identity with the first user-key;
- authenticating the first user-key by using the identity;
- obtaining the encrypted secret associated with the first user-key;
- obtaining the secret by decrypting the encrypted secret with the first user-key;
- associating a second user-key with the decryption;
- associating a second identity with the second user-key;
- authenticating the second user-key by using the second identity;
- obtaining the encrypted secret associated with the second user-key; and
- obtaining the secret by decrypting the encrypted secret with the second user-key.
8. (canceled)
9. A computer system for searching of encrypted data records, the computer system comprising:
- A processor(s) set; and
- A computer readable storage medium;
- Wherein:
- The processor set is structured, located, connected and/or programmed to run computer readable program code physically embodied on the computer usable medium; and
- The computer readable program code is programmed to perform the steps of:
- providing a secret and a plaintext;
- processing the plaintext to obtain a core and an index, wherein the index is coupled with a location within the core;
- encrypting the index;
- encrypting the plaintext to obtain an encrypted record;
- registering the encrypted record into the core at the location;
- associating a key with decrypting the encrypted record and decrypting the encrypted index; and
- associating the secret with the key, the encrypted index, and the core.
10. (canceled)
Type: Application
Filed: Feb 9, 2015
Publication Date: Aug 11, 2016
Inventor: Sze Yuen Wong (Herndon, VA)
Application Number: 14/616,746