Patents Issued in July 13, 2017
  • Publication number: 20170199860
    Abstract: Systems and methods are provided for generating tables from print-ready digital source documents. A document is received and one or more text fragments are identified on a rendered page of the document. A wrapping region collection is generated, comprising one or more wrapping regions. A tabular, narrative and label score is generated for each wrapping region. A block type is assigned to each wrapping region based on the scores. A wrapping region group and a block set are generated. One or more tables are generated based on text fragments corresponding to one of the one or more blocks. The text fragments are organized into corresponding fields of the one or more tables.
    Type: Application
    Filed: January 12, 2016
    Publication date: July 13, 2017
    Inventors: Mark Stephen Kyre, Jeffrey Lucas Eldridge, Austin Alexander Spears, Samuel Allen Hudock
  • Publication number: 20170199861
    Abstract: A system and method for providing user enhancements for spreadsheets and tables. In one implementation, the system detects a semantic organization event related to one or more data cells in a content-view of the spreadsheet or table. The system determines whether semantic organization data should be stored for the one or more data cells. The system can use the semantic organization data to display a semantic-view of the spreadsheet or table. The system then allows a user to navigate between the semantic-view and the content-view of the spreadsheet or table.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Inventors: Khurram ZIA, Kevyn ARNOTT
  • Publication number: 20170199862
    Abstract: Systems and methods are provided to for automatically generating an N dimensional model table in a spreadsheet program. A model table function or add-in is executed as part of a spreadsheet program. More than two selections of one or more rows or columns from two or more tables of the spreadsheet program are received from a pointing and selection device. Each selection represents a different data dimension of more than two different data dimensions of the two or more tables. A model table is generated in the spreadsheet program from the more than two different data dimensions selected from the two or more tables by automatically repeating the elements of at least one data dimension of the more than two different data dimensions.
    Type: Application
    Filed: July 9, 2015
    Publication date: July 13, 2017
    Inventor: Steve Litt
  • Publication number: 20170199863
    Abstract: Disclosed are database systems, methods, systems, and computer program products for providing tabular data in a social network feed. In some implementations, a server of a database system stores, in a database, social network feed data comprising a plurality of feed items as data objects. The server also stores, in a database, tabular data as data objects. The server shares the social network feed data and the tabular data in a social network feed displayable to present the feed items and the tabular data in a user interface, the tabular data being presentable in the user interface in a tabular format. The user interface may receive shareable user commentary regarding the tabular data. The stored tabular data may be editable by users of the database system. The social network feed may be updated to comprise notifications of edits to the tabular data.
    Type: Application
    Filed: January 23, 2017
    Publication date: July 13, 2017
    Inventors: Sonali Agrawal, Ryan Spraetz, Cheryl Porro
  • Publication number: 20170199864
    Abstract: An authoring tool for authoring wrap packages of cards. The authoring tool includes a set of tools that enable an author to create a new card of a wrap selected from a variety of card templates. In various embodiments, the card templates include textual, video, image/photo, document, gallery, chat, widgets, global components, location/GPS, transact, appointment, and end-of-wrap card templates. When a particular card template is selected, the template is then duplicated to create the new card. Thereafter, the author may author the card to include various components, content, attributes, layouts, styles triggers and/or behaviors. As the cards of the wrap are created, the authoring tool provides the author with the ability to order the sequence of the cards. The authoring tool thus enables authors to selectively create wrap packages that include media that conveys a narrative story and application functionality.
    Type: Application
    Filed: September 6, 2016
    Publication date: July 13, 2017
    Inventors: Eric H. GREENBERG, John M. GARRIS, Ian MCFARLAND, Jared L. FICKLIN, Mark E. ROLSTON, Matthew J. SANTONE, Jon STEVENS, Eric J. WICKS
  • Publication number: 20170199865
    Abstract: A computerized method and system for correlating template data to select contextually relevant content including receiving a request from a consumer device to serve a template at a server, the template comprising at least one tag, selecting a content in a database associated with the at least one tag based at least in part on a correlation between the template and the at least one tag, substituting the at least one tag with the content, and transmitting the template from the server to the consumer device.
    Type: Application
    Filed: March 29, 2017
    Publication date: July 13, 2017
    Inventors: Michael Ciancio-Bunch, Tom Waltz, Greg Curiel, Gerald Gibson, Dale McCrory, Kevin Parkerson, Matt Beard
  • Publication number: 20170199866
    Abstract: Identifying actionable statements in communications may include: extracting features from at least one training statement; training a pattern recognition module to identify one or more types of patterns in actionable statements based at least in part on the features; and generating an actionable statement identification model using the trained action verb module and the trained pattern recognition module. Identifying actionable statements in communications is preferably adaptive in a continuous manner (e.g. based on user feedback), and may also include: determining whether a statement includes an actionable statement; predicting an actionable statement class of the actionable statement based on a pattern represented in the statement; and outputting the predicted actionable statement class to a user. Corresponding systems and computer program products are also disclosed.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Inventors: Dalkandura Arachchige Kalpa Shashika Silva Gunaratna, Hamid Reza Motahari Nezhad
  • Publication number: 20170199867
    Abstract: A configuration includes: a morphological analyzer configured to analyze a text provided as an input in a form of natural language by a user; an intention-estimation processor configured to refer to an intention estimation model in which words and corresponding user's intentions to be estimated from the words are stored, to thereby estimate an intention of the user based on the text analysis results obtained by the morphological analyzer; an unknown-word extractor configured to extract, as an unknown word, a word that is not stored in the intention estimation model from among the text analysis results when the intention of the user fails to be uniquely determined by the intention estimation processor; and a response text message generator configured to generate a response text message that includes the unknown word extracted by the unknown-word extractor.
    Type: Application
    Filed: October 30, 2014
    Publication date: July 13, 2017
    Applicant: Mitsubishi Electric Corporation
    Inventors: Yusuke KOJI, Yoichi FUJII, Jun ISHII
  • Publication number: 20170199868
    Abstract: A mechanism is provided for representing information, such as binary sequence, in a manner that is easier to read and less likely to generate errors when interacted with by human. A dictionary is seeded with two or more set of words, the words being selected from distinct categories. Symbols may be created by combining words from the distinct categories. A mapping of symbols to corresponding values may then be generated. The generated mapping may be used to translate bit values to symbols and symbols to bit values.
    Type: Application
    Filed: March 27, 2017
    Publication date: July 13, 2017
    Inventors: Jon Arron McClintock, Darren Ernest Canavor, Jesper Mikael Johansson
  • Publication number: 20170199869
    Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments may enable multi-lingual communications through different modes of communications including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments may implement communications systems and methods that translate text between two or more languages (e.g., spoken), while handling/accommodating for one or more of the following in the text: specialized/domain-related jargon, abbreviations, acronyms, proper nouns, common nouns, diminutives, colloquial words or phrases, and profane words or phrases.
    Type: Application
    Filed: February 15, 2017
    Publication date: July 13, 2017
    Inventors: Gabriel Leydon, Francois Orsini, Nikhil Bojja, Shailen Karur
  • Publication number: 20170199870
    Abstract: Disclosed herein is a method for an automatic translation of input characters in the field of information input, which solves the low-efficiency problem in existing technologies for translating input characters. This method comprises: obtaining a translation command to translate characters entered in a first language; based on the language setting of the input interface for receiving first language input characters, determining a second language; and translating the characters entered in the first language into corresponding characters in the second language. By automatically determining the target language based on the input interface for receiving first language characters, the present invention allows for a rapid translation of input characters, a reduction in user operations, enhanced translation efficiency and improved user experiences.
    Type: Application
    Filed: May 17, 2016
    Publication date: July 13, 2017
    Inventors: Sheng ZHENG, Meng WANG
  • Publication number: 20170199871
    Abstract: A terminology marker system integrates a terminology analytical component for quantifying the amount of linguistic noise found in the translation output as measured against a dictionary; further, correlating the noise measured on a continuous basis enables the analytical component to build terminology predictive models used in a feedback loop to upstream components of the supply chain to improve future translation of new content. The system also provides a smart terminology assessment component for assessing linguistic assets and improving the quality of those assets to assist in translation. The system also provides a smart terminology evaluation component that is able to analyze MT output to make smart decisions on reducing the amount of post editing corrections needed for delivering a persistent level of translation quality.
    Type: Application
    Filed: October 13, 2016
    Publication date: July 13, 2017
    Inventors: Christophe D. Chenon, Marc P. Drapeau, Francis X. Rojas
  • Publication number: 20170199872
    Abstract: A computer system automatically organizes, retrieves, annotates and/or presents media data files as collections of media data files associated with one or more entities, such as individuals, groups of individuals or other objects, using context captured in real time from a viewing environment. The computer system presents media data from selected media data files on presentation devices in the viewing environment and receives and processes signals from sensors in that viewing environment. The processed signals provide context, which can be used to select and retrieve media data files, and can be used to further annotate the media data files and/or other data structures representing collections of media data files and/or entities. In some implementations, the computer system can be configured to be continually processing signals from sensors in the viewing environment to continuously identify and use the context from the viewing environment.
    Type: Application
    Filed: January 11, 2016
    Publication date: July 13, 2017
    Inventor: Georgios Krasadakis
  • Publication number: 20170199873
    Abstract: An event-related media management system contextualizes media content. The event-related media management system associates media content with contextual event-related data to associate the media content with the events and information about the events. The contextual event-related data can then be used to provide access to the media content, such as through relevant search results or by presenting the media content in organized displays for contextual browsing and navigation. In some embodiments the event-related media management system generates contextualized media content for contextual search, discovery, and advertising.
    Type: Application
    Filed: March 24, 2017
    Publication date: July 13, 2017
    Inventors: Terrence J. Barthel, Jeffrey P. Ess
  • Publication number: 20170199874
    Abstract: Systems and methods for generating and outputting personalized content recommendations are provided. When it is determined that two or more users are in close proximity to each other, the two or more users' respective media content libraries may be compared. Commonalities or related aspects of the two or more users' respective media content libraries can be determined and leveraged to generated personalized content recommendations.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Applicant: Disney Enterprises, Inc.
    Inventors: MEHUL PATEL, MARK ARANA, TONY R. SMITH, ALEXANDER C. CHEN
  • Publication number: 20170199875
    Abstract: In some examples, a computing device determines a data manipulation from a job specification. The device determines a corresponding data-processing instruction using data-source metadata, and determines and executes a corresponding query. In some examples, a device receives search keys. The device searches data-source metadata using the search keys. The device weights a first data source based on producer-consumer relationships between data sources, and ranks the first data source using the weight. In some examples, a device determines structural and content information of a data record. The device determines a data-source identifier from the structural information and stores the content information with the data-source identifier in a database. In some examples, via a user interface, a device receives a job specification and annotation data. The device stores the spec and the annotation data in a metadata repository.
    Type: Application
    Filed: January 8, 2016
    Publication date: July 13, 2017
    Inventors: Narayan Nevrekar, Francisc Stugren, Atul Bhartia, Kevin Joseph Fischer, Michael Guthrie, Sohail Mohammed, Scott Rosemund, Kazi Al-Rashid
  • Publication number: 20170199876
    Abstract: A query processing service processes a query pertaining to a question about a client device. The query processing service identifies a subset of answers, from a plurality of answers, which are determined to be most relevant to the query, based upon relevance data for the plurality of answers. The query processing service provides to a client device answer data that includes the subset of answers. The query processing service receives, from the client device, user selection data that specifies a user selection of a particular answer to the question about the client device. In response to receiving the user selection data, the query processing service updates user selection data maintained by the query processing service to include the user selection of the particular answer to the question about the client device. The query processing device may also revise relevance data based upon the update made to the user selection data.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Applicant: RICOH COMPANY, LTD.
    Inventors: Rathnakara Malatesha, Lana Wong, Hiroshi Kitada
  • Publication number: 20170199877
    Abstract: Apparatuses, systems, methods, and program products are disclosed for improved storage performance based on data placement. An identifier module generates a plurality of unique identifiers. Each of the unique identifiers is associated with a storage node of a storage cluster. A rank module ranks each of the plurality of unique identifiers in response to a received data item. The ranking of a unique identifier is based on one or more characteristics associated with the received data item. A storage module selects a unique identifier of the plurality of unique identifiers with a best ranking and stores the data item on a storage node associated with the selected unique identifier.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Inventors: DAVID W. COSBY, JIAN LI, SHYAM SAREEN, WILLIAM G. HOLLAND, MICHAEL N. CONDICT
  • Publication number: 20170199878
    Abstract: A system for generating a document includes a user interface processing subsystem configured to generate an interface configured to receive one or more commands. The commands are provided in at least one of an audio format and a textual format. The user interface processing subsystem is configured to provide at least one response to the commands in at least one of: a graphical, audio, and textual format. A natural language processing subsystem of the system is configured to receive and convert the one or more commands into annotated structured text. A search engine of the system is configured to receive the annotated structured text and search a document database for one or more documents related to the annotated structured text.
    Type: Application
    Filed: January 6, 2017
    Publication date: July 13, 2017
    Inventors: Manoharan Ramasamy, Nitin Madhukar Sawant, Jayant Swamy, Himanshu Shah
  • Publication number: 20170199879
    Abstract: The provided solution makes it possible to select one item from a set of items, a multicomponent score being associated with each item of the set of items. After having computed a value characterising a relation ranking between a first item of the set of items and a second item of the set of items, the computed value being computed as a function of a plurality of score components associated with the first and second item of the set of items, respectively, the first item or of the second item is selected based on the computed value.
    Type: Application
    Filed: January 10, 2017
    Publication date: July 13, 2017
    Inventor: Alexandre DELATTRE
  • Publication number: 20170199880
    Abstract: The present technology relates to an information processing device, an information processing method, and a program for aiding ecosystem utilization. A vegetation distribution and related information associated with the vegetation distribution are acquired, and the vegetation distribution and the related information are displayed. The present technology can be applied in a case where information for aiding ecosystem utilization is provided, for example.
    Type: Application
    Filed: August 28, 2015
    Publication date: July 13, 2017
    Applicant: Sony Corporation
    Inventor: Masatoshi Funabashi
  • Publication number: 20170199881
    Abstract: [Object] To enable a user to simply and easily confirm update information of a plurality of pieces of content data handled by a plurality of various application programs without any troublesome operation. [Solving Means] A plurality of update information display cards (51) are aligned and displayed on a display screen (50). The update information display cards (51) displayed on the display screen (50) are associated with different pieces of content data, and update information of the content data is individually arranged on the corresponding update information display card (51). Moreover, the update information display cards (51) on which the update information of the different types of content data are arranged are aligned and displayed on the display screen (50) in a chronological order of the updates of the content data.
    Type: Application
    Filed: October 25, 2016
    Publication date: July 13, 2017
    Applicants: SONY MOBILE COMMUNICATIONS INC., SONY CORPORATION
    Inventors: George ARRIOLA, Kichiro KUROZUMI, Takahiro KAWAGUCHI, Jorge FURUYA, Brian JOHNSON, Takashi KAWAKAMI, Masatoshi INAGAWA, Takamasa KURAMITSU, Takahiro OKADA, Hironobu AOKI, Seigo IWASAKI, Yoshimitsu FUNABASHI, Takashi HASEGAWA, Erica KATO, Tetsuya TAKAHASHI, Makoto KAMIYA
  • Publication number: 20170199882
    Abstract: Curation of a corpus of a cognitive computing system is performed interactively by reporting on user interface device to a user a parse tree illustration of discrepancies and corresponding assigned confidence factors detected between at least a portion of a first document and a second or more documents in the corpus. Responsive to a user selection of an illustrated discrepancy in the parse tree, a drill-down dialog is prepared and displayed which shows at least a text string for the portion of the first document and at least one conflicting text string from the second or more documents, and which provides at least one user-selectable administrative action option for handling the detected discrepancy. Responsive to receipt of user selection of an administrative action option, the computing system performs the action to handle the detected discrepancy.
    Type: Application
    Filed: January 12, 2016
    Publication date: July 13, 2017
    Applicant: International Business Machines Corporation
    Inventors: Donna K. Byron, Ellie Feirouz, Ashok Kumar, William G. O'Keeffe
  • Publication number: 20170199883
    Abstract: Some embodiments provide a method for a device having multiple users. The method identifies a process installed on the device that requires an isolated storage in a file system of the device. For each of a set of the users of the electronic device, the method assigns at least one container for use by the process within a user-specific section of the file system. The containers assigned to the process in a section of the file system specific to a particular user are only accessible by the process when the particular user is logged into the device. The method assigns at least one container for use by the process within a non-user-specific section of the file system. The containers assigned to the process within the non-user-specific section of the file system are accessible by the process irrespective of which user is logged into the device.
    Type: Application
    Filed: September 22, 2016
    Publication date: July 13, 2017
    Inventors: Andrew S. Terry, Kelly B. Yancey, Pierre-Olivier J. Martel, Richard L. Hagy, Timothy P. Hannon, Alastair K. Fettes
  • Publication number: 20170199884
    Abstract: Systems and methods for generating customized reports from data storage databases and other information management system databases. The methods include normalizing queried information from different types of information management system databases to enable system-wide report generation. Other implementations are disclosed.
    Type: Application
    Filed: March 28, 2017
    Publication date: July 13, 2017
    Inventors: Parag Gokhale, Parminder Singh
  • Publication number: 20170199885
    Abstract: A method and device for generating a list of video files, which are applied to a mobile terminal. The method comprises: when video files are imported in a mobile terminal, invoking a system interface program of the mobile terminal to scan the imported video files; if scanning is successful, obtaining attribute information about the video files through the system interface program, and storing same in a system database of the mobile terminal; if scanning is not successful, invoking a custom interface program, obtaining the attribute information about the video files, and storing same in a custom database; when it is required to generate a list of video files, obtaining the stored attribute information about the various video files from the system database and the custom database respectively; based on the attribute information about the various video files obtained from the system database and the custom database, generating the list of video files.
    Type: Application
    Filed: June 23, 2015
    Publication date: July 13, 2017
    Applicant: Guangzhou Kingsoft Network Technology Co., Ltd.
    Inventors: Feng Yun, Shaocong Ouyang
  • Publication number: 20170199886
    Abstract: A method, apparatus, article of manufacture, and a memory structure for allocating storage of media programs among global and local storage assets for hot and cold storage is disclosed. Trend data is obtained from a plurality of sources including web page click-throughs and social media, indexed and combined with data describing available media programs to identify media programs of interest and reallocate them to hot storage as required. The system uses REST-compliant methods and commands and is therefore pluggable and can be used with a variety of existing systems.
    Type: Application
    Filed: February 11, 2016
    Publication date: July 13, 2017
    Inventors: Dean Perrine, Christopher D. Blandy, Joseph P. Coffaro
  • Publication number: 20170199887
    Abstract: A plurality of data objects is generated from a data stream received for constructing a backup image. A stream size of the data stream is determined. For each multi-threaded put operation used to write a data object of the plurality of data objects to a cloud storage, a maximum number of threads is determined into which the data object can be separated into a number of parts for simultaneous transmission to the cloud storage. A part size of each part is determined. A number of multi-threaded put operations is determined for writing the plurality of data objects to the cloud storage based, at least in part, on the stream size, the maximum number of threads into which the data object can be separated, and the part size. The plurality of data objects is stored into the cloud storage using the number of multi-threaded put operations.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Inventors: John Keith Fullbright, Mark Duane Beaupre, Nilesh PradeepKumar Maheshwari
  • Publication number: 20170199888
    Abstract: Systems and methods for detecting the visibility of a content item on a resource are provided. One method includes receiving coordinates of a content item slot on a resource. The method further includes splitting the content item slot into a plurality of slot pieces. The method further includes, for each slot piece, determining a representative point of the slot piece and determining whether the slot piece is an uppermost element of the resource visible to a viewer of the resource at the representative point. The method further includes calculating an amount of slot pieces determined to be the uppermost elements of the resource at the representative points. The method further includes detecting a visibility of the content item in the content item slot based at least in part on the amount of slot pieces determined to be the uppermost elements of the resource at the representative points.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Inventors: Tuna Toksoz, Thomas Price
  • Publication number: 20170199889
    Abstract: A method and device for identifying junk picture files, which are used for a server side to identify junk picture files in cached network data. The method comprises: obtaining a directory to be detected; determining whether the number of files in the directory is greater than or equal to a first preset value; if so, determining whether file names of the files in the directory contain keywords which represent the cached network data; if the keywords are contained, determining that the files in the directory are the cached network data, and then, determining whether keywords which represent useless picture files exist in the cached network data, so as to find out whether junk picture files exist; if the junk picture files exist, identifying the found picture files as cached junk picture files; and recording a directory path where the cached junk picture files are located. By means of the above solution, the cached network junk picture files can be accurately identified.
    Type: Application
    Filed: June 23, 2015
    Publication date: July 13, 2017
    Applicant: Beijing Kingsoft Internet Security Software Co. Ltd.
    Inventors: Guoqiang Jiao, Qidong Yang, Xiliang Bai, Xueshi Du, Nan Zhang, Yong Chen
  • Publication number: 20170199890
    Abstract: A computerized method and apparatus for distinguishing between false positive read events and true positive events of reading a file, comprising determining an amount of date read from the file, in case the amount of data exceeds a threshold generating a true positive read event, otherwise generating a false positive read event in case a decision condition is met, and an apparatus to carry out the same.
    Type: Application
    Filed: March 26, 2017
    Publication date: July 13, 2017
    Inventors: Yakov FAITELSON, Ohad Korkus, David Bass, Yzhar Kaysar, Doron Goldestein, Oren David
  • Publication number: 20170199891
    Abstract: Embodiments for distributed data deduplication in a grid of processors. Input data is received on a processor. The input data is partitioned into a plurality of similarity units. A corresponding deduplication metadata slice and owning processor for one of the similarity units is calculated. A representative value and corresponding digest values of the similarity unit are sent to the owning processor. The owning processor is used to search for the representative value in the deduplication metadata slice, and to send a specification and owning processors of calculated identical data sections to the processor. The processor is used to send nominal information of the calculated identical data sections to the owning processors of the data referenced by the calculated identical data sections.
    Type: Application
    Filed: January 12, 2016
    Publication date: July 13, 2017
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lior Aronovich, Vincenzo Pasquantonio
  • Publication number: 20170199892
    Abstract: A method, including identifying, using a sampling ratio, a random number of logical data units. A hash is calculated for each of the identified logical data units, and a first histogram is computed indicating a duplication count of each of the calculated hashes. Based on respective frequencies of the calculated hashes, a second histogram is computed indicating observed frequencies of each of the duplication counts in the first histogram, and based on the sampling ratio and the second histogram, a target function is derived. A range of acceptable results is derived for the target function, and based on the range of the acceptable results, a set of plausible duplication frequency histograms is defined. A first given plausible duplication frequency histogram having a highest number of distinct logical data units is identified, and a second given plausible duplication frequency histogram having a lowest number of distinct logical data units is identified.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Applicant: International Business Machines Corporation
    Inventors: Danny Harnik, Ety Khaitzin, Dmitry Sotnikov
  • Publication number: 20170199893
    Abstract: Embodiments for storing data deduplication metadata in a grid of processors. Each of a plurality of slices of deduplication metadata is assigned to be stored by a corresponding processor in a grid of processors. Each slice of the plurality of slices includes at least one of a slice of a similarity index and groups of digests corresponding to those of a plurality of representative values in the slice of the similarity index. A hashing method is use to map between a plurality of input representative values and the plurality of slices of deduplication metadata.
    Type: Application
    Filed: January 12, 2016
    Publication date: July 13, 2017
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lior ARONOVICH, Vincenzo PASQUANTONIO
  • Publication number: 20170199894
    Abstract: Embodiments for rebalancing distributed deduplication metadata by a processor. An input similarity unit of data and a corresponding input representative value are received on an ingesting processor. A corresponding deduplication metadata slice and owning processor are calculated for the input similarity unit. The input representative value and input digest values are sent to the owning processor. The owning processor is used to search for the input representative value in a corresponding deduplication metadata slice, and to forward the input representative value and input digest values to an additional processor, if the input representative value is not found by the owning processor and a rebalancing status of the owning processor is in-process. The additional processor is used to send a reply message to the owning processor that facilitates migration of the input representative value and corresponding input digest values to the owning processor, if the input representative value is found.
    Type: Application
    Filed: January 12, 2016
    Publication date: July 13, 2017
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lior ARONOVICH, Vincenzo PASQUANTONIO
  • Publication number: 20170199895
    Abstract: A method, including partitioning a dataset into a first number of data units, and selecting, based on a sampling ratio, a second number of the data units. A hash value is calculated for each of the selected data units, and a first histogram is computed indicating a first duplication count for each of the calculated hash values. Based on respective frequencies of the calculated hash values, a second histogram is computed indicating an observed frequency for each of the first duplication counts in the first histogram, and based on the sampling ratio and the second histogram, a target function is derived. A third histogram that minimizes the target function is derived, the third histogram including, for the first number of the storage units, second duplication counts and a respective predicted frequency for each of the second duplication counts. Finally, a deduplication ratio is determined based on the third histogram.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Inventors: Danny Harnik, David Chambliss, Oded Margalit, Dmitry Sotnikov
  • Publication number: 20170199896
    Abstract: A system may read an input file having an input file size and including a first record and a second record. The first and second record may each have a record length. The system may parse the input file into a first split file and a second split file, with the first split file including the first record and the second split file including the second record. The system may distribute the first split file to a first node to generate a first output file and the second split file to a second node to generate a second output file. Any number of additional split files may be distributed to generate any number output files. The system may combine the output files to generate a converted data file.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Applicant: American Express Travel Related Services Company,
    Inventors: Nitish Sharma, Shubham Arora, Balaji Balaraman, Sandeep Bose, Arindam Chatterjee, Sastry Durvasula, Priya Narayana, Manoj Kumar Rana
  • Publication number: 20170199897
    Abstract: The present disclosure is directed toward systems and methods for inferring one or more qualities of a place based on social networking system activity associated that place. The present disclosure is also directed toward systems and methods for providing a multi-step search tool that utilizes the inferred qualities of places within a geographic area to provide a user with a more meaningful search experience.
    Type: Application
    Filed: January 7, 2016
    Publication date: July 13, 2017
    Inventors: Adriel Samuel Frederick, Benjamin Bregman, Robert James Sicurelli, III, Jan Kalis
  • Publication number: 20170199898
    Abstract: Neural network image curation techniques are described. In one or more implementations, curation is controlled of images that represent a repository of images. A plurality of images of the repository are curated by one or more computing devices to select representative images of the repository. The curation includes calculating a score based on image and face aesthetics, jointly, for each of the plurality of images through processing by a neural network, ranking the plurality of images based on respective said scores, and selecting one or more of the plurality of images as one of the representative images of the repository based on the ranking and a determination that the one or more said images are not visually similar to images that have already been selected as one of the representative images of the repository.
    Type: Application
    Filed: March 27, 2017
    Publication date: July 13, 2017
    Applicant: Adobe Systems Incorporated
    Inventors: Xiaohui Shen, Xin Lu, Zhe Lin, Radomir Mech
  • Publication number: 20170199899
    Abstract: An image processing system comprises: an image recognizing unit which performs image recognition to an image; a rule setting unit which sets a sorting rule to be used in case of transmitting the image recognized by the image recognizing unit to an external web service; and a transmitting unit which transmits, according to the sorting rule having a condition conforming to the image among the sorting rules set by the rule setting unit, information concerning the image to the corresponding external web service, thereby enabling to cooperate a still image/moving image sharing service and various web services with others.
    Type: Application
    Filed: March 27, 2017
    Publication date: July 13, 2017
    Inventors: Tsuyoshi Kuroki, Mikio Kiura, Masakazu Matsugu
  • Publication number: 20170199900
    Abstract: A server for providing a city street search service includes a street information database configured to store city street images, a feature selection unit configured to select at least one feature according to a predetermined criterion when a city street image for searching and two or more features for the image are received from a user terminal, a candidate extraction unit configured to extract a candidate list of a city street image, a feature matching unit configured to match the city street image for registration included in the extracted candidate list and the at least one selected feature, and a search result provision unit configured to provide the user terminal with a result of the matching as result information regarding the city street image for searching.
    Type: Application
    Filed: October 24, 2016
    Publication date: July 13, 2017
    Inventors: Seung Jae LEE, Keun Dong LEE, Hyung Kwan SON, Weon Geun OH, Da Un JUNG, Young Ho SUH, Wook Ho SON, Won Young YOO, Gil Haeng LEE
  • Publication number: 20170199901
    Abstract: A schema for a dataset is identified by identifying a dataset comprising data and relationships between data pairs. An original schema is identified for the dataset. This original schema comprises an organizational structure. An initial fit between the dataset and the original schema is determined. The initial fit quantifying a conformity of the data in the dataset to the organizational structure of the original schema. A plurality of additional schemas are identified. Each additional schema is a distinct organizational schema. The dataset is partitioned into a plurality of subsets. Each subset comprises a modified fit quantifying a modified conformity of subset data in each subset to one of the original schema and the additional schemas. The modified fit is greater than the original fit.
    Type: Application
    Filed: March 27, 2017
    Publication date: July 13, 2017
    Inventors: Marcelo ARENAS, Gonzalo DIAZ, Achille FOKOUE, Anastasios KEMENTSIETSIDIS, Kavitha SRINIVAS
  • Publication number: 20170199902
    Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.
    Type: Application
    Filed: January 7, 2016
    Publication date: July 13, 2017
    Applicant: Amazon Technologies, Inc.
    Inventors: NINA MISHRA, DANIEL BLICK, SUDIPTO GUHA, OKKE JOOST SCHRIJVERS
  • Publication number: 20170199903
    Abstract: Some aspects disclosed herein are directed to, for example, a system and method of backing out data. The method may comprise determining one or more unique identifiers for data to be loaded from a source system to one or more databases. The method may comprise loading the data from the source system to the one or more databases, and the data may be loaded with the one or more unique identifiers. A computing device may determine that a subset of the data loaded to the one or more databases comprises invalid data. In response to determining that the subset of the data loaded to the one or more databases comprises invalid data, the computing device may determine one or more unique identifiers for the invalid data. The invalid data may be removed, from the one or more databases, based on the one or more unique identifiers for the invalid data.
    Type: Application
    Filed: January 12, 2016
    Publication date: July 13, 2017
    Inventors: Rodney Shannon Floyd, Ron G. Rambo, Nancy M. Cerniglia
  • Publication number: 20170199904
    Abstract: Methods, computing systems and computer program products implement embodiments of the present invention that include partitioning a dataset into a full set of logical data units, and selecting a sample subset of the full set, the sample subset including a random sample of the full set based on a sampling ratio. A set of target hash values are selected from a full range of hash values, and, using a hash function, a respective unit hash value is calculated for each of the logical data units in the sample subset. A histogram is computed that indicates a duplication count of each of the unit hash values that matches a given target hash value, and based on the histogram, a number of distinct logical data units in the full set is estimated.
    Type: Application
    Filed: January 13, 2016
    Publication date: July 13, 2017
    Applicant: International Business Machines Corporation
    Inventors: Danny Harnik, Ety KHAITZIN, Dmitry SOTNIKOV
  • Publication number: 20170199905
    Abstract: In one embodiment, an online social network accesses a place-entities graph that includes a plurality of place-entity nodes, each place-entity node representing a particular place-entity associated with a particular geographic location. From one of a plurality of search clients associated with the online social network, a threshold duplication value and a threshold redirect value is received. The online social network identifies place-entity clusters of place-entity nodes within the place-entities graph. In each place-entity cluster, the place-entity nodes have duplication values with respect to a canonical place-entity node in the cluster that are above the threshold duplication value. The online social network generates a redirection graph for the one search client.
    Type: Application
    Filed: June 24, 2016
    Publication date: July 13, 2017
    Inventors: Myle Arif Ott, Jason Luis Briceno, Yaniv Shmueli
  • Publication number: 20170199906
    Abstract: A method for content-quality scoring, the method comprising obtaining a plurality of metrics associated with a piece of content on an online community, weighing the plurality of metrics resulting in an weighted value for the plurality of metrics, applying a decay to the weighted value for the plurality of metrics resulting in a decayed value for the plurality of metrics, and calculating a content-quality score for the piece of content using at least one of the plurality of metrics, the weighted value for the plurality of metrics and the decayed value for the plurality of metrics.
    Type: Application
    Filed: January 17, 2017
    Publication date: July 13, 2017
    Inventors: Jose Lema, Wyatt Lyon Preul, Stacy Randolph
  • Publication number: 20170199907
    Abstract: Matching processing between pieces of vector data is accelerated. A matching device 100 performs, for a plurality of pieces of vector data each having a plurality of dimensions, a predetermined operation pertaining to each dimension of each piece of vector data. The matching device 100 includes a collective operation unit 150 and an individual operation unit 160. The collective operation unit 150 performs the predetermined operation pertaining to a specific dimension among the plurality of dimensions by a vector operation for different pieces of vector data in the plurality of pieces of vector data. The individual operation unit 160 performs the predetermined operation pertaining to each dimension other than the specific dimension for a piece of vector data that satisfies a predetermined condition among the plurality of pieces of vector data.
    Type: Application
    Filed: June 11, 2015
    Publication date: July 13, 2017
    Applicant: NEC Corporation
    Inventor: Kazuhisa Ishizaka
  • Publication number: 20170199908
    Abstract: A method for maintaining a document database and search index includes the steps of: (i) receiving a plurality of document action requests including one or more of a create document action request, a refresh document action request, a refreshfull document action request, an update document action request, or a delete document action request; (ii) creating a timestamp for each of the plurality of received document action requests; (iii) querying the document action requests to determine which are associated with a first document action request type; (iv) processing any of the document action requests of the first document action request type, where the document action requests are processed according to their associated timestamp; and (v) updating the search index using the updated document database.
    Type: Application
    Filed: August 16, 2016
    Publication date: July 13, 2017
    Applicant: International Business Machines Corporation
    Inventors: Alexander Gavrin, Raymond T. Jones, Paulo H. Paulin, Jorge A. Senger, Maksim Vinokurov
  • Publication number: 20170199909
    Abstract: A device may facilitate a query dialog involving queries that successively modify a query state. However, fulfilling such queries in the context of possible query domains, query intents, and contextual meanings of query terms may be difficult. Presented herein are techniques for modifying a query state in view of a query by utilizing a set of query state modifications, each representing a modification of the query state possibly intended by the user while formulating the query (e.g., adding, substituting, or removing query terms; changing the query domain or query intent; and navigating within a hierarchy of saved query states). Upon receiving a query, an embodiment may calculate the probability of the query connoting each query state modification (e.g., using a Bayesian classifier), and parsing the query according to a query state modification having a high probability (e.g., mapping respective query terms to query slots within the current query intent).
    Type: Application
    Filed: March 24, 2017
    Publication date: July 13, 2017
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Dilek Hakkani-Tur, Gokhan Tur, Larry Heck, Ashley Fidler, Fehtiye Asli Celikyilmaz