Home
Turning Data Into Actionable Assets
  
 
Technology
 
  

XSB Technical White Papers

Thank you for your interest in XSB, Inc. To receive access to our technical white papers, please select the papers you are interested in and complete the registration form below. Once you submit your request, we will forward the white papers you have requested to the email address you provided at registration.

Assigning a Quality Measurement to Matching Records from Heterogeneous Legacy Databases: A Practical Experience
– Dr. L. Robert Pokorny
A method of automatically generating approximate matching records is described and a procedure is presented for assigning scores to these matches. Finally, a statistical quality sampling standard is applied to these match results to generate a quality measure for each score. This methodology can be applied to any data manipulation process the quality needs to be independently verified through statistical sampling.
Ensuring the Consistency of Self-Reported Data: A Case Study
– Dr. Hasan Davulcu, Jennifer Jones, Dr. L. Robert Pokorny, Christopher Rued, Dr. Terrance Swift, Tatyana Vidrevich, Dr. David S. Warren
This paper presents the XSB, Inc. sourcing tool. The tool enhances data quality by classifying data from multiple sources with semantics based on the NAICS taxonomy. Methods of statistical quality control are used to quantify the quality of the data presented.
XcelLog: A Deductive Spreadsheet System
– Dr. C.R. Ramakrishnan, Dr. I.V. Ramakrishnan, Dr. David S. Warren
The promise of rule-based computing was to allow end users to create, modify, and maintain applications without the need to engage programmers. But experience has shown that rule sets often interact in subtle ways, making them difficult to understand and reason about. This has impeded the wide-spread adoption of rule-based computing. This paper describes the design and implementation of XcelLog, a user-centered deductive spreadsheet system, to empower non-programmers to specify and manipulate rule-based systems. The driving idea underlying the system is to treat sets as the fundamental data type and rules as specifying relationships among sets, and use the spreadsheet metaphor to create and view the materialized sets. The fundamental feature that makes XcelLog suitable for non-programmers is that the user mainly sees the effect of the rules; when rules or basic facts change, the user sees the impact of the change immediately. This enables the user to gain confidence in the rules and their modification, and also experiment with what-if scenarios without any programming. Preliminary experience with using XcelLog indicates that it is indeed feasible to put the power of deductive spreadsheets for doing rule-based computing into the hands of end users and do so without the requirement of programming or the constraints of canned application packages.
Part Attribute Discovery for On-Demand Manufacturing
– Dr. Terrance L. Swift
This paper examines how a coherent view has been developed for technical data for the Defense Logistics Agency. Techniques used include natural language parsing, ontology-driven inferences, and mined association rules.
Preference Logic Grammars: Fixed Point Semantics and Application to Data Standardization
– Dr. Terrance L. Swift
The addition of preferences to normal logic programs is a convenient way to represent many aspects of default reasoning. If the derivation of an atom A_1 is preferred to that of an atom A_2, a preference rule can be defined so that A_2 is derived only if A_1 is not. Although such situations can be modeled directly using default negation, it is often easier to define preference rules than it is to add negation to the bodies of rules. For certain grammars, it may be easier to disambiguate parses using preferences than by enforcing disambiguation in the grammar rules themselves. In this paper, we define general fixed-point semantics for preference logic programs based on an embedding into the well-founded semantics, and discuss its features and relation to previous preference logic semantics. We then study how preference logic grammars are used in data standardization, the commercially important process of extracting useful information from poorly structured textual data. This process includes correcting misspellings and truncations that occur in data, extraction of relevant information via parsing, and correcting inconsistencies in the extracted information. The declarativity of Prolog offers natural advantages for data standardization, and a commercial standardizer has been implemented using Prolog. However, we show that the use of preference logic grammars allow construction of a much more powerful and declarative commercial standardizer, and discuss in detail how the use of the non-monotonic construct of preferences leads to improved commercial software.
WEAVE®: An Automated System for Collating Unstructured Data from Web and Legacy Sources to Enhance the MRO Supply Chain
– Dr. L. Robert Pokorny and Harpreet Singh
Gleaning consistent and complete data from multiple sources of unstructured information is often a difficult and time consuming process. In this paper we outline the WEAVE® system which automates the structuring and collating of unstructured data from multiple on-line Websites. WEAVE® is presented in the context of the maintenance, repair, and operations supply chain. The underlying knowledge representation for WEAVE® is an MRO product ontology. This ontology drives classification of product descriptions harvested from Websites and attribute value extraction from the descriptions. The system uses logic programming to manage the ontology driven classification and extraction and the Java 2 Enterprise Edition platform and Open Business Engine workflow engine to continuously harvest and collate data from multiple MRO catalog Websites. It uses this coherent view of MRO data to allow a user to quickly locate and compare MRO products.
Contact Form (* denotes required field)
First Name*:  
Last Name*:  
Email Address*:  
Phone Number:
Company*:  
Title*:  
Address:
City:
State/Province:
Zip/Postal Code:
Country*:  
How did you here about us?