Data Quality
Aug 03 2010
Three Tips for Better Data Definitions
If business or IT users insist that their definition is good and everyone knows what they mean when in fact that is not the case, the strategies below may help.
1. Provide examples of unclear
vs. clear definitions
Users who are intimately familiar with their business process and
supporting systems may not understand the point of specifying exactly what they need. To them "the ID of the customer" is a
perfectly acceptable
definition of "Customer ID." Or, the IT representative may give a
definition that works for them but no one else, such as "the primary key
of the customer table". It will help both to see examples of what is
needed in order to have a workable definition to support data warehouse
population and use of the data.
Apr 12 2010
Defaulting data integration to customers = risky business
Here’s a little-recognized fact about data integration: if you run a business or any sizable chunk of one, someone is integrating your data.
In my professional life I have, on occasion, suggested data integration efforts. Sometimes my suggestions have been accepted and sometimes not. As an IT professional I understand that different managers have different priorities, and in a given business situation sometimes other things may be more important than, for example, having a single, consistent source for all customer records, or making sure production data matches financial data.
But as a customer? That’s different.
Feb 15 2010
Problem:
Our team needed to create a data set listing retail stores belonging to different territories for performance tests that involved store selection. The data set was to be used by virtual users signing-in as different territory owners. The challenge was that the territory and store numbers in the data set should be as random as possible. For example, if there are 20 territories and each territory has 10 stores, the data set should contain the 20 territories with their first stores, followed by the same 20 territories with their second stores and so on.
Jan 28 2010
Garbage in the Lockers and Gold on the Streets
How often do we find the currency and gold lying unprotected in the office cubes or corridors? How often do we find piles of garbage in and around the office buildings? Even if we do find them occasionally, how often do we find the gist of it getting summarized, packaged and sent to most of the senior managers, along with many other goodies undetected?
Jan 02 2010
On DW federation, whac-a-mole, and integrating business data
Information Management recently sent around their pick of best IM blog articles of 2009. Among them was Forrester’s James Kobelius’s reaction to Bill Inmon’s “incineration of a straw man concept that he refers to as ‘virtual data warehousing (DW).’”
Nov 17 2009
Testing Challenges in Large Scale OBIEE Projects: Lessons Learned and Best Practices
Testing in large scale Business Intelligence (BI) projects face challenges in data quality assurance, metrics / aggregation rules verification, source to target mapping accuracy, the test cases to the requirements traceability, and anomalies in the dimensions to facts relationships.
Oracle Business Intelligence Enterprise Edition (OBIEE) is a BI tool that addresses quite a few of these challenges by the nature of the product's growth strategy. As Gartner puts it, "70 functional and industry-specific packaged BI applications built on the Oracle BI Enterprise Edition Platform attests to Oracle's understanding of how to leverage the market interest in domain-specific and prepackaged solutions as a growth driver for its platform." Customers who buy OBIEE typically also buy a relevant packaged solution.
May 23 2009
It is a commonplace to say we should manage data like a resource. But when you think about it, data is an asset but not a resource. Data isn’t a thing like real estate, employees, or customers, but rather it represents all of those things. In data-geek-speak, data is a meta-resource that holds information about resources. That makes data a lot like money.