Brazilianberryacai O Loss T
N
searchA Category osearchhsearchrd Loss fsearche Berry ensearche
i Category
n Weight t
esearchg Category au
a Brazilianberryacai isearchy Brazilianberryacai osearch Loss h Brazilianberryacai Berry osearcht Brazilianberryacai n Category . Loss I Category m Loss nsearchb Brazilianberryacai li
vsearchs Loss h Weight tsearchtsearche Weight cosearchtn Berry Brazilianberryacai n
t
e
dtsearch
a Category e
ou Www e
h Brazilianberryacai s Www t Brazilianberryacai searche Weight tsearchte Category o Berry tsearchgr
n Berry lr Weight lev Berry l Www psearchs Category i Brazilianberryacai l searchn Weight mssearch Category n Weight lu Category esearcha Weight l searchh Category Www oss Loss bsearchehist Www r
csearchl Www dsearchta Www wi Berry hsearchn searchnsearche Berry t Loss r Www r Brazilianberryacai se. H Loss ssearchagu Berry en i Loss Weight ha Www Weight he Category nsearch- Brazilianberryacai ssearchrswi
l Loss m Category nd Www te Weight th Category Www e Category ds searchn
hesearchl Category v
l of data-detail that are not known at the time of building the data warehouse.
COMMON GOALS
Though Mr. Inmon and Mr. Kimball have different philosophies to their approach, they do tend to agree with each other in an indirect manner. Though Inmon’s basis is on a single data warehouse, he stressed on iterative approach and discouraged the “big bang” approach. On the other hand, though Kimball’s philosophy is to quickly create few successful data marts at a time, he stresses on integration for consistency via an Information Bus.
DATA WAREHOUSE vs. BUSINESS INTELLIGENCE
Business Intelligence = Inmon’s Corporate Data Warehouse + Kimball’s Data Marts + Data Mining + Unstructured Data.
ON THE STREET, IN PRACTICE
Over the years, almost every Fortune 500 company has implemented flavors of both the Inmon’s and the Kimball’s philosophies in pursuit of providing “single version of truth” through easily maneuverable analytics. In early 90s, several conferences promoted, numerous magazines recommended, and almost all the large corporations attempted to build Inmon’s centralized data warehouses. These are huge undertakings that needed to brew for several years through hard work and complex designs.
Due to the ease of implementation of Kimball’s philosophy with quicker returns, several mid-size companies initially implemented data marts rather than an enterprise-wide data warehouse. Additionally, there are also reviews stating that all the projects that tried to use Inmon’s approach have failed and none reported to succeed.
However, due to the underestimation of the complexity of the relations between the data marts as well as due to the departmental pressures to develop their respective data marts independently and simultaneously using Kimball’s approach, the database designers have created “silo-ed” data marts. As a result, the essence of Inmon’s centralized data warehouse that integrates departmental data for consistency has been reconsidered and pursued.
INMON, KIMBALL, … AND THE OTHERS
An analysis of Inmon vs. Kimball is not complete without referring to the first published work in 1988 on data warehousing by Barry Devlin and Paul Murphy of IBM Ireland. They coined the less-widely know term “Information Warehousing” which is defined as “A structured environment supporting end users in managing the complete business and supporting Information Systems in ensuring data quality”. Mr. Devlin has finally published his work as a book in 1997 called “Data Warehouse – from Architecture to Implementation”.
THE LESS DISCUSSED DIMENSION: PROCESS
The one dimension that has been ignored, stressed by several experts, is the “process”. For some reason, corporate processes have been treated as if they are not influential factors to the “analytics”.
STILL AN UNCONQUERED SCIENCE
Despite the great efforts from Inmon, Kimball, and the Others, the world of Data Warehousing is still facing great challenges. Even in 2005, after 14 years of Inmon explaining the concept, more than 50% of today’s data warehouse projects are anticipated to fail [Robert Jaques]. In fact, Ted Friedman, a Principal Analyst in Gartner wrote in xxx 2005, “Many enterprises fail to recognize that they have an issue with data quality. They focus only on identifying, extracting, and loading data to the warehouse, but do not take the time to assess the quality.”
Today’s data warehouses suffer from poor quality of the data. Whether or not the poor quality of data existed a decade ago is a questionable hypothesis. In the 90s, the new breed of software products and the ease of implementing data-moving techniques have opened several avenues that resulted in data duplication. As a result, any data inconsistencies in source systems have been remedied by scrubbing and cleansing them on “local copies” of the data sets rather than taking efforts to correct them at the sources.
If Inmon or Kimball had foreseen the wave of software product proliferation in the 90s that implemented duplicated functionality in an organization, they might have stressed on architecting for better quality.
CONCLUSION
Inmon and Kimball have seen the world of accessing enterprise-wide data with different sets of eyes. They both agree that easier access of enterprise data in an accurate and timely manner is the key success factor for creating an integrated solution for corporate information. Additionally, they both agreed that creating independent silos (often misrepresented as Kimball’s data marts) can only solve a set of specialized needs; difficult to support and maintain in long run; and often requires reconciliation and duplicate efforts to migrate into an enterprise-wide effort.
Inmon has definitely foreseen the hurdles and issues with data management through integration. Inmon presented them in a very academic manner that cannot be ignored. Several failures in the market can be attributed due to ignoring what Inmon has warned about. However, Kimball has brought forward a practical approach that corporation love to execute – with a “project” mindset that has definitive budget and time. Inmon’s writings usually tend to generalize a concept with little attention to the technical details. Kimball’s writings try to establish a definitive science that includes implementation techniques with abundant examples.
However, data accuracy is still the monster that needs to be conquered by several organizations.
On the end note, Inmon and Kimball have created approaches that complement each other. It only appears that they contradict each other if one tends to “pick” an approach.
REFERENCES
1. Using the Data Warehouse, W. H. Inmon, Richard D. Hackathorn, Wiley, July, 1994.
2. Managing the Data Warehouse, W.H. Inmon, John Wiley & Sons, December, 1996.
3. The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses, Ralph Kimball, John Wiley & Sons, February, 1996.
4. The Data Warehouse Challenge: Taming Data Chaos, Michael H. Brackett, John Wiley & Sons, July, 1996.
5. Data Warehouse Project Management, Sid Adelman, Larissa T. Moss, Addison-Wesley Professional, December, 2000
6. Building the Data Warehouse (3rd Edition)”, W.H. Inmon, Wiley, March, 2002.
7. Nicholas Galemmo’s Mastering Data Warehouse Design.