Archive for the ‘Business Intelligence’ Category

SSIS vs Talend

October 22, 2013

Talend vs. SSIS: A Simple Performance Comparison

Database developers often debate which ETL tools on the market are best.  This article compares SSIS to Talend Open Studio with a simple test of each tool's ability to load one delimited flat file into another delimited flat file on the same server.


Shade Under The Cloud

July 20, 2011

More and more companies and consumers are using data-intensive applications driven by cloud technology that may have been previously unavailable due to overall cost and complexity.  Now, even mom and pop shops can store data and deploy complex applications using cloud technology.  They are finding that this is often the best way to provide optimal service, reduce failure points, ensure data integrity, reduce operating costs, and create a more easily managed environment.


‘Denali’ CTP3 Release

July 18, 2011

On Monday, Microsoft announced the third community technology preview (CTP3) release of its next-generation SQL Server database management system, code-named “Denali”.

I am excited to see some of the enhancements that come out with the first release:


Context Data Mining

August 31, 2010

A lot of companies are placing big bets that location services similar to Facebook Places are going to significantly change marketing, advertising and social networking in the coming years.  While some are still skeptical, I think leveraging the data mined from location services (or from a number of other places for that matter) will help in context-aware applications.  Context-aware applications are capable of giving us relevant information based on where we are located and/or the activity that we are performing.  The CNN Money article “Data mining your digital footprints” describes two really cool “context-aware” applications and is worth reading.


BP(K*D) = i

March 25, 2010

Hey readers, here is an excerpt from another blogger, Jeff McQuigg.

The full post can be found here.

I think the most interesting part of this post is how it discerns the difference between BI and DW.  It reminds me of the telescope/microscope analogy:  BI is about making reactive and, more importantly, proactive business decisions.  While some BI may involve analyzing historical trends, the newer the data, the more relevant it is.


Data Quality Part II

February 26, 2010

In Part I of this series, I showed how to create rules around data in a table. This post will expand on the idea of data quality and the need for data quality integration in your systems and applications. Ideally, a good data quality solution will help to reduce the amount of ETL (and associated overhead) required to incorporate data sets between applications, and will help to create data that is both efficient and valuable to your organization.

Parent-Child records in SSIS

December 1, 2009

Converting a File with Parent-Child Records in SSIS

This article talks about converting a file with Parent-Child records using SSIS.  Parent-child files have a header record (parent) followed by an undetermined number of detail records (child).  The transformation extracts non-blank records, creates a key for each record, splits the records by type, and then merges them back together to create a flattened record with a 1-to-1 relationship.
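The steps described above can be sketched outside SSIS in a few lines of Python. This is only an illustration of the idea, not the package from the article: the comma delimiter, the "H" and "D" record-type codes, and the function name `flatten_parent_child` are all assumptions made for the example.

```python
from typing import Iterable, List, Tuple


def flatten_parent_child(lines: Iterable[str]) -> List[Tuple[int, str, str]]:
    """Flatten header/detail records into one row per detail record.

    Assumed layout (hypothetical): each line is "<type>,<payload>",
    where type "H" marks a header (parent) and "D" a detail (child).
    """
    headers = {}   # key -> header payload
    details = []   # (key, detail payload)
    key = 0        # running key; incremented at every header record

    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # extract non-blank records only
        rec_type, _, payload = line.partition(",")
        if rec_type == "H":
            key += 1                       # new parent, new key
            headers[key] = payload
        elif rec_type == "D" and key > 0:
            details.append((key, payload))  # child inherits the parent's key
        # details seen before any header, and unknown types, are skipped

    # Merge the two streams back together on the key
    return [(k, headers[k], d) for k, d in details]


# Hypothetical sample file content
sample = ["H,ORDER1", "D,itemA", "", "D,itemB", "H,ORDER2", "D,itemC"]
for row in flatten_parent_child(sample):
    print(row)
```

Each output row pairs one detail record with its header, which is the flattened 1-to-1 shape the paragraph describes.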

Read the article at:

Data Quality Part I

December 1, 2009

Since the blog is named Data Integrity, I figured I would look up what “data integrity” truly means.  Searching Google for a definition, I found data integrity defined as ‘data that is complete or whole. All characteristics of the data including business rules, rules for how pieces of data relate, dates, definitions and lineage must be correct for data to be complete’.  With that in mind, I am going to start a series of posts on Data Quality and Data Profiling over the coming weeks.


Dynamic SQL using CHARINDEX

November 11, 2009

Recently I was called upon to create a new field from an existing field.  The specifications called for the new field to be composed of any data between two dashes.  The dashes could be at any position and the data between the dashes was not of a consistent length.
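The position-finding logic can be illustrated with a minimal Python sketch. It is not the T-SQL from the post: Python's `str.find` stands in for CHARINDEX, the function name `between_dashes` and the sample values are hypothetical, and the sketch assumes "two dashes" means the first two dashes in the value.

```python
from typing import Optional


def between_dashes(value: str) -> Optional[str]:
    """Return the text between the first two dashes, or None if there are fewer than two."""
    first = value.find("-")              # position of the first dash (-1 if absent)
    if first == -1:
        return None
    second = value.find("-", first + 1)  # search again, starting just past the first dash
    if second == -1:
        return None
    return value[first + 1:second]       # works for any dash positions and any length


# Hypothetical sample values with dashes at different positions
for s in ["AB-1234-XYZ", "LONGPREFIX-7-Q", "no dashes here"]:
    print(s, "->", between_dashes(s))
```

Note that `str.find` is 0-based and returns -1 when nothing is found, while CHARINDEX is 1-based and returns 0, so the offsets differ by one when translating the idea back to SQL.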


Performance Tuning in SSIS

October 2, 2009

Tuning SQL Server 2005 Integration Services for EBCDIC Data Conversion

This article talks about tuning the dataflow for 5 million EBCDIC records at 497 bytes per record.  Impressively, the execution time for the dataflow task was cut almost in half by employing a number of tuning practices. 

Read the article at: