« Googling Data Governance | Main | Acquisitions, Acquisitions everywhere »
July 27, 2007
Data Discovery or Data Relationship Discovery
Check out Mark Smith's blog on Data Discovery.
http://www.intelligententerprise.com/blog/archives/2007/07/data_discovery.html;jsessionid=RUJMN5NWB54LAQSNDLPCKHSCJUNN2JVN
While he calls this Data Discovery, I would refer to it as Data Relationship Discovery. You have to be able to go beyond just analyzing a single data source for information like mean, median, mode, cardinality, selectivity, frequency distributions etc and even just the simple ability to discovery primary foreign key relationships. You have to be able to discover complex business rules between systems. It is the "between" that really counts here. So what is really interesting in a distributed computing world is to not just do data discovery, but to do "Data Relationship Discovery"
Posted by Todd Goldman at July 27, 2007 12:30 PM
