Editor's note: This article is an excerpt from Chapter 8, "Master Data Integration," of IBM InfoSphere: A Platform for Big Data Governance and Process Data Governance (MC Press, 2013).
Written by Sunil Soares
IBM InfoSphere Master Data Management (MDM) systems provide a 360-degree view of customers, vendors, materials, assets, and other entities. Traditional MDM systems collect data from a number of structured sources. With the advent of big data, MDM projects will increasingly look to derive value from the large volumes of entity information hidden within unstructured text, such as social media, email, call center voice transcripts, agent logs, and scanned text. This content might reside in multiple formats, such as plain text, Microsoft Word documents, and Adobe PDF documents, and in different forms of storage, such as content management repositories and file systems. In the following case study, the MDM team at a hypothetical company needs to integrate email with the customer record using IBM InfoSphere Master Data Management and IBM InfoSphere BigInsights text analytics technologies.