Chief Analyst: Katy Ring, Research Director, IT Services

At 451 Research, we believe an 'enterprise data bazaar' can help organizations that aim to become more agile by using data to inform business direction and development. We use the term to describe an environment in which many people can access and leverage enterprise data to build data-driven products. To achieve this, enterprises need a unified data management layer so that data scientists and subject matter experts can decide what to do with the data that is stored. Such a layer allows datasets (or data lakes) to provide value without siloing information within the organization.

However, many organizations end up in what can be described as a 'data swamp' – a single environment that houses a large amount of raw data that cannot be easily accessed for any purpose, let alone multiple purposes. Creating a data bazaar with these management layers offers a way out of this predicament by building in data governance and self-service data preparation capabilities, with security at the foundation of the approach.

When we speak with clients that have data lakes, many realize they do not fully understand the risks associated with what they have built. Since each source system has different governance and security policies, it is difficult for companies to audit their lakes as part of compliance measures. This struggle also stems from the self-service nature of data lakes, where data can be used for almost any purpose, which makes it unclear whether a company is protecting personally identifiable information (PII) as required by regulations such as GDPR.

When companies are in this situation, vendors and service providers are helping to open up an internal role for the chief data officer (CDO), who can help the business get back on track. Together, they can work out a remedy for the situation. One solution is to build a 'sandbox' environment that combines company-wide policy, controls and metadata management with a 'citizen' data integrator tool, which lets users give feedback on, or develop analytics about, how they are using the data. With this type of tool, users can still access data in a self-service way, while that access is overseen by the IT group or CDO before it moves to production as a data product.

In addition to this self-service 'sandbox' data preparation layer, IT service providers can help companies with data governance and the data supply chain. Such providers assist in sourcing, managing and enriching the data, and sell managed services for policing data consumption. For example, in an audit, organizations need to know what data they hold, who uses it and what for. Regulatory requirements of this kind, such as GDPR, provide a strong opportunity for developing the enterprise data bazaar.

Furthermore, the self-service analytics and governance layers need to be architected the right way to enable a range of use cases over time, and this is often not what results from the development of a single-use-case project. This is why the CDO role is so important: this individual is the internal champion with the authority to get agreement on a company-wide strategy for the capture, management and sharing of data.

Katy Ring, research director of IT services at 451 Research, examines the benefits of enterprise data bazaars, and the technologies, service providers and strategies used to enable them, in her Technology and Business Impact report on the Enterprise Data Bazaar.