Customer Experience Management (CXM), Information Management, Social Business
 
 
 

Project Barcelona: Dataflow, Metadata Crawlers for Microsoft Enterprise Products

While there is much research to chronicle how chaotic the management of information is in the enterprise at the moment, there is little enough in way of practical advice, or even tools, to suggest a way to tame unruly content. However, with Project Barcelona, Microsoft aims to provide an information crawler that will go into your network and map your dataflows and metadata.

Information Chaos

Recently, we have seen in the AIIM State of the ECM Industry report that content chaos remains a challenge for most, and we have also seen advice from Forrester on how to build content strategies. But if you don’t know what you have, or, even worse, know what you have but don’t know where to find it, no amount of strategizing is going to help.

This is where this new project comes in, although we probably won’t even have a release roadmap until later in the summer. The concept and project, led by Microsoft researcher Andrew Conrad, appears to be well on the way, and, on the face of it, heading in a direction many enterprises will welcome.

Enterprise as a Web

But let’s start at the beginning. In a recent blog post for the Barcelona Project, Conrad outlined where the idea comes from and how they the team is approaching the problem.

To understand the way they are thinking, they have drawn an analogy between the web, with its vast decentralized topology of sites and services, and an enterprise, where the information that is being used is increasing at an exponential rate.

It is here that Conrad has identified the core of the problem. While we all know that this is happening in the enterprise, on the web, numerous complex tools for navigating and understanding it have developed as the web has developed.

As a result, the information and content that we see on the web is, arguably, under control; under control,  that is,  in comparison to information and content in the enterprise.

And behind it all, one of the tools that gives it order is web crawlers that are constantly crawling around the web, indexing content, without anyone having to understand the technology behind it.

Few, if any, such tools exist for the enterprise, and it is this that Conrad and his jolly band aim to change.

Enterprise Data Explosion

Historically, in the enterprise, this problem has evolved because, in the early days, data was stored, for the most part, in relational databases with the IT department controlling most of the data, simply because the skills required in managing these went beyond those of your average IT worker.

That, however, in recent times, has changed, Conrad argues, so that:

  • Data acquisition and storage is cheap
  • Transforming and moving data is easy and cheap
  • Easy-to-use technologies have meant that personal databases, web portals and business intelligence tools can be built by people with no development experience
  • The proliferation of Excel, Access and SharePoint as enterprise data management tools

While this has led to productivity gains in most enterprise — assuming, again, it is possible to locate data — it has made life more difficult for DBAs (Database Administrators) and ELTs (Extract, Load Transform Administrators), or the people who pull data out of one database and place it into another.

 

Continue reading this article:

 
 
Useful article?
  Email It      

Related Articles:
Tags: , , , , , , , ,
 
 

Most Popular Articles

 

Featured Events  View all | Add event | feed RSS

Who's Hiring?  View all | Post a job | feed RSS


 
Are you hiring?    Post your job today ($45 for 45 days)!