If you're considering whether your organization should use cloud-based archiving for some of your documents, take a two-step approach to the decision. First, understand what your archiving requirements are, whether on premises or in the cloud. Then clarify the pros and cons of on-premises versus cloud-based archiving and decide which approach makes sense for your organization right now.
What is an Archive?
An “archive” is a system that at minimum:
- Securely stores documents (here I use “documents” as shorthand for user content, including email, docs, social media and web pages)
- Retains the documents as long as needed
- Purges documents when they are no longer needed for legal, compliance or business purposes
- Provides authorized users (internal and external) with access to the documents for various purposes (e.g. for business processes, customer service, customer or agent self-service, and discovery)
In service of the above requirements, archives typically include deduplication, indexing and some e-discovery capabilities.
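To make the four requirements concrete, here is a minimal sketch of an archive in Python. It is an illustration only, not how any particular product works: the class names, the per-document retention period in days, the `on_legal_hold` flag and the use of a SHA-256 content hash for deduplication are all assumptions made for this example.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import date, timedelta


@dataclass
class ArchivedDocument:
    content: bytes
    archived_on: date
    retain_days: int              # hypothetical retention period
    on_legal_hold: bool = False   # hypothetical flag that blocks purging

    def expires_on(self) -> date:
        return self.archived_on + timedelta(days=self.retain_days)


@dataclass
class Archive:
    authorized_users: set
    docs: dict = field(default_factory=dict)  # content hash -> document

    def store(self, content: bytes, archived_on: date, retain_days: int) -> str:
        """Requirement 1: store the document and return its identifier."""
        doc_id = hashlib.sha256(content).hexdigest()
        # Deduplication: identical content is kept only once.
        self.docs.setdefault(
            doc_id, ArchivedDocument(content, archived_on, retain_days)
        )
        return doc_id

    def retrieve(self, user: str, doc_id: str) -> bytes:
        """Requirement 4: only authorized users get access."""
        if user not in self.authorized_users:
            raise PermissionError(f"{user} is not authorized")
        return self.docs[doc_id].content

    def purge(self, today: date) -> list:
        """Requirements 2 and 3: retain until expiry, then purge."""
        expired = [
            doc_id
            for doc_id, doc in self.docs.items()
            if doc.expires_on() <= today and not doc.on_legal_hold
        ]
        for doc_id in expired:
            del self.docs[doc_id]
        return expired
```

A real archive adds indexing, audit trails and durable storage on top of this, but the four operations stay the same whether the system runs on premises or in the cloud.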
If you look at the way your company and your peers' companies have done archiving in the past, you can see how it has evolved. For unstructured data or content (“documents”), most archiving was historically focused on fixed content like system-generated output (such as statements, EOBs and correspondence). Images were also included.
When email became an enterprise concern due to its volume and risk, it was addressed as a content type requiring archiving. Email archiving has not been an unqualified success — and I tell that story below. More recently — again because of the entailed volume and risk — the chaotic swamp of dynamic documents (like Microsoft Office docs), web content and collaboration content started to be archived, along with other forms of e-communications like instant messages.
Which brings us to today, when most large companies are interested in archiving all of the above, plus transactional, structured data from business systems, plus rich media like audio and video, plus on occasion entire applications.
A Lesson from the History of Email Archiving
It’s important to understand what you want your archive to do, since there are lots of options out there and you need a good fit.
Let me tell you the story of email archiving to put things in perspective. In the early 2000s a lot of vendors from the ECM space tried to move into the email and related archiving space. How hard could email management be? So they tried to use their general ECM capabilities for archiving and add RM capabilities, thus providing more features and functions than the less fancy pure-play archives were offering. But the ECM vendors couldn’t do the basic blocking and tackling for email archiving. They failed at all four of the archive requirements listed above (the numbers in parentheses refer to those requirements):
- They couldn't scale to handle the numbers of users and mailboxes (1)
- They failed to provide reliable, fast access to users who wanted to find and retrieve older emails and attachments (4)
- Some of them “lost” attachments (1, 2 and 4)
- They failed to provide reliable disposition, because users defected and squirrelled away emails, not trusting the enterprise archive to do its advertised job (3)
So many organizations dumped their ECM-based archive approaches and went back to the archive specialists, who could scale and deliver the basics reliably.
Archiving now offers many more options than it did 12 years ago. You can archive everything from social media chats to web pages to movies to old-fashioned email and mainframe print streams. You can use the archive for compliance, for active use in complex and demanding business processes, for beyond-the-firewall customer access and participation, and for rigorous e-discovery.
These are all very different scenarios with different requirements. And — in a nod to this article’s focus — you can do it in house or via the cloud. So you have to be clear about what you want the archive for.
What Should Your Archive Do?
Start with these key general requirements for archiving. You will weight these according to your situation, and will probably add more specialized requirements, such as compliance supervision (e.g. for financial services), advanced e-discovery, or a focus on particular content types (IM, GroupWise, video, web page archiving, Salesforce.com). The most important high-level requirements for enterprise document archiving are:
- Scalability and Performance
- Accessibility and Availability
- Security and Protection
- Retention and Integrity
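One simple way to weight these requirements is a weighted score per candidate approach. The sketch below is only an illustration of the arithmetic: the weights, the 1 to 5 scoring scale and the `weighted_score` helper are hypothetical, and the placeholder scores say nothing about how on-premises or cloud archives actually compare.

```python
# Hypothetical weights (they sum to 1.0); set them to match your situation.
WEIGHTS = {
    "Scalability and Performance": 0.3,
    "Accessibility and Availability": 0.2,
    "Security and Protection": 0.3,
    "Retention and Integrity": 0.2,
}


def weighted_score(scores: dict) -> float:
    """Combine per-requirement scores (assumed 1 to 5) into one number."""
    missing = WEIGHTS.keys() - scores.keys()
    if missing:
        raise ValueError(f"missing scores for: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


# Placeholder scores for two unnamed options; replace with your own assessment.
option_a = {
    "Scalability and Performance": 4,
    "Accessibility and Availability": 3,
    "Security and Protection": 5,
    "Retention and Integrity": 4,
}
option_b = {
    "Scalability and Performance": 5,
    "Accessibility and Availability": 4,
    "Security and Protection": 3,
    "Retention and Integrity": 4,
}

for name, scores in (("option A", option_a), ("option B", option_b)):
    print(name, round(weighted_score(scores), 2))
```

Any specialized requirements you add, such as compliance supervision, become extra entries in the weights, with the existing weights reduced so the total stays at 1.0.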
Let’s address each briefly in turn.