So you have a ton of information residing in one or more content repositories and you are having a heck of a time finding what you need to get your work done. Guess that Document Management system you invested in just isn't cutting it for you. Here's a product that will help you classify and categorize all that information -- and it supports over 30 languages to boot -- SAS Content Categorization.
Classifying Your Information - All of It
Now one would think if you had a ton of information floating around, that you would have a document management system of some kind to help store and categorize it. But there's also a lot of information stored in blogs and wikis and other systems not normally supported by a document management system.
So to help you organize, classify and categorize all that information in one central location, you could use a product like SAS Content Categorization.
SAS Content Categorization helps you parse and analyze your content for entities and events and can create metadata that will trigger business processes. It can parse multiple repositories providing you a single view of your information regardless of where it resides in your organization. It does this using natural language processing and advanced linguistics techniques.
Features of SAS Content Categorization
Supporting a number of features required to set up taxonomies, classification schemes and entity definitions, this solution can support both automated and manual processes.
- Taxonomy Creation: You get a number of pre-built taxonomies out-of-the-box or you can create your own taxonomy with unlimited nodes that are either hierarchical or flat. There are a number of services offered to help you get the work done.
- Category Classification: You can create simple or complex category rules and definitions to help categorize both structured and unstructured information; utilize client APIs to automatically classify incoming text.
- Entity Extraction: Define a few specific entities to categorization information. These can be grammar-based, dictionary based or regular-expression based and can be automated.
- Multi-Lingual: With support for more than 30 languages, language tools include part-of-speech recognition and tagging, stemming and segmentation for Asian languages.
- Collaboration: Have multiple people work on a categorization project, with permissions ranging from read and write to category rules and concept definitions.
SAS Content Categorization is based on the Teragram technology but is a standalone solution. It runs on just about any Windows client and any server platform.
We aren't really sure how this solution would work if you had a document management or enterprise content management solution in house. It appears to be a solution that would standalone and just connect to external repositories, whatever they are. But don't be fooled by thinking it's all done with the snap of the fingers, there is still a ton of pre-configuration that needs to be done.
Learn more about SAS Content Categorization from their website.