RainStor, a database designed to manage and analyse big data, announced on Tuesday the availability of a new Archive Application for Hadoop 2.0 with its latest release of RainStor 6. The new feature is designed to make it easier for storage administrators to deploy an end-to-end solution on Hadoop for managing and analysing high-value, sensitive business data.

With RainStor’s Archive App, users conduct high performance queries against secure multi-structured data. An Archive is deployed when an organization has rapidly growing data that needs to be retained for ongoing business queries or when governance rules mandate that data be online and fully accessible for specific timeframes. Business users require analytic access to multiple years of history storing raw detailed data in order to derive business value and insights.

The company also said that while RainStor 6 is ideal for running on the Hadoop Distributed File System (HDFS), new capabilities can now take advantage of the latest Hadoop innovations, such as the cluster management technology YARN.

“RainStor integrates with YARN to ensure full co-operation in managing resources across a busy Hadoop cluster,” the company said in a statement. “From a cluster monitoring perspective, RainStor integrates with Apache Ambari. RainStor also provides connectivity through HCatalog, the de facto interface to relational data. These capabilities offer users increased flexibility in selecting the tools that best fit their needs.”

Mark Cusack, chief architect at RainStor, also commented on the announcement: “RainStor has been delivering analytical archive solutions to the world’s largest enterprises for a decade, and with RainStor 6, you can now take advantage of those capabilities running on Hadoop 2.0. We believe RainStor goes a step further to securing Hadoop as a ‘first-class citizen’ in the enterprise.”

New Archive capabilities include:

Analytics Performance Speed-up: Building on RainStor’s existing interactive SQL-on-Hadoop stack, the new archive application features XQuery for hierarchical data and documents, and extends analytics support to SQL 2003. Users benefit from a 10-100X-query boost using native SQL against a mix of structured, semi-structured data and documents in the same cluster. Performance improvements also apply to queries against Hive, Pig and MapReduce. An archive on Hadoop should achieve performance levels on par with the source environment, which is typically a data warehouse.

RainStor Application Management on Hadoop 2.0: RainStor is open, standards-based, and is suited to run on HDFS. Certified on Hortonworks 2.1 and Cloudera Enterprise 5, RainStor integrates with YARN to ensure full co-operation in managing resources across a busy Hadoop cluster. RainStor integrates with Apache Ambari for cluster monitoring and with Hue for managing archive workflows. RainStor also provides connectivity through HCatalog, the de facto interface to relational data. These capabilities offer users increased flexibility in selecting the tools that best fit their needs.

Governance for Greater Control: With the new Archive app, users gain enterprise-grade control of the data in Hadoop, through life-cycle data management features for retention and expiry. With a rules-based workflow users specify a record or groups of records to keep or delete, as they are loaded. Adhering to data governance practices has become a critical requirement and the platform provides control of data, which eliminates manual intervention or lost data.

The RainStor Archive App on Hadoop 2.0 is available beginning July 2014.

Read more here 


Interested in more content like this? Sign up to our newsletter, and you wont miss a thing!

[mc4wp_form]

(Image Credit: Rainstor)

Previous post

IBM Looking to Put Watson to the Test with NY's Startups

Next post

Three Forecasts You Should Know: Big Data, Business Intelligence & Analytics